Skip to main content

Selenium Open Source Search Engine

Project description

SOSSE 🦦

SOSSE (Selenium Open Source Search Engine) is a Web archiving software, crawler and search engine written in Python, distributed under the GNU-AGPLv3 license. It is hosted on both Gitlab and Github site, please use any of them to open feature requests, bug report or merge requests, or open a discussion.

SOSSE main features are:

  • 🌍 Browser based crawling: SOSSE uses Mozilla Firefox, or Google Chromium and Selenium to index pages that use Javascript. Requests can also be used for faster crawling
  • 📚 Offline browsing: SOSSE can save HTML copy or take screenshots of crawled pages to create archives suitable for offline browsing
  • 📉 Low resources requirements: SOSSE is entirely written in Python and uses PostgreSQL for data storage
  • 🔓 Authentication: the crawlers can submit authentication forms with provided credentials
  • 🔗 Search engines shortcuts: shortcuts search queries can be used to redirect to external search engines (sometime called "bang" searches)
  • 🔖 Search history: users can authenticate to log their search query history privately

See the documentation and screenshots.

Try it out

You can try the latest version with Docker:

docker run -p 8005:80 biolds/sosse:latest

Open http://127.0.0.1:8005/, and log in with user admin, password admin.

To persist Docker data, or find alternative installation methods, please check the documentation.

Keep in touch

Join the Discord server to get help and share ideas!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sosse-1.10.1.tar.gz (3.4 MB view details)

Uploaded Source

Built Distribution

sosse-1.10.1-py3-none-any.whl (3.4 MB view details)

Uploaded Python 3

File details

Details for the file sosse-1.10.1.tar.gz.

File metadata

  • Download URL: sosse-1.10.1.tar.gz
  • Upload date:
  • Size: 3.4 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.2

File hashes

Hashes for sosse-1.10.1.tar.gz
Algorithm Hash digest
SHA256 2109963793c30ac981a1f8d03c674b02a0a84919c82e9db3c6765a64165051b0
MD5 9562aaf75f893eb2abe70c3b44a6d666
BLAKE2b-256 7049e6f206183a338b4c42df895442c80acebdaf55499b112873f99b6b37027c

See more details on using hashes here.

File details

Details for the file sosse-1.10.1-py3-none-any.whl.

File metadata

  • Download URL: sosse-1.10.1-py3-none-any.whl
  • Upload date:
  • Size: 3.4 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.2

File hashes

Hashes for sosse-1.10.1-py3-none-any.whl
Algorithm Hash digest
SHA256 f0d154d91d20c2b43a4aed175aa7dabca56cf4b54ab22f37b1151a41a634a50c
MD5 c47970347821294134865691558d3f03
BLAKE2b-256 869280b34e8e03170320f840667d22978cc35f182445c92c347fc2986d0330c4

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page