Selenium Open Source Search Engine
Project description
SOSSE 🦦
SOSSE (Selenium Open Source Search Engine) is a Web archiving software, crawler and search engine written in Python, distributed under the GNU-AGPLv3 license. It is hosted on both Gitlab and Github site, please use any of them to open feature requests, bug report or merge requests, or open a discussion.
SOSSE main features are:
- 🌍 Browser based crawling: SOSSE uses Mozilla Firefox, or Google Chromium and Selenium to index pages that use Javascript. Requests can also be used for faster crawling
- 📚 Offline browsing: SOSSE can save HTML copy or take screenshots of crawled pages to create archives suitable for offline browsing
- 📉 Low resources requirements: SOSSE is entirely written in Python and uses PostgreSQL for data storage
- 🔓 Authentication: the crawlers can submit authentication forms with provided credentials
- 🔗 Search engines shortcuts: shortcuts search queries can be used to redirect to external search engines (sometime called "bang" searches)
- 🔖 Search history: users can authenticate to log their search query history privately
See the documentation and screenshots.
Try it out
You can try the latest version with Docker:
docker run -p 8005:80 biolds/sosse:latest
Open http://127.0.0.1:8005/, and log in with user admin
, password admin
.
To persist Docker data, or find alternative installation methods, please check the documentation.
Keep in touch
Join the Discord server to get help and share ideas!
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file sosse-1.10.1.tar.gz
.
File metadata
- Download URL: sosse-1.10.1.tar.gz
- Upload date:
- Size: 3.4 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.11.2
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2109963793c30ac981a1f8d03c674b02a0a84919c82e9db3c6765a64165051b0 |
|
MD5 | 9562aaf75f893eb2abe70c3b44a6d666 |
|
BLAKE2b-256 | 7049e6f206183a338b4c42df895442c80acebdaf55499b112873f99b6b37027c |
File details
Details for the file sosse-1.10.1-py3-none-any.whl
.
File metadata
- Download URL: sosse-1.10.1-py3-none-any.whl
- Upload date:
- Size: 3.4 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.11.2
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | f0d154d91d20c2b43a4aed175aa7dabca56cf4b54ab22f37b1151a41a634a50c |
|
MD5 | c47970347821294134865691558d3f03 |
|
BLAKE2b-256 | 869280b34e8e03170320f840667d22978cc35f182445c92c347fc2986d0330c4 |