Selenium Open Source Search Engine
SOSSE 🦦
SOSSE (Selenium Open Source Search Engine) is a search engine and crawler written in Python, distributed under the GNU AGPLv3 license. It is hosted on both GitLab and GitHub; use either site to open feature requests, bug reports, or merge requests, or to start a discussion.
SOSSE's main features are:
- 🌍 Browser-based crawling: the crawler can use Google Chromium and Selenium to index pages that use JavaScript. The Requests library can also be used for faster crawling
- 🏖 Low resource requirements: SOSSE is entirely written in Python and uses PostgreSQL for data storage
- 🖼 Offline cache: SOSSE can take screenshots of crawled pages and make them browsable offline
- 🔓 Authentication: the crawlers can submit authentication forms with provided credentials
- 🔗 Search engine shortcuts: shortcut search queries can be used to redirect to external search engines (sometimes called "bang" searches)
- 🔖 Search history: users can authenticate to log their search query history privately
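To illustrate the idea behind "bang" shortcuts from the list above, here is a minimal sketch of how a shortcut query could be turned into a redirect URL. It is not SOSSE's implementation: the `SHORTCUTS` table, the `resolve_shortcut` function, and the `!` prefix are all assumptions made for this example.

```python
from urllib.parse import quote_plus

# Hypothetical shortcut table for illustration only; it does not reflect
# the shortcuts actually configured in a SOSSE instance.
SHORTCUTS = {
    "w": "https://en.wikipedia.org/w/index.php?search={query}",
    "gh": "https://github.com/search?q={query}",
}


def resolve_shortcut(raw_query, prefix="!"):
    """Return a redirect URL if the query contains a known shortcut, else None."""
    words = raw_query.split()
    for i, word in enumerate(words):
        name = word[len(prefix):]
        if word.startswith(prefix) and name in SHORTCUTS:
            # Everything except the shortcut word becomes the external query.
            rest = " ".join(words[:i] + words[i + 1:])
            return SHORTCUTS[name].format(query=quote_plus(rest))
    # No shortcut found: the caller would run a normal local search.
    return None
```

With this sketch, a query such as `!w sea otter` would be redirected to the external engine, while a query without a known shortcut would return `None` and be handled as a regular search.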
See the documentation and screenshots.
Try it out
You can try the latest version with Docker:
docker run -p 8005:80 biolds/sosse:latest
The default user is admin, with password admin.
You can also find alternative installation methods in the documentation.
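After starting the container with the Docker command above, it can take a moment before the web interface accepts connections. The following sketch polls the published port (8005, taken from the `-p 8005:80` mapping) until it opens. The `wait_for_port` helper and its defaults are assumptions for this example and are not part of SOSSE.

```python
import socket
import time


def wait_for_port(host="localhost", port=8005, timeout=30.0, interval=0.5):
    """Poll a TCP port until it accepts a connection or the timeout expires.

    Returns True if a connection succeeded, False otherwise.
    """
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        try:
            # A successful connect means something is listening on the port.
            with socket.create_connection((host, port), timeout=interval):
                return True
        except OSError:
            # Connection refused or timed out: wait and try again.
            time.sleep(interval)
    return False
```

Calling `wait_for_port()` with its defaults returns `True` once something is listening on `localhost:8005`. It only checks that the port is open, not that SOSSE has finished initializing, so the login page at http://localhost:8005/ may still need a few more seconds.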
Keep in touch
Join the Discord server to get help and share ideas!
Download files
Source Distribution: sosse-1.0.0.tar.gz (143.7 kB)
Built Distribution: sosse-1.0.0-py3-none-any.whl (148.9 kB)