A library for searching different kinds of files

Project description


SimpleSearch lets you index and search your documents. It was designed to manipulate different types of documents transparently.


I developed simplesearch out of curiosity, so don't run it in production. You may still find the code helpful, as it is well documented.


You can install simplesearch using pip

$ pip install simplesearch

or install it from source

$ git clone
$ cd simplesearch
$ python3 install


Most users will only need two functions: add_file_to_index() and search_keywords(). The first builds the index of your documents, and the second searches that index using a list of keywords.
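
To index a whole folder, one option is a small helper that walks the directory and hands each file to the indexing function. This is only a sketch: index_directory() and its parameters are hypothetical names, not part of simplesearch, and it assumes that add_file_to_index() accepts a file path given as a string.

```python
from pathlib import Path


def index_directory(directory, add_file, pattern="*.pdf"):
    """Call add_file(path) for every file in `directory` matching `pattern`.

    `add_file` is any callable taking a path string. Passing
    simplesearch.add_file_to_index here is an assumption about its signature.
    Returns the list of paths that were handed to `add_file`.
    """
    indexed = []
    # sorted() keeps the indexing order deterministic across runs
    for path in sorted(Path(directory).glob(pattern)):
        if path.is_file():
            add_file(str(path))
            indexed.append(str(path))
    return indexed


# Hypothetical usage, assuming simplesearch is installed:
#   import simplesearch
#   index_directory("/home/youben", simplesearch.add_file_to_index)
```

Passing the indexing function as an argument keeps the helper independent of simplesearch, so it can be tried with a stub such as a list's append method.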


Keep in mind that add_file_to_index() creates an sqlite3 database file (.simplesearch.db) in your current directory, and searches read from that same file. Running simplesearch from another directory therefore creates a separate index and returns different results.
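
Because the index is tied to the working directory, it can help to check whether a directory already holds an index before searching from it. The sketch below only looks at the file system. The file name comes from the paragraph above, while index_path() and has_index() are hypothetical helpers that simplesearch does not provide.

```python
from pathlib import Path

DB_FILENAME = ".simplesearch.db"  # file name as stated in the text above


def index_path(directory=None):
    """Return where the index file would live for `directory`.

    Defaults to the current working directory.
    """
    base = Path(directory) if directory is not None else Path.cwd()
    return base / DB_FILENAME


def has_index(directory=None):
    """Return True if an index file already exists in `directory`."""
    return index_path(directory).is_file()


if __name__ == "__main__":
    if has_index():
        print("Using existing index:", index_path())
    else:
        print("No index here yet; one would be created at", index_path())
```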


Below is a code snippet that indexes some local files and then runs a few searches. It uses PDFs because they were the only supported document type when this example was written.

import simplesearch

# We assume that this file contains words like
# programming python indexing
simplesearch.add_file_to_index("/home/youben/simplesearch.pdf")

# We assume that this file contains words like
# machine-learning deep-learning python
simplesearch.add_file_to_index("/home/youben/ml.pdf")

# Both files have been indexed now, we can do some search operations

# Search for a specific keyword found only in the second indexed document
simplesearch.search_keywords(["machine-learning"])
['/home/youben/ml.pdf']

# We now do a search on a common keyword for both docs
simplesearch.search_keywords(["python"])
['/home/youben/simplesearch.pdf', '/home/youben/ml.pdf']

# We can also use multiple keywords
simplesearch.search_keywords(["python", "machine-learning"])
['/home/youben/ml.pdf', '/home/youben/simplesearch.pdf']

# The last result was sorted by best match, as the first document matches two keywords
# while the second matches only one
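
The "best match" ordering described above can be illustrated with a keyword-to-documents mapping and a per-document hit count. This is a minimal sketch of that idea, not simplesearch's actual implementation: rank_documents() and the in-memory index dictionary are hypothetical, and the real library stores its index in sqlite3.

```python
from collections import Counter


def rank_documents(index, keywords):
    """Rank documents by how many distinct keywords they match.

    `index` maps each keyword to the set of document paths containing it.
    Documents matching more keywords come first; ties are broken by path
    so that the order is deterministic.
    """
    hits = Counter()
    # dict.fromkeys() drops repeated keywords while keeping their order
    for keyword in dict.fromkeys(keywords):
        for doc in index.get(keyword, ()):
            hits[doc] += 1
    return sorted(hits, key=lambda doc: (-hits[doc], doc))


# Toy index mirroring the two documents from the example above
index = {
    "programming": {"/home/youben/simplesearch.pdf"},
    "indexing": {"/home/youben/simplesearch.pdf"},
    "python": {"/home/youben/simplesearch.pdf", "/home/youben/ml.pdf"},
    "machine-learning": {"/home/youben/ml.pdf"},
    "deep-learning": {"/home/youben/ml.pdf"},
}

print(rank_documents(index, ["python", "machine-learning"]))
# ['/home/youben/ml.pdf', '/home/youben/simplesearch.pdf']
```

Here ml.pdf matches both keywords and simplesearch.pdf matches only one, so ml.pdf is listed first.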
