Index DBnomics data with Apache Solr for full-text and faceted search
Project description
DBnomics Solr
Index DBnomics data into Apache Solr for full-text and faceted search.
Requirements:
- a running instance of Apache Solr; at the time this documentation is written, we use the version 7.3.
See dbnomics-docker to run a local DBnomics instance with Docker that includes a service for Apache Solr.
Configuration
Environment variables:
DEBUG_PYSOLR
: display pysolr DEBUG logging messages (cf https://github.com/django-haystack/pysolr)
Index a provider
Replace wto
by the real provider slug in the following command:
dbnomics-solr index-provider /path/to/wto-json-data
Full mode vs incremental mode
When data is stored in a regular directory, the script always indexes all datasets and series of a provider. This is called full mode.
When data is stored in a Git repository, the script runs by default in incremental mode: it indexes only the datasets modified since the last indexation.
It is possible to force the full mode with the --full
option.
Bare repositories
The script has an option --bare-repo-fallback
which tries to add .git
at the end of the storage dir name, if not found.
Remove all data from a provider
To remove all the documents related to a provider (type:provider
, type:dataset
and type:series
):
dbnomics-solr --debug delete-provider --code <provider_code>
dbnomics-solr --debug delete-provider --slug <provider_slug>
# Examples:
dbnomics-solr --debug delete-provider --code WTO
dbnomics-solr --debug delete-provider --slug wto
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for dbnomics_solr-1.1.7-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | e8490d01f471237097adf128c1a02a208ac27c0aac2ed586ee903a92f66f397d |
|
MD5 | b715626dddda6e66628cef768a2e490c |
|
BLAKE2b-256 | 831c91786ca7c96ab059a9ab159ee5326cb40442312ad78d5125e39b2dd2335b |