Index DBnomics data with Apache Solr for full-text and faceted search
Project description
DBnomics Solr
Index DBnomics data into Apache Solr for full-text and faceted search.
Requirements:
- a running instance of Apache Solr; at the time this documentation is written, we use the version 7.3.
See dbnomics-docker to run a local DBnomics instance with Docker that includes a service for Apache Solr.
Configuration
Environment variables:
DEBUG_PYSOLR: display pysolr DEBUG logging messages (cf https://github.com/django-haystack/pysolr)
Index a provider
Replace wto by the real provider slug in the following command:
dbnomics-solr index-provider /path/to/wto-json-data
Full mode vs incremental mode
When data is stored in a regular directory, the script always indexes all datasets and series of a provider. This is called full mode.
When data is stored in a Git repository, the script runs by default in incremental mode: it indexes only the datasets modified since the last indexation.
It is possible to force the full mode with the --full option.
Bare repositories
The script has an option --bare-repo-fallback which tries to add .git at the end of the storage dir name, if not found.
Remove all data from a provider
To remove all the documents related to a provider (type:provider, type:dataset and type:series):
dbnomics-solr --debug delete-provider --code <provider_code>
dbnomics-solr --debug delete-provider --slug <provider_slug>
# Examples:
dbnomics-solr --debug delete-provider --code WTO
dbnomics-solr --debug delete-provider --slug wto
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file dbnomics_solr-1.1.19.tar.gz.
File metadata
- Download URL: dbnomics_solr-1.1.19.tar.gz
- Upload date:
- Size: 51.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
4c659cb720d5787dc0df71d277052e79d675b3219bf25bb52d893aacb2a455e7
|
|
| MD5 |
be518ebf2a7408ea30d7a08f4f5ca7f7
|
|
| BLAKE2b-256 |
491813e5c0494273c54241435b8f9dcaa11d3912ab79fe0ac454627b09b532c0
|
File details
Details for the file dbnomics_solr-1.1.19-py3-none-any.whl.
File metadata
- Download URL: dbnomics_solr-1.1.19-py3-none-any.whl
- Upload date:
- Size: 45.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e94c606d1f4124a1d658a52cbb82db9eaacbd294468d12f2f75d356d60d85008
|
|
| MD5 |
8024dd862da2286593437c31ed9d843f
|
|
| BLAKE2b-256 |
be206a53972a61994001271cad4c31d2d8074b6c07370fd0f51321e94d0cac5f
|