Backend for Karp

Development Status
- 5 - Production/Stable
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Operating System
- POSIX
- Unix
Programming Language
Topic
- Utilities

This project has been archived.

The maintainers of this project have marked this project as archived. No new releases are expected.

Project description

karp-backend-5

This package is the legacy version of Karp. The current version is at https://github.com/spraakbanken/karp-backend

master

Karp is the lexical platform of Språkbanken. Now migrated to Python 3.6+.

Karp in Docker

For easy testing, use Docker to run Karp-b.

Follow the steps given here
Run docker-compose up -d
Test it by running curl localhost:8081/app/test
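
The curl check in the last step can also be scripted. The following is a minimal sketch, assuming the Docker setup exposes Karp on localhost:8081 as in the command above. The function name karp_is_up is hypothetical and not part of Karp.

```python
import urllib.error
import urllib.request


def karp_is_up(base_url="http://localhost:8081", timeout=5.0):
    """Return True if GET <base_url>/app/test answers with HTTP 200."""
    url = base_url.rstrip("/") + "/app/test"
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return response.status == 200
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, or an HTTP error status.
        return False
```

Calling karp_is_up() right after docker-compose up -d may return False until the containers have finished starting, so a retry loop around it can be useful.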

If you want to use Karp without Docker, keep on reading.

Prerequisites

Elasticsearch 6
SQL, preferably MariaDB
a WSGI server, for example mod_wsgi with Apache, Waitress, Gunicorn or uWSGI
an authentication server. Read more about this here
Python >= 3.6 with pip

Installation

Karp uses virtual environments for Python. To get running, either run make install, or do the steps manually:

1. Create the virtual environment using python3 -m venv venv.
2. Activate the virtual environment with source venv/bin/activate.
3. Install the dependencies with pip install -r requirements.txt.

Configuration

Set the environment variables KARP5_INSTANCE_PATH and KARP5_ELASTICSEARCH_URL, either:

using export VAR=value
or creating a file .env in the root of your cloned path with VAR=value

The variables are:

KARP5_INSTANCE_PATH - the path where your configs are. If you have cloned this repo you can use /path/to/karp-backend/.
KARP5_ELASTICSEARCH_URL - the URL to Elasticsearch. Typically localhost:9200.
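
To illustrate how the two ways of setting the variables fit together, here is a minimal sketch that reads VAR=value lines from .env content and combines them with the process environment. It is not Karp's own loading code. The function names and the rule that exported variables override the .env file are assumptions made for this example.

```python
import os

REQUIRED = ("KARP5_INSTANCE_PATH", "KARP5_ELASTICSEARCH_URL")


def parse_env_file(text):
    """Parse VAR=value lines; blank lines and # comments are skipped."""
    values = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        name, _, value = line.partition("=")
        values[name.strip()] = value.strip()
    return values


def karp_settings(env_text="", environ=None):
    """Merge .env content with the environment (assumption: environment wins)."""
    environ = os.environ if environ is None else environ
    merged = parse_env_file(env_text)
    for name in REQUIRED:
        if name in environ:
            merged[name] = environ[name]
    missing = [name for name in REQUIRED if name not in merged]
    if missing:
        raise KeyError("missing settings: " + ", ".join(missing))
    return merged
```

For example, karp_settings(open(".env").read()) would raise a KeyError naming any of the two variables that is set neither in the file nor in the environment.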

Copy config.json.example to config.json and make your changes. You will also need to make configurations for your lexicons. Read more here.

Tests

TODO: add more tests. Run the tests by typing: make test

Test that karp-backend is working by starting it with make run or python run.py

Known bugs

Counts from the statistics call may not be accurate when performing subaggregations (multiple buckets) on big indices unless the query restricts the search space. Using breadth_first mode does not (always) help.

Possible workarounds:

Use composite aggregation instead, but this does not work with filtering.
Set a bigger shard_size (27 000 works for saldo), but this might break your ES cluster.
Have smaller indices (one lexicon per index), but this does not help for big lexicons or statistics over many lexicons.
Don't allow deeper subaggregations than 2. Changing the size won't help.
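
As an illustration of the shard_size and depth workarounds, the sketch below builds the body of an Elasticsearch search request with nested terms aggregations. It is not taken from Karp's code. The function name, the aggregation names level_0 and level_1, the field names in the usage note, and the reading of "deeper than 2" as "more than two nested terms levels" are all assumptions. shard_size and collect_mode are standard options of the Elasticsearch terms aggregation.

```python
def nested_terms_aggregation(fields, size=10, shard_size=None,
                             breadth_first=False, max_depth=2):
    """Build a search body with one terms aggregation nested per field."""
    if not fields:
        raise ValueError("at least one field is required")
    if len(fields) > max_depth:
        raise ValueError("at most %d aggregation levels are allowed" % max_depth)

    aggs = None
    # Build from the innermost level outwards.
    for depth in reversed(range(len(fields))):
        terms = {"field": fields[depth], "size": size}
        if shard_size is not None:
            terms["shard_size"] = shard_size
        if breadth_first:
            terms["collect_mode"] = "breadth_first"
        level = {"terms": terms}
        if aggs is not None:
            level["aggs"] = aggs
        aggs = {"level_%d" % depth: level}

    # "size": 0 skips the hits so that only aggregation results are returned.
    return {"size": 0, "aggs": aggs}
```

A call such as nested_terms_aggregation(["lexiconName", "pos"], shard_size=27000), with hypothetical field names, produces a two-level body. As noted above, a large shard_size puts more load on the cluster.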

Elasticsearch

If saving stops working because of Database Exception: Error during update. Message: TransportError(403, u'cluster_block_exception', u'blocked by: [FORBIDDEN/12/index read-only / allow delete (api)];')., you need to unlock the relevant ES index.

This is how you do it:

Repeat for every relevant combination of host and port. You only need to do it once per cluster.

Check if any index is locked: curl <host>:<port>/_all/_settings/index.blocks*
- If all are open, Elasticsearch answers with {}
- Otherwise it answers with {<index>: { "settings": { "index": { "blocks": {"read_only_allow_delete": "true"} } } }, ... }
To unlock all locked indices on a host and port:
- curl -X PUT <host>:<port>/_all/_settings -H 'Content-Type: application/json' -d '{"index.blocks.read_only_allow_delete": null}'
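
The same check can be done programmatically. The following sketch, under the assumption that the settings response has the shape shown above, extracts the names of the locked indices and prepares the unlock request with a JSON content type. The function names are hypothetical, and actually sending the request is left to the HTTP client of your choice.

```python
import json


def locked_indices(settings_response):
    """Return the names of indices whose read_only_allow_delete block is set."""
    locked = []
    for index, body in settings_response.items():
        blocks = body.get("settings", {}).get("index", {}).get("blocks", {})
        flag = str(blocks.get("read_only_allow_delete", "false")).lower()
        if flag == "true":
            locked.append(index)
    return sorted(locked)


def unlock_request(host, port):
    """Return (url, headers, body) for the PUT request that clears the block."""
    url = "http://%s:%s/_all/_settings" % (host, port)
    headers = {"Content-Type": "application/json"}
    # None is serialized as JSON null, which resets the setting.
    body = json.dumps({"index.blocks.read_only_allow_delete": None})
    return url, headers, body
```

An empty response ({}) yields an empty list, matching the "all open" case above.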

Release history

5.29.0 - Feb 19, 2024 (this version)
5.28.0 - Feb 19, 2024
5.27.5 - Jun 7, 2023
5.26.4 - Jan 12, 2022
5.26.2 - May 7, 2020
5.26.1 - May 7, 2020
5.25.0 - Apr 23, 2020
5.24.11 - Apr 22, 2020
5.24.10 - Apr 22, 2020
5.24.9 - Apr 22, 2020
5.24.8 - Mar 5, 2020
5.24.6 - Mar 5, 2020
5.24.5 - Mar 4, 2020
5.24.4 - Mar 4, 2020
5.24.3 - Mar 4, 2020
5.24.2 - Mar 2, 2020
5.24.1 - Feb 26, 2020
5.23.3 - Feb 24, 2020
5.23.2 - Feb 20, 2020
5.22.1 - Jan 31, 2020
5.22.0 - Jan 31, 2020
5.21.5 - Jan 31, 2020
5.21.4 - Jan 22, 2020

Download files

Source Distribution

karp-backend-5-5.29.0.tar.gz (2.4 MB)

Uploaded Feb 19, 2024 Source

Built Distribution

karp_backend_5-5.29.0-py3-none-any.whl (1.0 MB)

Uploaded Feb 19, 2024 Python 3

File details

Details for the file karp-backend-5-5.29.0.tar.gz.

File metadata

Download URL: karp-backend-5-5.29.0.tar.gz
Upload date: Feb 19, 2024
Size: 2.4 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/4.0.2 CPython/3.11.8

File hashes

Hashes for karp-backend-5-5.29.0.tar.gz
Algorithm	Hash digest
SHA256	`fd4816665f816d2d9e9143cc46a4f2d7edf332df12fa9eb88fdcf981c749fedd`
MD5	`db4bee7016235d37bc31a2bd55e5ca59`
BLAKE2b-256	`e73ec4c1b868e0ea39be0f94a8c658072123b989be72e87a7c762d6af8502335`

File details

Details for the file karp_backend_5-5.29.0-py3-none-any.whl.

File metadata

Download URL: karp_backend_5-5.29.0-py3-none-any.whl
Upload date: Feb 19, 2024
Size: 1.0 MB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/4.0.2 CPython/3.11.8

File hashes

Hashes for karp_backend_5-5.29.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`736127bcdbb4b15c597cdee5ad003e89baf7a3be699e76af022b9486daa35c39`
MD5	`7cfd626ce38e47ac7c21af9611edcf4c`
BLAKE2b-256	`ca12bedfb53e8259e7254b22f1118e9553b41eab8e0ea4adb715a3d8f51ce537`
