Kahi plugin to insert and update works from scienti

Kahi scienti works plugin

Kahi uses this plugin to insert or update works information from the scienti database.

Description

This plugin reads the information from a scienti dump to insert or update the information of academic products in colav's database.

Installation

You can download the repository from GitHub. Go into the folder where setup.py is located and run

pip3 install .

Alternatively, you can install the published package by running

pip3 install kahi_scienti_works

Similarity support

To process works without a DOI, similarity is mandatory, so an Elasticsearch server must be running. The plugin uses the server to find the most similar works in the database. To deploy it, please read https://github.com/colav/Chia/tree/main/elasticsaerch and follow the instructions.

Docker and docker-compose are required to deploy the server.

If you only want to process works with a DOI, you can skip this step and remove es_index, es_url, es_user and es_password from the yaml file.

However, scienti_works/doi must still be present in the yaml file.
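
The DOI-only setup described above can be sketched in Python as a small helper that drops the Elasticsearch keys from a workflow step. This is only an illustration that works on a plain dictionary mirroring the yaml file; the function name strip_similarity_keys is hypothetical and is not part of Kahi or of this plugin.

```python
from copy import deepcopy

# Keys named in this README as the Elasticsearch (similarity) settings.
ES_KEYS = ("es_index", "es_url", "es_user", "es_password")


def strip_similarity_keys(workflow, step="scienti_works/doi"):
    """Return a copy of `workflow` where `step` has no Elasticsearch keys.

    Hypothetical helper for illustration only; not part of Kahi.
    """
    if step not in workflow:
        # The README states that scienti_works/doi is mandatory.
        raise KeyError(f"{step!r} must be present in the workflow")
    result = deepcopy(workflow)  # leave the caller's dictionary untouched
    for key in ES_KEYS:
        result[step].pop(key, None)
    return result


# Dictionary mirroring the `workflow` section of the yaml file.
workflow = {
    "scienti_works/doi": {
        "es_index": "kahi_es",
        "es_url": "http://localhost:9200",
        "es_user": "elastic_user",
        "es_password": "elastic_pass",
        "databases": [
            {
                "database_url": "localhost:27017",
                "database_name": "scienti",
                "collection_name": "products",
            }
        ],
        "num_jobs": 5,
        "verbose": 5,
    }
}

doi_only = strip_similarity_keys(workflow)
print(sorted(doi_only["scienti_works/doi"]))  # ['databases', 'num_jobs', 'verbose']
```

The helper copies the input before editing it, so the full configuration remains available if similarity is enabled later.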

Dependencies

Software dependencies are installed automatically when installing the plugin. You must have at least one database obtained from Minciencias, previously processed by kayPacha and uploaded to a MongoDB database. The C++ library libhunspell-dev must also be installed on your system. On Ubuntu you can install it by typing

$ sudo apt install libhunspell-dev

Usage

To use this plugin you must have kahi installed on your system and construct a yaml file such as

config:
  database_url: localhost:27017
  database_name: kahi
  log_database: kahi
  log_collection: log
workflow:
  scienti_works/doi:
    es_index: kahi_es
    es_url: http://localhost:9200
    es_user: elastic_user
    es_password: elastic_pass
    databases:
    - database_url: localhost:27017
      database_name: scienti
      collection_name: products
    num_jobs: 5
    verbose: 5
  scienti_works:
    es_index: kahi_es
    es_url: http://localhost:9200
    es_user: elastic_user
    es_password: elastic_pass
    databases:
    - database_url: localhost:27017
      database_name: scienti
      collection_name: products
    num_jobs: 5
    verbose: 5
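
Before starting a long run it can help to sanity-check the workflow section. The sketch below does this on a plain dictionary mirroring the yaml file. It assumes that every entry under databases needs the three keys shown in the example (database_url, database_name, collection_name); the function names check_step and check_workflow are hypothetical and are not part of Kahi.

```python
# Assumption: these are the keys each `databases` entry needs,
# taken from the example configuration in this README.
REQUIRED_DB_KEYS = {"database_url", "database_name", "collection_name"}


def check_step(name, step):
    """Return a list of problems found in one workflow step (empty if none)."""
    problems = []
    databases = step.get("databases") or []
    if not databases:
        problems.append(f"{name}: no entries under 'databases'")
    for i, db in enumerate(databases):
        missing = REQUIRED_DB_KEYS - set(db)
        if missing:
            problems.append(f"{name}: databases[{i}] lacks {sorted(missing)}")
    return problems


def check_workflow(workflow):
    """Check the two steps this README describes; illustrative only."""
    problems = []
    if "scienti_works/doi" not in workflow:
        problems.append("scienti_works/doi is mandatory")
    for name in ("scienti_works/doi", "scienti_works"):
        if name in workflow:
            problems.extend(check_step(name, workflow[name]))
    return problems


good = {
    "scienti_works/doi": {
        "databases": [
            {
                "database_url": "localhost:27017",
                "database_name": "scienti",
                "collection_name": "products",
            }
        ]
    }
}
bad = {"scienti_works": {"databases": [{"database_url": "localhost:27017"}]}}

print(check_workflow(good))  # []
for problem in check_workflow(bad):
    print(problem)
```

Returning a list of messages instead of raising on the first problem lets all issues in the file be reported at once.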

If you have several scienti databases, use the example below

config:
  database_url: localhost:27017
  database_name: kahi
  log_database: kahi
  log_collection: log
workflow:
  scienti_works/doi:
    es_index: kahi_es
    es_url: http://localhost:9200
    es_user: elastic_user
    es_password: elastic_pass
    databases:
      - database_url: localhost:27017
        database_name: scienti_udea
        collection_name: products
      - database_url: localhost:27017
        database_name: scienti_uec_2022
        collection_name: product
    num_jobs: 5
    verbose: 5
WARNING: This process could take several hours.
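
When there are many scienti databases, the databases list in the example above can be generated rather than typed by hand. The following is a minimal sketch under assumptions: build_databases is a hypothetical helper, not part of Kahi, and it assumes all databases share one MongoDB URL and one collection name. Note that the example above uses a different collection name for the second database, so adjust the entries where that applies.

```python
def build_databases(names, url="localhost:27017", collection="products"):
    """Return one `databases` entry per scienti database name.

    Hypothetical helper; assumes a shared URL and collection name.
    """
    return [
        {"database_url": url, "database_name": name, "collection_name": collection}
        for name in names
    ]


# Dictionary mirroring the body of one workflow step in the yaml file.
step = {
    "databases": build_databases(["scienti_udea", "scienti_uec_2022"]),
    "num_jobs": 5,
    "verbose": 5,
}

for entry in step["databases"]:
    print(entry["database_name"], entry["collection_name"])
```

The resulting dictionary can then be written to the yaml file with any YAML library, for example PyYAML's yaml.safe_dump.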

License

BSD-3-Clause License

Links

http://colav.udea.edu.co/
