Kahi plugin to insert and update works from all databases available

Kahi works plugin

Kahi will use this plugin to insert or update the works information.

Description

To insert a work, the plugin must first find it in at least one of the following sources:

  • OpenAlex
  • Scopus
  • Web of Science
  • Google Scholar
  • Scienti
  • UdeA's ranking table

The plugin then aggregates all the information into CoLav's database schema and links the resulting entities to the ones already available in the database from the previous tasks in the workflow.

Installation

Download the repository from GitHub, go to the folder that contains setup.py, and run

pip3 install .

Alternatively, install the published package by running

pip3 install kahi_works

Dependencies

Software dependencies will automatically be installed when installing the plugin. For the data dependencies the user must have:

  • A copy of the OpenAlex dump containing the works of interest, imported into a MongoDB database. The dump can be downloaded from the OpenAlex data dump website; take a subset, since the full database is huge.
  • A subset of Scopus papers' metadata loaded into a MongoDB database.
  • A subset of Web of Science papers' metadata loaded into a MongoDB database.
  • The output of Moai's scraping of Google Scholar.
  • The dump from the Minciencias Scienti database, parsed with kayPacha and uploaded to a MongoDB database.
  • The file from UdeA's ranking office.
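
Since the full OpenAlex dump is very large, one way to take a subset is to filter works before importing them. The following is a minimal sketch, not part of this plugin: it assumes the works are available as JSON Lines text and that each work follows the OpenAlex layout in which `authorships` holds `institutions` carrying a `country_code`. The function names `has_country` and `filter_works` and the choice of "CO" as the default filter are illustrative assumptions.

```python
import json
from typing import Iterable, Iterator


def has_country(work: dict, country_code: str) -> bool:
    """Return True if any author institution of the work is in the given country."""
    # Fields may be absent or null in the dump, so fall back to empty lists.
    for authorship in work.get("authorships") or []:
        for institution in authorship.get("institutions") or []:
            if institution.get("country_code") == country_code:
                return True
    return False


def filter_works(lines: Iterable[str], country_code: str = "CO") -> Iterator[dict]:
    """Yield parsed works from JSON Lines input that match the country filter."""
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        work = json.loads(line)
        if has_country(work, country_code):
            yield work
```

The `lines` argument can be any iterable of strings, for instance a file opened with `gzip.open(path, "rt")`. The matching works could then be written back out one JSON object per line and loaded into MongoDB with a tool such as mongoimport.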

Usage

To use this plugin you must have Kahi installed on your system and construct a YAML file such as

config:
  database_url: localhost:27017
  database_name: kahi
  log_database: kahi_log
  log_collection: log
workflow:
  works:
    num_jobs: 5
    verbose: 5
    openalex:
      database_url: localhost:27017
      database_name: openalexco
      collection_name: works
    scopus:
      database_url: localhost:27017
      database_name: scopus_colombia
      collection_name: stage
    wos:
      database_url: localhost:27017
      database_name: wos_colombia
      collection_name: stage
    scholar:
      database_url: localhost:27017
      database_name: scholar_colombia
      collection_name: stage
    scienti:
      database_url: localhost:27017
      database_name: scienti_111
      collection_name: products
    puntaje:
      file_path: /current/data/colombia/udea/produccion 2018-2022 al 27 oct 2022.xlsx

WARNING: This process could take several hours.
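
Because a run can take hours, it may be worth checking the workflow file for missing keys before starting. The sketch below is a hypothetical helper, not part of Kahi or this plugin. It assumes the YAML file has already been parsed into a dictionary (for example with `yaml.safe_load` from PyYAML) and that each database-backed source needs the three keys shown in the example above. Whether the plugin itself tolerates a missing source is not stated here, so the helper only reports it.

```python
# Keys each MongoDB-backed source carries in the example configuration.
REQUIRED_DB_KEYS = ("database_url", "database_name", "collection_name")
# Source names as they appear under workflow.works in the example.
DB_SOURCES = ("openalex", "scopus", "wos", "scholar", "scienti")


def check_works_config(config: dict) -> list:
    """Return a list of human-readable problems found in the works section."""
    works = (config.get("workflow") or {}).get("works")
    if not isinstance(works, dict):
        return ["workflow.works section is missing"]

    problems = []
    for source in DB_SOURCES:
        section = works.get(source)
        if not isinstance(section, dict):
            # Reported only; the plugin may or may not require every source.
            problems.append(f"{source}: section not present")
            continue
        for key in REQUIRED_DB_KEYS:
            if not section.get(key):
                problems.append(f"{source}: missing '{key}'")

    # The ranking table is file-based, so it only needs a path.
    puntaje = works.get("puntaje")
    if puntaje is not None and not (isinstance(puntaje, dict) and puntaje.get("file_path")):
        problems.append("puntaje: missing 'file_path'")
    return problems
```

An empty list means the structure matches the example; any returned strings name the source and the key to fix.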

License

BSD-3-Clause License

Links

http://colav.udea.edu.co/
