Skip to main content

Academic Observatory Workflows provides Apache Airflow Workflows for fetching, processing and analysing data about academic institutions.

Project description

Academic Observatory Workflows

Academic Observatory Workflows provides Apache Airflow workflows for fetching, processing and analysing data about academic institutions.

License Python Version Python Version Python package Documentation Status codecov

Telescope Workflows

A telescope a type of workflow used to ingest data from different data sources, and to run workflows that process and output data to other places. Workflows are built on top of Apache Airflow's DAGs.

The workflows include: Crossref Events, Crossref Fundref, Crossref Metadata, Geonames, GRID, Microsoft Academic Graph, Open Citations, ORCID, Scopus, Unpaywall and Web of Science.

Telescope Workflow Description
Crossref Events Crossref Event Data captures discussion on scholarly content and acts as a hub for the storage and distribution of this data. An event may be a citation in a dataset or patent, a mention in a news article, Wikipedia page or on a blog, or discussion and comment on social media.
Crossref Funder Registry The Crossref Funder Registry is an open registry of grant-giving organization names and identifiers, which can be used to find funder IDs and include them as part of metadata deposits. It is a freely-downloadable RDF file. It is CC0-licensed and available to integrate with your own systems. Funder names from acknowledgements should be matched with the corresponding unique funder ID from the Funder Registry
Crossref Metadata Crossref is a non-for-profit membership organisation working on making scholarly communications better. It is an official Digital Object Identifier (DOI) Registration Agency of the International DOI Foundation. They provide metadata for every DOI that is registered with Crossref.
Geonames The GeoNames geographical database covers all countries. It contains over 25 million geographical names and consists of over 11 million unique features whereof 4.8 million populated places and 13 million alternate names
GRID GRID is a free, openly accessible database of research institution identifiers which enables users to make sense of their data. It does so by minimising the work required to link datasets together using a unique and persistent identifier.
Microsoft Academic Graph Microsoft Academic Graph contains scientific publication records, citation relationship between those publications, as well as authors, institutions, journals, conferences, and field of study. It is updated on a weekly basis. It currently indexes over 220 million publications, 88 million of which are journal articles
Open Citations OpenCitations is an independent not-for-profit infrastructure organization for open scholarship dedicated to the publication of open bibliographic and citation data
ORCID ORCID is a non-profit organization that provides researchers with a unique digital identifier which eliminates the risk of confusing an identity with another researcher having the same name. ORCID provides a record that supports automatic links among all the researcher's professional activities.
Scopus SCOPUS is an Elsevier bibliometrics database containing abstracts, citations, of journals, books, and conference proceedings
Unpaywall Unpaywall is an open database of free scholarly articles. It includes data from open indexes like Crossref and DOAJ where it exists. Data comes from “monitoring over 50,000 unique online content hosting locations, including Gold OA journals, Hybrid journals, institutional repositories, and disciplinary repositories.
Web of Science Web of science, previously Web of knowledge, provides bibliometric information, including funding acknowledgements, international publication identifiers, and abstracts

Documentation

For detailed documentation about the Academic Observatory see the Read the Docs website https://academic-observatory-workflows.readthedocs.io

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

academic-observatory-workflows-2022.3.0.tar.gz (945.0 kB view details)

Uploaded Source

File details

Details for the file academic-observatory-workflows-2022.3.0.tar.gz.

File metadata

  • Download URL: academic-observatory-workflows-2022.3.0.tar.gz
  • Upload date:
  • Size: 945.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/34.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.9 tqdm/4.63.0 importlib-metadata/4.11.3 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.9.11

File hashes

Hashes for academic-observatory-workflows-2022.3.0.tar.gz
Algorithm Hash digest
SHA256 2b0b79efd995540c2df8b5f0e90a5e849316157e507da8901ede7ff8ba902557
MD5 858ffd9bf01887ab32aa42a20f4ee05c
BLAKE2b-256 8526c2faf9028a1ebf8d3cf1b8965f89ae686ce67e5b66b0d627303eef1ac13b

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page