Academic Observatory Workflows provides Apache Airflow Workflows for fetching, processing and analysing data about academic institutions.
Project description
Academic Observatory Workflows provides Apache Airflow workflows for fetching, processing and analysing data about academic institutions.
Telescope Workflows
A telescope a type of workflow used to ingest data from different data sources, and to run workflows that process and output data to other places. Workflows are built on top of Apache Airflow's DAGs.
The workflows include: Crossref Events, Crossref Fundref, Crossref Metadata, Geonames, GRID, Microsoft Academic Graph, Open Citations, ORCID, Scopus, Unpaywall and Web of Science.
Telescope Workflow | Description |
---|---|
Crossref Event Data captures discussion on scholarly content and acts as a hub for the storage and distribution of this data. An event may be a citation in a dataset or patent, a mention in a news article, Wikipedia page or on a blog, or discussion and comment on social media. | |
The Crossref Funder Registry is an open registry of grant-giving organization names and identifiers, which can be used to find funder IDs and include them as part of metadata deposits. It is a freely-downloadable RDF file. It is CC0-licensed and available to integrate with your own systems. Funder names from acknowledgements should be matched with the corresponding unique funder ID from the Funder Registry | |
Crossref is a non-for-profit membership organisation working on making scholarly communications better. It is an official Digital Object Identifier (DOI) Registration Agency of the International DOI Foundation. They provide metadata for every DOI that is registered with Crossref. | |
The GeoNames geographical database covers all countries. It contains over 25 million geographical names and consists of over 11 million unique features whereof 4.8 million populated places and 13 million alternate names | |
GRID is a free, openly accessible database of research institution identifiers which enables users to make sense of their data. It does so by minimising the work required to link datasets together using a unique and persistent identifier. | |
Microsoft Academic Graph contains scientific publication records, citation relationship between those publications, as well as authors, institutions, journals, conferences, and field of study. It is updated on a weekly basis. It currently indexes over 220 million publications, 88 million of which are journal articles | |
OpenCitations is an independent not-for-profit infrastructure organization for open scholarship dedicated to the publication of open bibliographic and citation data | |
ORCID is a non-profit organization that provides researchers with a unique digital identifier which eliminates the risk of confusing an identity with another researcher having the same name. ORCID provides a record that supports automatic links among all the researcher's professional activities. | |
SCOPUS is an Elsevier bibliometrics database containing abstracts, citations, of journals, books, and conference proceedings | |
Unpaywall is an open database of free scholarly articles. It includes data from open indexes like Crossref and DOAJ where it exists. Data comes from “monitoring over 50,000 unique online content hosting locations, including Gold OA journals, Hybrid journals, institutional repositories, and disciplinary repositories. | |
Web of science, previously Web of knowledge, provides bibliometric information, including funding acknowledgements, international publication identifiers, and abstracts |
Documentation
For detailed documentation about the Academic Observatory see the Read the Docs website https://academic-observatory-workflows.readthedocs.io
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Hashes for academic-observatory-workflows-2022.3.0.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2b0b79efd995540c2df8b5f0e90a5e849316157e507da8901ede7ff8ba902557 |
|
MD5 | 858ffd9bf01887ab32aa42a20f4ee05c |
|
BLAKE2b-256 | 8526c2faf9028a1ebf8d3cf1b8965f89ae686ce67e5b66b0d627303eef1ac13b |