Skip to main content

doi search and return metadata

Project description

Auto fetch Meta data Entries

This repository is intended as a starting point for fetching metadata from online sites.

It is to be integrated into the ColLi backend for the collaborative Literature management.

Install

The following packages have to be installed:

python3 BeautifulSoup4 URLlib python bibtexparser

to install under Ubuntu:

apt install python3 python3-bs4 python3-urllib3 python3-bibtexparser

Alternatively, just install python3 and do the rest via pip from the project directory:

pip install -r requirements.txt

Adding more sites.

If there is no parser for a specific site, it can be created in the sites folder.

Naming convention is, that the file name and the class name inside this file must be the domain name without any subdomains or top-level domains. So for https://dl.acm.org/ it is just acm

An example is provided in sites/example.py

Every class must be a subclass of landingPage and can implement one of the following functions:

    def getBibTeX(self, link, soup):
        return None

    def getDict(self, link, soup):
        return None

Parameters are in both cases the URL of the landing page of the DOI resolver and the page as a parsed beautifulsoup object.

They are seperated by their return value.

getBibTeX must return a valid BibTeX entry or None. getDict must return a python Dictionary with the field names according to the Dublin Core Standard.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

metadata_expander-0.2.4.tar.gz (11.8 kB view details)

Uploaded Source

Built Distribution

metadata_expander-0.2.4-py3-none-any.whl (12.7 kB view details)

Uploaded Python 3

File details

Details for the file metadata_expander-0.2.4.tar.gz.

File metadata

  • Download URL: metadata_expander-0.2.4.tar.gz
  • Upload date:
  • Size: 11.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.7.2

File hashes

Hashes for metadata_expander-0.2.4.tar.gz
Algorithm Hash digest
SHA256 26da1ef4887534f656f730f38587f475dab7e630808c028baa09ad46c44568cb
MD5 dadaf2c5c17c712175b45da4f9e4baa9
BLAKE2b-256 18eb0ab0ddefac43bc3ae8e849520f650968e710b8a174fb45d5dd94b473bbe4

See more details on using hashes here.

File details

Details for the file metadata_expander-0.2.4-py3-none-any.whl.

File metadata

  • Download URL: metadata_expander-0.2.4-py3-none-any.whl
  • Upload date:
  • Size: 12.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.7.2

File hashes

Hashes for metadata_expander-0.2.4-py3-none-any.whl
Algorithm Hash digest
SHA256 4dee2cb1705ce39c978243cb98f14c7c961e890ea109018201b93f59b760a8e0
MD5 cf9233667e5932c0991d7f71e6b13896
BLAKE2b-256 f5772a101c3bea4197b65ab0ad1a3297e6a00252d9ec5239b6114f6f4f8b15ba

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page