Project description

Auto fetch Meta data Entries

This repository is intended as a starting point for fetching metadata from online sites.

It is to be integrated into the ColLi backend for the collaborative Literature management.

Install

The following packages have to be installed:

python3 BeautifulSoup4 URLlib python bibtexparser

to install under Ubuntu:

apt install python3 python3-bs4 python3-urllib3 python3-bibtexparser

Alternatively, just install python3 and do the rest via pip from the project directory:

pip install -r requirements.txt

Adding more sites.

If there is no parser for a specific site, it can be created in the sites folder.

Naming convention is, that the file name and the class name inside this file must be the domain name without any subdomains or top-level domains. So for https://dl.acm.org/ it is just acm

An example is provided in sites/example.py

Every class must be a subclass of landingPage and can implement one of the following functions:

    def getBibTeX(self, link, soup):
        return None

    def getDict(self, link, soup):
        return None

Parameters are in both cases the URL of the landing page of the DOI resolver and the page as a parsed beautifulsoup object.

They are seperated by their return value.

getBibTeX must return a valid BibTeX entry or None. getDict must return a python Dictionary with the field names according to the Dublin Core Standard.

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

This version

0.2.4

Feb 12, 2019

0.2.2

Dec 20, 2018

0.2.1

Dec 20, 2018

0.2

Dec 20, 2018

0.1

Dec 20, 2018

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

metadata_expander-0.2.4.tar.gz (11.8 kB view hashes)

Uploaded Feb 12, 2019 Source

Built Distribution

metadata_expander-0.2.4-py3-none-any.whl (12.7 kB view hashes)

Uploaded Feb 12, 2019 Python 3

Hashes for metadata_expander-0.2.4.tar.gz

Hashes for metadata_expander-0.2.4.tar.gz
Algorithm	Hash digest
SHA256	`26da1ef4887534f656f730f38587f475dab7e630808c028baa09ad46c44568cb`
MD5	`dadaf2c5c17c712175b45da4f9e4baa9`
BLAKE2b-256	`18eb0ab0ddefac43bc3ae8e849520f650968e710b8a174fb45d5dd94b473bbe4`

Hashes for metadata_expander-0.2.4-py3-none-any.whl

Hashes for metadata_expander-0.2.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`4dee2cb1705ce39c978243cb98f14c7c961e890ea109018201b93f59b760a8e0`
MD5	`cf9233667e5932c0991d7f71e6b13896`
BLAKE2b-256	`f5772a101c3bea4197b65ab0ad1a3297e6a00252d9ec5239b6114f6f4f8b15ba`