Maps BibTex venue/journal/booktitle strings to their acronyms.
Project description
Venue mappings
Venue mappings is a library project developed by Laurens Versluis to map venue BibTex strings found in sources such as DBLP and Google Scholar to the acronym of the venue.
For example the strings FGCS
, Future Generation Comp. Syst.
and Future Generation Computer Systems
can be found in BibTex entries and other corpuses. The venue mapper maps these strings to their corresponding acronym - FGCS
.
Article meta-data stored in, e.g., a database then becomes easier to query, as well as sanitizing and making uniform your BibTex entries.
Usage
from venue_mapper.venue_mapper import VenueMapper
venue_mapper = VenueMapper()
acronym = venue_mapper.get_abbreviation("FGCS")
Scope
Currently, this repository contains venues from the systems community. Naturally, we encourage additions through pull-requests.
Version structure
Currently, the version is in the form of x.y.z
.
- If any update is not backwards compatible and users need to modify code to make the upgrade work,
x
will be upped.y
andz
will be reset, e.g.,1.2.3
->2.0.0
- If one or more new venues are added,
y
will be upped.x
will not change andz
will be reset, e.g.,1.2.3
->1.3.0
. - If an update only applies fixes because some BibTex strings were missed,
z
will be upped.x
andy
will not change, e.g.,1.2.3
->1.2.4
.
Please note that any of x
, y
, and z
can go beyond 9, i.e., 1.10.1610
is a perfectly fine version.
Project Structure
The file venue_mapper.py
contains the VenueMapper
class.
This class contains the venues
dictionary which contains the matching rules.
There are currently five matching types: EXACT
, STARTS_WITH
, ENDS_WITH
, CONTAINS
, REGEX
All tests are located in the tests
folder. Each test is named test_<acronym of venue>.py
for clarity.
We use the BaseRunner
class to make each test as simple as possible. We believe each test is self-explanatory.
What's up with these five matching types?
Initially, this project used regex rules for all matches, including exact matches. After some micro benchmarks, we found that moving to this type of matching, and caching already seen strings led to a 6-8x speedup depending on the computer.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for Venue_Mapper-1.0.2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | cc655abcfbd38c9ec97d232550ac5f445b662ff77867eb359ac7b2decdeb6420 |
|
MD5 | 433463431ba8be026e74dace81876033 |
|
BLAKE2b-256 | 4d168b7dac1278902471dca6cc2ae0648681d181df49836d813eb69adc5a0c7a |