Read and write BibTeX files
bibparse -- read and write BibTeX files
Description
bibparse reads and writes BibTeX files. The main class, Biblio, is a dict with methods for parsing, reading, writing and searching BibTeX data. Each entry in the Biblio is another kind of special dict, BibItem. The user usually only needs to access Biblio.
NOTE: Because I only use {...} myself, I did not think of adding the more traditional "..." field delimiters until version 1.2.*.
Current version
Version 1.2.0-dev.9 -- 29 July 2021.
Copyrights
Copyright © 2019–2021 Legisign.org <software@legisign.org>
This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.
This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.
You should have received a copy of the GNU General Public License along with this program. If not, see https://www.gnu.org/licenses/.
Example usage
import bibparse
db = bibparse.Biblio(filename)
print(db)
The sample code loads, parses and pretty-prints a BibTeX file.
Module contents
1. Helper functions
The values of some BibItem fields (namely author, editor, translator, publisher, address, and pages, all of which may hold several names, locations, or other data) are stored internally as lists.
to_bibtex(key, val) -- convert an internal Python value into a BibTeX string
to_python(key, val) -- convert a BibTeX string into an internal Python value
Both functions take a BibTeX field name (key) in order to decide how to handle the value. The separator is the string " and ", including the leading and trailing space, except for "pages", where it is a single dash "-".
Examples:
to_bibtex('pages', [100, 110]) → '100-110'
to_python('address', 'London and New York') → ['London', 'New York']
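The conversion rule described above can be illustrated with a small standalone sketch. It does not use bibparse's own code: the names LIST_FIELDS, separator, sketch_to_python, and sketch_to_bibtex are hypothetical, and it assumes that page numbers stay as strings when split and that fields outside the list pass through unchanged, two details the description does not spell out.

```python
# Standalone sketch of the separator rule; not bibparse's implementation.
LIST_FIELDS = {'author', 'editor', 'translator', 'publisher', 'address', 'pages'}


def separator(key):
    """Return the separator the description gives for a field name."""
    return '-' if key.lower() == 'pages' else ' and '


def sketch_to_python(key, val):
    """Split a BibTeX string into a list for multi-valued fields."""
    if key.lower() not in LIST_FIELDS:
        return val  # assumption: other fields pass through unchanged
    return [part.strip() for part in val.split(separator(key))]


def sketch_to_bibtex(key, val):
    """Join a list back into a single BibTeX string."""
    if isinstance(val, (list, tuple)):
        return separator(key).join(str(item) for item in val)
    return str(val)


print(sketch_to_python('address', 'London and New York'))
print(sketch_to_bibtex('pages', [100, 110]))
```

Passing the field name to both functions is what lets one pair of helpers serve every field: the name alone decides which separator applies.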
2. Exceptions
BibError -- the base exception
DuplicateError -- duplicate IDs
NoIDError -- missing ID in an entry
PreambleError -- invalid preamble
Each exception has a lineno property that refers to the line that raised the exception:
try:
    db = bibparse.Biblio(filename)
except bibparse.DuplicateError as exc:
    print(f'duplicate bibid on line {exc.lineno}')
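Because BibError is the base exception, a single handler can presumably cover every parsing failure. The sketch below uses stand-in classes so that it runs without the package installed; the constructor signature (message, lineno) and the helper describe are assumptions made for illustration, not bibparse's actual definitions.

```python
# Stand-in classes mirroring the documented hierarchy; not bibparse's source.
class BibError(Exception):
    """Base exception carrying the offending line number."""

    def __init__(self, message, lineno=None):
        super().__init__(message)
        self._lineno = lineno

    @property
    def lineno(self):
        return self._lineno


class DuplicateError(BibError):
    """Duplicate ID."""


class NoIDError(BibError):
    """Missing ID in an entry."""


def describe(exc):
    """Format any BibError subclass into a one-line report."""
    return f'{type(exc).__name__} on line {exc.lineno}: {exc}'


try:
    raise DuplicateError('duplicate bibid', lineno=12)
except BibError as exc:  # the base class catches every subclass
    report = describe(exc)
    print(report)
```

With the real package, the equivalent would be an except bibparse.BibError clause around the Biblio(filename) call.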
3. BibItem class
A dict-derived object representing a single BibTeX entry.
3.1 Methods
These are derived from dict but modified to ensure lower-case keys, a reasonable ordering of keys in a printout, and sort ordering.
__lt__()
__repr__()
__setitem__()
parse(data) -- parse string data into a BibTeX entry
update() -- update BibItem using another object
parse() can be manually called for str input.
update() has an additional optional overwrite=bool parameter. If True (the default), update() functions exactly like dict.update(), updating the BibItem contents from data in the supplied dict. If False, only keys new to the BibItem are added; existing values are not overwritten.
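The two update modes can be sketched with a small dict subclass. SketchItem is a hypothetical stand-in written for this illustration, not bibparse's BibItem; it only reproduces the lower-case keys and the overwrite switch described above.

```python
# Stand-in for the documented behaviour; not bibparse's BibItem class.
class SketchItem(dict):
    """A dict that lower-cases keys and supports non-overwriting updates."""

    def __setitem__(self, key, value):
        super().__setitem__(key.lower(), value)

    def update(self, other, overwrite=True):
        for key, value in dict(other).items():
            # With overwrite=False, keys already present keep their value.
            if overwrite or key.lower() not in self:
                self[key] = value


item = SketchItem()
item.update({'Title': 'Old title', 'year': '2019'})
item.update({'title': 'New title', 'Publisher': 'Example Press'}, overwrite=False)
print(item)
```

The second call adds the publisher but leaves the existing title alone, which is the behaviour one would want when merging a secondary source into a curated entry.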
4. Biblio class
The main class. The constructor can be given an optional filename argument; the file is opened and parsed automatically.
4.1 Methods
The __repr__() method is provided so that merely passing the Biblio object to print() produces valid BibTeX output.
by_bibid(bibids) -- return all entries whose bibid is in the list (or set)
by_regex(field, regex) -- search in field by regex
by_types(bibtypes, complement=False) -- search by BibTeX type
parse(data) -- parse string as BibTeX data
read(filename) -- read and parse file as BibTeX data
write(filename) -- write file in BibTeX format
parse() can be manually called for str input; however, read() automatically calls it, as does the constructor Biblio() when given a filename argument.
by_bibid(bibids) was written in order to make combining searches easier. Each by_regex() call returns a Biblio object whose keys can be obtained with Biblio.keys(). These keys can be combined with set operations to produce a new set of keys that match either any (union) or all (intersection) of the searches, and by_bibid() can then be used to return the corresponding entries.
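The set-based combination can be sketched with plain dicts standing in for the Biblio objects that two by_regex() calls would return; since Biblio is a dict, its keys() view should support the same operators. The entry IDs, the sample data, and the helper pick (a stand-in for by_bibid()) are invented for illustration.

```python
# Plain dicts stand in for search results; the data is made up.
database = {
    'smith2001': {'author': ['Smith, Anna'], 'year': '2001'},
    'smith2015': {'author': ['Smith, Anna'], 'year': '2015'},
    'jones2015': {'author': ['Jones, Ben'], 'year': '2015'},
}
by_author = {k: v for k, v in database.items() if 'Smith, Anna' in v['author']}
by_year = {k: v for k, v in database.items() if v['year'] == '2015'}

# dict key views support set operators directly.
match_all = by_author.keys() & by_year.keys()   # intersection: both searches
match_any = by_author.keys() | by_year.keys()   # union: at least one search


def pick(db, bibids):
    """Stand-in for by_bibid(): keep entries whose ID is in bibids."""
    return {k: v for k, v in db.items() if k in bibids}


print(sorted(pick(database, match_all)))
print(sorted(pick(database, match_any)))
```

The intersection yields entries that satisfy every search, while the union yields entries that satisfy at least one.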
by_regex(field, regex) searches the database by field values and returns the matches in a new Biblio object. E.g., by_regex('author', '.*Smith.*') returns all entries where the "author" field contains "Smith".
by_types(bibtypes, complement=False) searches the database by BibTeX types (given without the initial "@") and returns the matches in a new Biblio object. bibtypes can be a string specifying a single type (e.g., "article") or a list of strings specifying several types (e.g., ["article", "book"]). If the optional complement parameter is set to True, the function returns the complement, i.e., all entries not matching the criteria.
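The filtering logic of by_types(), including the complement switch, can be sketched as follows. This is a hypothetical stand-in that works on (bibid, bibtype) pairs rather than a real Biblio; how bibparse stores an entry's type is not stated here, and the case-insensitive comparison is an assumption.

```python
# Stand-in for the documented filtering logic; not bibparse's code.
def sketch_by_types(entries, bibtypes, complement=False):
    """Filter (bibid, bibtype) pairs by type, or by the complement."""
    if isinstance(bibtypes, str):
        bibtypes = [bibtypes]  # a single type may be given as a string
    wanted = {t.lower() for t in bibtypes}  # assumption: case-insensitive
    return [bibid for bibid, bibtype in entries
            if (bibtype.lower() in wanted) != complement]


entries = [('a1', 'article'), ('b1', 'book'), ('t1', 'phdthesis')]
print(sketch_by_types(entries, 'article'))
print(sketch_by_types(entries, ['article', 'book'], complement=True))
```

Comparing the membership test against complement keeps both modes in one expression: with complement=False matching entries pass, and with complement=True only the non-matching ones do.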