ScienceBeam Parser, parse scientific documents.
Project description
ScienceBeam Parser Python Library
ScienceBeam Parser allows you to parse scientific documents. It provides a REST API Service, as well as a Python API.
Installation
pip install sciencebeam-parser
CLI
CLI: Start Server
python -m sciencebeam_parser.service.server --port=8080
The server will start to listen on port 8080.
The default config.yml defines what models to load.
Python API
Python API: Start Server
from sciencebeam_parser.config.config import AppConfig
from sciencebeam_parser.resources.default_config import DEFAULT_CONFIG_FILE
from sciencebeam_parser.service.server import create_app
config = AppConfig.load_yaml(DEFAULT_CONFIG_FILE)
app = create_app(config)
app.run(port=8080, host='127.0.0.1', threaded=True)
The server will start to listen on port 8080.
Python API: Parse Multiple Files
from sciencebeam_parser.resources.default_config import DEFAULT_CONFIG_FILE
from sciencebeam_parser.config.config import AppConfig
from sciencebeam_parser.utils.media_types import MediaTypes
from sciencebeam_parser.app.parser import ScienceBeamParser
config = AppConfig.load_yaml(DEFAULT_CONFIG_FILE)
# the parser contains all of the models
sciencebeam_parser = ScienceBeamParser.from_config(config)
# a session provides a scope and temporary directory for intermediate files
# it is recommended to create a separate session for every document
with sciencebeam_parser.get_new_session() as session:
session_source = session.get_source(
'test-data/minimal-example.pdf',
MediaTypes.PDF
)
converted_file = session_source.get_local_file_for_response_media_type(
MediaTypes.TEI_XML
)
# Note: the converted file will be in the temporary directory of the session
print('converted file:', converted_file)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
sciencebeam_parser-0.1.5.tar.gz
(88.3 kB
view details)
File details
Details for the file sciencebeam_parser-0.1.5.tar.gz.
File metadata
- Download URL: sciencebeam_parser-0.1.5.tar.gz
- Upload date:
- Size: 88.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.6.0 importlib_metadata/4.2.0 pkginfo/1.8.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.7.10
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
610dbd6040750795648c5d03405638b6aeea071692d128daee024c914694a6f0
|
|
| MD5 |
b510d248a244a0333370b0e7fa88ac06
|
|
| BLAKE2b-256 |
6dffedc3402806d48da934e02ecd5e7a294177964d122389297656cbe86d38dd
|