Web of Science XML Parser
A parser for Web of Science XML data in Python.
Installation
The package can be installed from PyPI:
pip install wos_parser
Getting Started
The parser can read the *.xml files included in the Web of Science XML dataset. Note: The dataset is distributed as a collection of zipped archives (one for each record year), which in turn contain zipped versions of the XML files. These need to be unpacked before they are passed to the parser.
from wos_parser import WosParser
# Path to an unpacked XML file from the dataset
xml_path = "dataset/2023_CORE/WR_2023_20230111080536_CORE_0001.xml"
wos_parser = WosParser()
# Parse the records contained in the file
records = wos_parser.parse_records(xml_path)
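The unpacking step mentioned in the note can be sketched with the standard library alone. This is a minimal sketch, not part of the package: the helper name `unpack_archive` is hypothetical, and it assumes the inner files are either `.zip` archives or `.gz` files, which may not match the layout of your copy of the dataset.

```python
import gzip
import shutil
import zipfile
from pathlib import Path


def unpack_archive(archive_path, target_dir):
    """Extract a per-year archive, then unpack the compressed files inside it.

    Hypothetical helper. Assumption: inner files are .zip or .gz;
    adjust this to the actual layout of the dataset.
    """
    target_dir = Path(target_dir)
    target_dir.mkdir(parents=True, exist_ok=True)

    # Step 1: extract the outer (per-year) archive.
    with zipfile.ZipFile(archive_path) as outer:
        outer.extractall(target_dir)

    # Step 2: unpack every compressed file that came out of the outer archive.
    # sorted() materialises the listing before new files are written.
    for inner in sorted(target_dir.rglob("*")):
        if inner.suffix == ".zip":
            with zipfile.ZipFile(inner) as zf:
                zf.extractall(inner.parent)
        elif inner.suffix == ".gz":
            # "name.xml.gz" is written next to the original as "name.xml".
            with gzip.open(inner, "rb") as src, open(inner.with_suffix(""), "wb") as dst:
                shutil.copyfileobj(src, dst)

    # Return the paths of all unpacked XML files.
    return sorted(target_dir.rglob("*.xml"))
```

Each returned path can then be passed to `parse_records` in the same way as `xml_path` above.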
Generating the documentation
To view the documentation, you currently have to build it locally. To do that, follow these steps:
- Clone the package repository.
- Install additional dependencies:
  pip install myst_parser pydata_sphinx_theme
- Go to the project folder's subdirectory doc/.
- Run make html.
- Open the file doc/_build/html/index.html in a browser.
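Where `make` is not available (for example on some Windows setups), the build and open steps can be approximated from Python. This is a sketch under assumptions: it presumes that Sphinx is installed, that the Sphinx source directory is `doc/` (inferred from the `doc/_build/html` output path), and that `python -m sphinx -b html` is an acceptable stand-in for `make html`. The function names are hypothetical and not part of the package.

```python
import subprocess
import sys
import webbrowser
from pathlib import Path


def sphinx_command(project_root):
    """Build the command line for an HTML build (assumes sources live in doc/)."""
    doc_dir = Path(project_root) / "doc"
    out_dir = doc_dir / "_build" / "html"
    cmd = [sys.executable, "-m", "sphinx", "-b", "html", str(doc_dir), str(out_dir)]
    return cmd, out_dir / "index.html"


def build_docs(project_root, open_browser=False):
    """Run the Sphinx build and optionally open the result in a browser."""
    cmd, index_html = sphinx_command(project_root)
    # check=True raises CalledProcessError if the build fails.
    subprocess.run(cmd, check=True)
    if open_browser:
        webbrowser.open(index_html.resolve().as_uri())
    return index_html
```

Calling `build_docs(".", open_browser=True)` from the root of the cloned repository would then cover the last three steps of the list.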
File details
Details for the file wos_parser-0.1.0.tar.gz.
File metadata
- Download URL: wos_parser-0.1.0.tar.gz
- Upload date:
- Size: 11.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.12.2
File hashes
Algorithm | Hash digest
---|---
SHA256 | dd0f3e1609ecbb1fd8893ad468486c7a805ed8bd05c3dc1011c3bd202822a0fe
MD5 | 5d8a894fbdfebfa23b1da8e0a9ed0f1b
BLAKE2b-256 | ed0aca1fbe703d5c1ce7acb7d328b730b4c2263ba68ca155abb5580480c54811
File details
Details for the file wos_parser-0.1.0-py3-none-any.whl.
File metadata
- Download URL: wos_parser-0.1.0-py3-none-any.whl
- Upload date:
- Size: 19.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.12.2
File hashes
Algorithm | Hash digest
---|---
SHA256 | 9c8e65ff89efed584df59cf1d2fb2abb5e4cb6e564ab04ddf8e5c58b9a6e43ce
MD5 | d410fcfdbb2144cec572971f25ac9dab
BLAKE2b-256 | 1e9200a9be5375a047c3dacf0d07c9fe82ffca9ab08475a211a83fa55d10a154
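A manually downloaded file can be checked against the SHA256 digests listed above. The following is a minimal sketch using only `hashlib`; the helper names `sha256_of` and `matches_digest` are hypothetical, and the file path in the usage note is an assumption about where the download was saved.

```python
import hashlib


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # iter() with a sentinel stops once read() returns an empty bytes object.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_digest(path, expected_hex):
    """Compare a file's SHA256 digest with an expected hex string."""
    return sha256_of(path) == expected_hex.strip().lower()
```

For example, `matches_digest("wos_parser-0.1.0.tar.gz", "dd0f3e1609ecbb1fd8893ad468486c7a805ed8bd05c3dc1011c3bd202822a0fe")` should return `True` for an intact copy of the source distribution.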