Skip to main content

A parser for Web of Science XML data in Python.

Project description

Web of Science XML Parser

A parser for Web of Science XML data in Python.

Installation

The package can be installed from PyPI:

pip install wos_parser

Getting Started

The parser can read *.xml files included in the Web of Science XML dataset. Note: The dataset is distributed as a collection of zipped archives (one for each record year), which in turn contain zipped versions of the xml files. These need to be unpacked first before passing them to the parser.

from wos_parser import WosParser


xml_path = "dataset/2023_CORE/WR_2023_20230111080536_CORE_0001.xml"

wos_parser = WosParser()

records = wos_parser.parse_records(xml_path)

Generating the documentation

To view the documentation, you currently have to build it locally. To do that, follow these steps:

  1. Clone the package repository.

  2. Install Sphinx.

  3. Install additional dependencies:

    pip install myst_parser pydata_sphinx_theme

  4. Go to the project folder's subdirectory doc/.

  5. Run make html.

  6. Open the file doc/_build/html/index.html in a browser.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

wos_parser-0.1.0.tar.gz (11.8 kB view details)

Uploaded Source

Built Distribution

wos_parser-0.1.0-py3-none-any.whl (19.5 kB view details)

Uploaded Python 3

File details

Details for the file wos_parser-0.1.0.tar.gz.

File metadata

  • Download URL: wos_parser-0.1.0.tar.gz
  • Upload date:
  • Size: 11.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.12.2

File hashes

Hashes for wos_parser-0.1.0.tar.gz
Algorithm Hash digest
SHA256 dd0f3e1609ecbb1fd8893ad468486c7a805ed8bd05c3dc1011c3bd202822a0fe
MD5 5d8a894fbdfebfa23b1da8e0a9ed0f1b
BLAKE2b-256 ed0aca1fbe703d5c1ce7acb7d328b730b4c2263ba68ca155abb5580480c54811

See more details on using hashes here.

File details

Details for the file wos_parser-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: wos_parser-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 19.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.12.2

File hashes

Hashes for wos_parser-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 9c8e65ff89efed584df59cf1d2fb2abb5e4cb6e564ab04ddf8e5c58b9a6e43ce
MD5 d410fcfdbb2144cec572971f25ac9dab
BLAKE2b-256 1e9200a9be5375a047c3dacf0d07c9fe82ffca9ab08475a211a83fa55d10a154

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page