Accessing and processing data from the DFG-funded SPP Computational Literary Studies
Project description
Python code for working with the data of the DFG-funded SPP Computational Literary Studies.
- sppcls.py: the sppcls Python
module to access the data:
- blocking:
from sppcls import sppcls df = sppcls.load_df(work="test", projects=["project"]) print(df.describe())
- non blocking:
from sppcls import sppcls df = await sppcls.load_df_async(work="test", projects=["project"]) print(df.describe())
Installation
PyPI
pip install sppcls
From source
Setup an virtual environment, if necessary:
python3 -m venv env
source env/bin/activate
Install dependencies:
pip install -r requirements.txt
python -m spacy download de_core_news_lg
Usage
tokenise.py
python tokenise.py path_to_input_txt path_to_output_folder
TODO: fix character offset to be byte instead
check.py
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
sppcls-0.0.5.tar.gz
(8.4 kB
view hashes)
Built Distribution
sppcls-0.0.5-py3-none-any.whl
(9.0 kB
view hashes)