Text processing and analysis for HathiTrust Research Center
Project description
htrc-text-processing Library [Under Development]
Table of Contents
About htrc-text-processing Library
Description goes here.
How to Install
currenlty only by downloading htrc_text_processing
folder and placed in your working directory.
easiest way is, just clone the repo and run example1.py
.
TODO need to create a pip install verion (after creating all functionalities)
What you can do with this.
A function that finds the zip files at the end of the pairtree, moves them to a new folder and expands them, removing the zips
import htrc_text_processing as htrc_tp
# Expand all zip files seperately into a given folder
htrc_tp.get_zips_extract('sample-pairtree-data-parent/sample-pairtree-data', 'output_unziped_files')
# In case you only need zip files use this function
htrc_tp.get_zips_only('pairtree-data', 'output_only_zip_files')
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Close
Hashes for htrc-text-processing-0.0.1.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 213634b903befd537217f839d00e0a2b0db7eff8e1af1062a43afbc91a283a7d |
|
MD5 | 88c15b59795b49dd1433f42785021917 |
|
BLAKE2b-256 | 0e234cf991c3774347ec9b2224b98f7654fb2c1276a68627e421265e6c664988 |
Close
Hashes for htrc_text_processing-0.0.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 62b48ad3e6283eb88e51595da447211fe40a49486db90c11bda5bbf53941bb24 |
|
MD5 | f6db5a969ee206fa6add8ffa24e584c6 |
|
BLAKE2b-256 | 04a4f7fe7865e0faaa8878b469666934d63b04b6677391644880f9d7ae21db3d |