The Entire Transcripts from the Office in Tidy Format
Project description
The Entire Transcript from the Office in Tidy Format
Also available in R: schrute package
What is it
The entire text transcripts from the American version of The Office TV show in pandas dataframe. Use this package to practice or learn NLP, text analysis or deep learning.
Getting started
You can install easily from PyPi
Install
pip install schrutepy
Usage
Pull the transcripts into a data frame with this library's only method:
from schrutepy import schrutepy
df = schrutepy.load_schrute()
df.head(5)
Demo
View the full demo on the website: technistema
Contributors
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file schrutepy-0.1.3.tar.gz.
File metadata
- Download URL: schrutepy-0.1.3.tar.gz
- Upload date:
- Size: 5.4 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.7.1 importlib_metadata/4.10.1 pkginfo/1.8.2 requests/2.24.0 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.8.10
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
377434f45676e488cf7e6d879757ac49c99bad322eda2021909ce83aab4216c3
|
|
| MD5 |
e8e2445828e955803dbeccf2201804e8
|
|
| BLAKE2b-256 |
d86e5761edd3ef65d20781384a9c67adff86b8dddb24d4fecee78f513cc4df53
|
File details
Details for the file schrutepy-0.1.3-py3.8.egg.
File metadata
- Download URL: schrutepy-0.1.3-py3.8.egg
- Upload date:
- Size: 3.5 kB
- Tags: Egg
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.7.1 importlib_metadata/4.10.1 pkginfo/1.8.2 requests/2.24.0 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.8.10
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
2158b336f37bdefae1b499bfb85e5d93787e24886e8793e66b3788f4ded912ee
|
|
| MD5 |
0da7a0a59fa0600823a291236b87a9f3
|
|
| BLAKE2b-256 |
c5bc99bd70f7f12a196f77228cbe456317d228ae20f7bbbb3c9c9689d8694771
|