The Entire Transcripts from The Office in Tidy Format
Project description
The complete transcripts of The Office (US) in a tidy dataframe
"The worst thing about prison was the dementors" --- Prison Mike
What is it
The complete text transcripts of the American version of The Office TV show, packaged as a pandas DataFrame. Use this package to practice or learn NLP, text analysis, or deep learning.
Getting started
schrutepy is available on PyPI and installs with pip.
Install
pip install schrutepy
Usage
Load the transcripts into a pandas DataFrame with the library's only function, load_schrute:
from schrutepy import schrutepy
df = schrutepy.load_schrute()
df.head(5)
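Once the DataFrame is loaded, a common first step is counting how many lines each character speaks. The following is a minimal sketch, not part of schrutepy itself: the helper lines_per_character is hypothetical, and the column names "character" and "text" are assumptions about the loaded data, so check df.columns before relying on them. It uses a small stand-in DataFrame with placeholder text so that it runs without downloading anything.

```python
import pandas as pd


def lines_per_character(df, top_n=5):
    """Return the top_n characters ranked by number of spoken lines.

    Hypothetical helper; assumes df has a "character" column.
    """
    return df["character"].value_counts().head(top_n)


# Stand-in rows with placeholder text (not real transcript lines).
# The column names "character" and "text" are assumptions about the real data.
sample = pd.DataFrame(
    {
        "character": ["Michael", "Dwight", "Michael", "Jim"],
        "text": ["line one", "line two", "line three", "line four"],
    }
)

counts = lines_per_character(sample)
print(counts)
```

With the real data, the same helper could be called as lines_per_character(df), provided the column exists under that name.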
Download files
Source Distribution
schrutepy-0.1.1.tar.gz (5.4 MB)
Built Distribution
Hashes for schrutepy-0.1.1-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | 1e0e2a46cc289510f095cf9e2476fbfef0972dbea5f9b8e4914abcc1ed9c1a28
MD5 | 513caf0696bc32f4458ca2ea24aaf4ab
BLAKE2b-256 | b593a2e2822a400081abaa109b24a05edb079141ff53d3333261aba183ed6c7b