Skip to main content

The Entire Transcripts from the Office in Tidy Format

Project description


The complete transcripts of The Office (US) in a tidy dataframe

PyPI version Coverage Status PyUp Black

"The worst thing about prison was the dementors" --- Prison Mike

What is it

The entire text transcripts from the American version of The Office TV show in pandas dataframe. Use this package to practice or learn NLP, text analysis or deep learning.

Getting started

You can install easily from PyPi


pip install schrutepy


Pull the transcripts into a data frame with this library's only method:

from schrutepy import schrutepy

df = schrutepy.load_schrute()



Project details

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for schrutepy, version 0.1.1
Filename, size File type Python version Upload date Hashes
Filename, size schrutepy-0.1.1-py3-none-any.whl (2.5 kB) File type Wheel Python version py3 Upload date Hashes View hashes
Filename, size schrutepy-0.1.1.tar.gz (5.4 MB) File type Source Python version None Upload date Hashes View hashes

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN DigiCert DigiCert EV certificate StatusPage StatusPage Status page