Skip to main content

The Entire Transcripts from the Office in Tidy Format

Project description


The Entire Transcript from the Office in Tidy Format

PyPI version Coverage Status PyUp Black Black

Also available in R: schrute package

schrute R package

What is it

The entire text transcripts from the American version of The Office TV show in pandas dataframe. Use this package to practice or learn NLP, text analysis or deep learning.

Getting started

You can install easily from PyPi


pip install schrutepy


Pull the transcripts into a data frame with this library's only method:

from schrutepy import schrutepy

df = schrutepy.load_schrute()



View the full demo on the website: technistema


Project details

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

schrutepy-0.1.3.tar.gz (5.4 MB view hashes)

Uploaded source

Built Distribution

schrutepy-0.1.3-py3.8.egg (3.5 kB view hashes)

Uploaded 0 1 3

Supported by

AWS AWS Cloud computing Datadog Datadog Monitoring Facebook / Instagram Facebook / Instagram PSF Sponsor Fastly Fastly CDN Google Google Object Storage and Download Analytics Huawei Huawei PSF Sponsor Microsoft Microsoft PSF Sponsor NVIDIA NVIDIA PSF Sponsor Pingdom Pingdom Monitoring Salesforce Salesforce PSF Sponsor Sentry Sentry Error logging StatusPage StatusPage Status page