A set of Python modules for the Cornell Movie-Dialogs Corpus with Storm

Abstract

This package includes classes that extend the Storm ORM to handle Cornell Movie-Dialogs Corpus data.

Install

pip install storm                # if not already installed
pip install cornel-movie-dialogs-corpus-storm

Usage

from mdcorpus.orm import *
from mdcorpus.parser import *

...

Class List

  • MovieTitlesMetadata

  • Genre

  • MovieGenreLine

  • MovieCharactersMetadata

  • MovieConversation

  • MovieLine

  • RawScriptUrl

Corpus Problem

These are notes on the problems I found in the corpus data and how I handled them.

movie_titles_metadata.txt

  • I ignored the letter suffix that follows some years.

    • for example, line 34, 1989/I

  • I manually adjusted title data that contains an acute accent.

    • line 115, léon

  • I ignored duplicate entries in the genre data.

    • line 58, ['horror', 'mystery', 'mystery', 'sci-fi', 'sci-fi']
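
The year and genre fixes listed above could be sketched roughly as follows. This is only an illustration using the two example values from the notes. The helper names `normalize_year` and `unique_genres` are hypothetical and are not part of `mdcorpus`, and the sketch assumes the genre field is a Python-style list literal, as in the example on line 58 of the corpus file.

```python
import ast
import re


def normalize_year(raw_year):
    """Return the leading four-digit year as an int, dropping a suffix such as '/I'.

    Hypothetical helper for illustration; not part of mdcorpus.
    """
    match = re.match(r"\d{4}", raw_year.strip())
    if match is None:
        raise ValueError("no four-digit year in %r" % raw_year)
    return int(match.group())


def unique_genres(raw_genres):
    """Parse a list literal of genres and drop repeats, keeping first-seen order.

    Hypothetical helper for illustration; not part of mdcorpus.
    """
    seen = []
    for genre in ast.literal_eval(raw_genres):
        if genre not in seen:
            seen.append(genre)
    return seen


year = normalize_year("1989/I")
genres = unique_genres("['horror', 'mystery', 'mystery', 'sci-fi', 'sci-fi']")
```

With the example values from the notes, `year` is `1989` and `genres` is `['horror', 'mystery', 'sci-fi']`. The accent adjustment is left out because it was done by hand.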
