A set of python modules for cornel movie-dialogs corpus with storm
Project description
A set of python modules for cornel movie-dialogs corpus with storm.
Abstract
This module include some classes extending storm ORM for cornel movie-dialogs corpus data.
Install
pip install storm # if you not pip install cornel-movie-dialogs-corpus-storm
Usage
from mdcorpus.orm import * from mdcorpus.parser import * ...
Class List
MovieTitlesMetadata
Genre
MovieGenreLine
MovieCharactersMetadata
MovieConversation
MovieLine
RawScriptUrl
Corpus Problem
This is memo when I dealt with corpus problems.
movie_titles_metadata.txt
I ignored an alphabet following year.
for example, line 34, 1989/I
I adjust title data for Acute accent manually.
line 115, léon
I ignored duplication for genre data.
line 58, ['horror', 'mystery', 'mystery', 'sci-fi', 'sci-fi']
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Close
Hashes for cornel-movie-dialogs-corpus-storm-0.1.0.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6513bb5b7e3bd15b0086e84fa392b1ee5bc4d12e3d2735217ac29b8ad7c313ec |
|
MD5 | f85bd7cd4db91eb7e9979c2b0cbd600a |
|
BLAKE2b-256 | d1596e7d97aefc6575217842f1936eeae39b9a052c646372ec08b012cfa23733 |