Package to get content data from Wikipedia and IMDb

Project description

ContentData

by Hunter Kempf

Get Content Data from Wikipedia and IMDb

Get Data from Wikipedia Example:

from ContentData.WikiScrape import Company

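# Wikipedia article that contains the tables to scrape
url = 'https://en.wikipedia.org/wiki/List_of_original_programs_distributed_by_Netflix'
# Names of the tables to collect
table_names = ["Drama",
               "Marvel series",
               "Comedy",
               "Animation",
               "Anime",
               "Children's programming - Animation",
               "Children's programming - Musical shorts",
               "Children's programming - Live action"]
# One index per table name: 0, 1, ..., len(table_names) - 1
table_indexes = range(len(table_names))

# Create the Company object and retrieve its data as a dataframe
NFLX = Company("Netflix", url, table_names, table_indexes)
NFLX.get_dataframe()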
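
The example above passes two parallel arguments, table_names and table_indexes. A minimal sketch of how they line up is shown here, using a shortened list. It does not call ContentData at all; name_by_index is a hypothetical helper name, and whether Company pairs the two arguments positionally like this internally is an assumption.

```python
# Hypothetical illustration: shows only how the two argument lists line up.
# It does not use ContentData and makes no claim about its internals.
table_names = ["Drama", "Marvel series", "Comedy"]
table_indexes = range(0, len(table_names))

# Pair each index with the name at the same position.
name_by_index = dict(zip(table_indexes, table_names))

for index, name in name_by_index.items():
    print(index, name)
```

Because the indexes are generated from the length of table_names, adding or removing a name keeps the two arguments the same length.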

Get Data from IMDb Example:

from ContentData.IMDbScrape import imdb_data

IMDb = imdb_data()
IMDb.get_list_mixed_data(["House of Cards","Hemlock Grove"])

The WikiScrape module currently handles only tabular data, like the tables in the Netflix example. It could be extended to handle list-style data, such as the HBO Wikipedia article.
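
To make the difference between tabular and list-style data concrete, here is a rough sketch that reads both from a small inline HTML fragment using only the Python standard library. It is not how WikiScrape is implemented; the class TableAndListParser, the function parse_fragment, and the sample HTML (including the placeholder show names in the list) are all made up for illustration.

```python
from html.parser import HTMLParser


class TableAndListParser(HTMLParser):
    """Hypothetical helper: collects <tr> rows and <li> items from HTML."""

    def __init__(self):
        super().__init__()
        self.rows = []     # one list of cell strings per <tr>
        self.items = []    # one string per <li>
        self._row = None   # cells of the <tr> currently being read
        self._text = None  # text chunks of the current cell or list item

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th", "li"):
            self._text = []

    def handle_data(self, data):
        # Only keep text that sits inside a cell or a list item.
        if self._text is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._row is not None and self._text is not None:
            self._row.append("".join(self._text).strip())
            self._text = None
        elif tag == "li" and self._text is not None:
            self.items.append("".join(self._text).strip())
            self._text = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None


# Made-up sample fragment: one table and one bulleted list.
SAMPLE_HTML = """
<table>
  <tr><th>Title</th><th>Genre</th></tr>
  <tr><td>House of Cards</td><td>Political drama</td></tr>
  <tr><td>Hemlock Grove</td><td>Horror</td></tr>
</table>
<ul>
  <li>Example Show A</li>
  <li>Example Show B</li>
</ul>
"""


def parse_fragment(html):
    """Return (table rows, list items) found in the HTML string."""
    parser = TableAndListParser()
    parser.feed(html)
    parser.close()
    return parser.rows, parser.items


rows, items = parse_fragment(SAMPLE_HTML)
header, records = rows[0], rows[1:]
```

Table rows arrive with a header and aligned columns, so they map naturally onto a dataframe. List items are bare strings with no column names, which is why list-style articles would need separate handling.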

Source Distribution

hunterk-0.0.1.tar.gz (3.6 kB view details)

Uploaded Source

File details

Details for the file hunterk-0.0.1.tar.gz.

File metadata

  • Download URL: hunterk-0.0.1.tar.gz
  • Upload date:
  • Size: 3.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.21.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.7.3

File hashes

Hashes for hunterk-0.0.1.tar.gz

Algorithm     Hash digest
SHA256        ee26a277b5a3cdaa52ce866c267dde71ad1fe501d061c270a6fe3b2d10bac6fa
MD5           307cc2313b59965e585f91c4dc03bf82
BLAKE2b-256   de7d731c118bbd2514f57da8f8e30e956a5560e1f63d39de2278fbebb0a4599d