Wiki Scraper

Scrape and analyze articles from minecraft.wiki.


Features

  • Summary – Get the first paragraph of an article.
  • Table Extraction – Extract tables and save them as CSV.
  • Count Words – Count words in an article and aggregate results in word-counts.json.
  • Analyze Relative Word Frequency – Compare word frequencies across articles or the whole language.
  • Auto Count Words – Traverse links automatically and count words in articles.
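
The word-counting and relative-frequency features above can be illustrated with a short sketch. This is not the package's actual API: the function names `count_words`, `update_word_counts`, and `relative_frequency`, the tokenization rule, and the shape of the reference frequency table are all assumptions made for illustration. Only the idea of aggregating counts into a JSON file such as word-counts.json comes from the feature list.

```python
import json
import re
from collections import Counter
from pathlib import Path

# Assumed tokenization: runs of letters, optionally joined by one apostrophe.
WORD_RE = re.compile(r"[^\W\d_]+(?:'[^\W\d_]+)?")


def count_words(text: str) -> Counter:
    """Count lowercase word tokens in plain article text (hypothetical helper)."""
    return Counter(match.group(0).lower() for match in WORD_RE.finditer(text))


def update_word_counts(text: str, path: Path) -> dict:
    """Add the counts for `text` to the running totals stored as JSON at `path`."""
    totals = Counter()
    if path.exists():
        # Load previously aggregated counts, if any.
        totals.update(json.loads(path.read_text(encoding="utf-8")))
    totals.update(count_words(text))
    path.write_text(
        json.dumps(totals, ensure_ascii=False, indent=2), encoding="utf-8"
    )
    return dict(totals)


def relative_frequency(article_counts: dict, language_freq: dict) -> dict:
    """Divide each word's share of the article text by its share in a reference table.

    `language_freq` is assumed to map words to frequencies between 0 and 1.
    Words missing from the table (or with frequency 0) are skipped.
    """
    total = sum(article_counts.values())
    return {
        word: (count / total) / language_freq[word]
        for word, count in article_counts.items()
        if language_freq.get(word)
    }
```

Under these assumptions, a ratio above 1 means a word appears more often in the scraped articles than in the reference table, and a ratio below 1 means it appears less often.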

Installation

Requires Python 3.13 or later. From the root of the source tree, install into a virtual environment (recommended):

pip install .
