Mark Scraper

A CLI that scrapes web pages and returns them in Markdown format.

mark-scraper http://example.com
# Example Domain
# ==============

# This domain is for use in illustrative examples in documents. You may use this
# domain in literature without prior coordination or asking for permission.

# [More information...](https://www.iana.org/domains/example)
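To illustrate the kind of HTML-to-Markdown conversion shown above, here is a minimal sketch using only the Python standard library. It is not mark-scraper's actual implementation, which this README does not describe; the class `MarkdownSketch` and the function `html_to_markdown` are hypothetical names, and the sketch handles only `<h1>`, `<p>`, and `<a>` elements.

```python
from html.parser import HTMLParser


class MarkdownSketch(HTMLParser):
    """Hypothetical converter (an assumption, not mark-scraper's code).

    Handles only <h1>, <p>, and <a>; everything else is ignored.
    """

    def __init__(self):
        super().__init__()
        self.blocks = []       # finished Markdown blocks
        self.buffer = []       # text pieces of the block being built
        self.in_block = False  # True while inside <h1> or <p>
        self.href = None       # target of the currently open <a>, if any
        self.link_text = []    # text collected inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag in ("h1", "p"):
            self.in_block = True
            self.buffer = []
        elif tag == "a" and self.in_block:
            self.href = dict(attrs).get("href") or ""
            self.link_text = []

    def handle_data(self, data):
        if not self.in_block:
            return  # skip text outside headings and paragraphs (e.g. <title>)
        target = self.link_text if self.href is not None else self.buffer
        target.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self.href is not None:
            # Emit an inline Markdown link: [text](url)
            text = "".join(self.link_text)
            self.buffer.append(f"[{text}]({self.href})")
            self.href = None
        elif tag in ("h1", "p") and self.in_block:
            # Collapse runs of whitespace, as a browser would
            text = " ".join("".join(self.buffer).split())
            self.in_block = False
            self.buffer = []
            if not text:
                return
            if tag == "h1":
                # Setext-style heading: underline with '=' characters
                text = f"{text}\n{'=' * len(text)}"
            self.blocks.append(text)


def html_to_markdown(html: str) -> str:
    """Convert a small HTML document to Markdown text."""
    parser = MarkdownSketch()
    parser.feed(html)
    parser.close()
    return "\n\n".join(parser.blocks) + "\n"


if __name__ == "__main__":
    sample = (
        "<html><head><title>Example Domain</title></head><body>"
        "<h1>Example Domain</h1>"
        "<p>This domain is for use in illustrative examples.</p>"
        '<p><a href="https://www.iana.org/domains/example">'
        "More information...</a></p>"
        "</body></html>"
    )
    print(html_to_markdown(sample), end="")
```

A real converter would also need to cover lists, emphasis, images, code blocks, and nested markup, which is why a dedicated library is usually the better choice.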

Installation

pipx install mark-scraper
# or
uv tool install mark-scraper

TODO

Features I would like to add to this project:

  • Also save the HTML
    • mark-scraper -h/--html example.com.html http://example.com > example.com.md
  • Take a screenshot of the web page
    • mark-scraper -s/--screenshot screenshot.png http://example.com > example.com.md
  • Add page caching
    • Return from the cache if an entry exists: mark-scraper -c/--cache http://example.com > example.com.md
    • Bust the cache: mark-scraper -b/--bust-cache http://example.com > example.com.md
    • Clear the cache: rm -rf .mark-scraper/cache
  • Add a log of pages scraped
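The page-caching item above could be sketched roughly as follows. None of this exists in the project yet, so every name here is an assumption: `cache_path`, `scrape_cached`, the `fetch` callable, and the choice to key cache files by a SHA-256 hash of the URL are all hypothetical. Only the `.mark-scraper/cache` directory comes from the list above.

```python
import hashlib
from pathlib import Path
from typing import Callable

# Directory named in the TODO list; everything else below is an assumption.
CACHE_DIR = Path(".mark-scraper/cache")


def cache_path(url: str, cache_dir: Path = CACHE_DIR) -> Path:
    """Map a URL to a cache file. Hashing the URL is an assumed design choice
    that avoids characters such as '/' and ':' in file names."""
    digest = hashlib.sha256(url.encode("utf-8")).hexdigest()
    return cache_dir / f"{digest}.md"


def scrape_cached(
    url: str,
    fetch: Callable[[str], str],
    cache_dir: Path = CACHE_DIR,
    bust: bool = False,
) -> str:
    """Return cached Markdown for `url`, calling `fetch` only on a miss.

    `fetch` is a hypothetical callable that scrapes the URL and returns
    Markdown. Passing bust=True mirrors the proposed -b/--bust-cache flag:
    the cached copy is ignored and then overwritten.
    """
    path = cache_path(url, cache_dir)
    if path.exists() and not bust:
        return path.read_text(encoding="utf-8")
    markdown = fetch(url)
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(markdown, encoding="utf-8")
    return markdown
```

Keeping one file per URL means that clearing the cache stays as simple as the `rm -rf .mark-scraper/cache` command listed above.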

Development

uv sync
uv run pytest
uv run mark-scraper http://example.com
