Skip to main content

Add your description here

Project description

Mark Scraper

CLI to scrape webpages and return them in markdown format

mark-scraper http://example.com
# Example Domain
# ==============

# This domain is for use in illustrative examples in documents. You may use this
# domain in literature without prior coordination or asking for permission.

# [More information...](https://www.iana.org/domains/example)

Installation

pipx install mark-scraper
# or
uv tool install mark-scraper

TODO

Here is a list of features I would like to add to this project

  • Also save html
    • mark-scraper -h/--html example.com.html http://example.com > example.com.md
  • Take a screen shot of web page
    • mark-scraper -s/--screenshot screenshot.png http://example.com > example.com.md
  • Add Page Caching
    • Will return from cache if exists mark-scraper -c/--cache http://example.com > example.com.md
    • Bust cache mark-scraper -b/--bust-cache http://example.com > example.com.md
    • Clear cache rm -rf .mark-scraper/cache
  • Add log of pages scrapped

Development

uv sync
uv run pytest
uv run mark-scraper http://example.com

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mark_scraper-0.2.0.tar.gz (4.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mark_scraper-0.2.0-py3-none-any.whl (4.2 kB view details)

Uploaded Python 3

File details

Details for the file mark_scraper-0.2.0.tar.gz.

File metadata

  • Download URL: mark_scraper-0.2.0.tar.gz
  • Upload date:
  • Size: 4.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.5.13

File hashes

Hashes for mark_scraper-0.2.0.tar.gz
Algorithm Hash digest
SHA256 3ca743c2e7e62d50d8c3f6b2a2487edb9cc0028fb6ea7f8c6a90eabf32a4a497
MD5 046ccc7e10bb484028e7ed57e846a82b
BLAKE2b-256 1ee7a5827097697242542e25a4ff1e5a3e1f0fecf805aa6043cf3abcabe89f50

See more details on using hashes here.

File details

Details for the file mark_scraper-0.2.0-py3-none-any.whl.

File metadata

File hashes

Hashes for mark_scraper-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 9a111302354b12e69be4c6c07226bbcd66b723e942399c63f7c1ea57c71cb1c3
MD5 7f925c0b6074c44ff928f52f632100f2
BLAKE2b-256 b7fac2c66892dd17f1379aa0e2d42747f085534eee9346fe7085fd2210ea0fdc

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page