Mark Scraper

A CLI that scrapes a web page and prints it in Markdown format.

mark-scraper http://example.com
# Example Domain
# ==============

# This domain is for use in illustrative examples in documents. You may use this
# domain in literature without prior coordination or asking for permission.

# [More information...](https://www.iana.org/domains/example)
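
Because the command writes its Markdown to standard output, it can also be driven from a script. The following is a minimal sketch, assuming `mark-scraper` is installed and on `PATH`. The helper name `scrape_to_markdown` and its `runner` parameter are illustrative and not part of the package.

```python
import subprocess


def scrape_to_markdown(url, runner=subprocess.run):
    """Run `mark-scraper URL` and return whatever it prints to stdout.

    `runner` is a hypothetical injection point so the helper can be
    exercised without network access; it defaults to subprocess.run.
    """
    result = runner(
        ["mark-scraper", url],
        capture_output=True,  # collect stdout instead of printing it
        text=True,            # decode bytes to str
        check=True,           # raise CalledProcessError on a non-zero exit
    )
    return result.stdout
```

Calling `scrape_to_markdown("http://example.com")` should then return the same text the command prints in the example above.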

Installation

pipx install mark-scraper
# or
uv tool install mark-scraper

TODO

Features I would like to add to this project:

  • Also save html
    • mark-scraper -h/--html example.com.html http://example.com > example.com.md
  • Take a screenshot of the web page
    • mark-scraper -s/--screenshot screenshot.png http://example.com > example.com.md
  • Add Page Caching
    • Return the page from the cache if it exists: mark-scraper -c/--cache http://example.com > example.com.md
    • Bust the cache: mark-scraper -b/--bust-cache http://example.com > example.com.md
    • Clear the cache: rm -rf .mark-scraper/cache
  • Add a log of pages scraped
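
The page caching item could be sketched roughly as follows. This is not the project's implementation, since the feature is still on the TODO list. Only the `.mark-scraper/cache` directory comes from the list above. The SHA-256 file naming, the function names, and the `fetch` callable standing in for the real scraping step are all assumptions.

```python
import hashlib
from pathlib import Path

# Directory taken from the TODO list; everything else here is hypothetical.
CACHE_DIR = Path(".mark-scraper/cache")


def cache_path(url, cache_dir=CACHE_DIR):
    """Map a URL to a cache file named after its SHA-256 digest (assumed scheme)."""
    digest = hashlib.sha256(url.encode("utf-8")).hexdigest()
    return Path(cache_dir) / f"{digest}.md"


def get_markdown(url, fetch, use_cache=True, bust_cache=False, cache_dir=CACHE_DIR):
    """Return Markdown for `url`, consulting the cache when asked.

    `fetch` is any callable that takes a URL and returns Markdown text.
    `use_cache` mirrors the planned -c/--cache flag and `bust_cache`
    mirrors -b/--bust-cache.
    """
    path = cache_path(url, cache_dir)
    if use_cache and not bust_cache and path.exists():
        return path.read_text(encoding="utf-8")  # cache hit
    markdown = fetch(url)  # cache miss, or the caller asked to bust the cache
    if use_cache or bust_cache:
        # Store (or overwrite) the entry so later calls can reuse it.
        path.parent.mkdir(parents=True, exist_ok=True)
        path.write_text(markdown, encoding="utf-8")
    return markdown
```

Hashing the URL keeps file names short and free of characters that are awkward on disk, at the cost of making cache entries unreadable by name. Busting the cache in this sketch refetches the page and overwrites the stored copy rather than deleting it.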

Development

uv sync
uv run pytest
uv run mark-scraper http://example.com


Download files

Source Distribution

mark_scraper-0.3.0.tar.gz (4.3 kB)

Uploaded Source

Built Distribution

mark_scraper-0.3.0-py3-none-any.whl (4.5 kB)

Uploaded Python 3

File details

Details for the file mark_scraper-0.3.0.tar.gz.

File metadata

  • Download URL: mark_scraper-0.3.0.tar.gz
  • Size: 4.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.5.13

File hashes

Hashes for mark_scraper-0.3.0.tar.gz
Algorithm    Hash digest
SHA256       071437cb301fa094817a58bd91cad97a960ba6be2fa924fd6d93e3447510f470
MD5          7306470b4df808fab79dc77ad9c702c2
BLAKE2b-256  596a75fd86066396855bcc4bdd11dab414ed4796b945623bfbadbebaed9ba2af

File details

Details for the file mark_scraper-0.3.0-py3-none-any.whl.

File hashes

Hashes for mark_scraper-0.3.0-py3-none-any.whl
Algorithm    Hash digest
SHA256       2825cc88f5724e13918c377967ae0aeec313e4263760eaa39fc052c94c402206
MD5          fb8927264d99cc04411382e6b990590f
BLAKE2b-256  b2f1738c2eb39149fd0f166d683ce838632fd2f22e298b2eeccee7fcff83e15f