Skip to main content

Add your description here

Project description

Mark Scraper

CLI to scrape webpages and return them in markdown format

mark-scraper http://example.com
# Example Domain
# ==============

# This domain is for use in illustrative examples in documents. You may use this
# domain in literature without prior coordination or asking for permission.

# [More information...](https://www.iana.org/domains/example)

Installation

pipx install mark-scraper
# or
uv tool install mark-scraper

TODO

Here is a list of features I would like to add to this project

  • Also save html
    • mark-scraper -h/--html example.com.html http://example.com > example.com.md
  • Take a screen shot of web page
    • mark-scraper -s/--screenshot screenshot.png http://example.com > example.com.md
  • Add Page Caching
    • Will return from cache if exists mark-scraper -c/--cache http://example.com > example.com.md
    • Bust cache mark-scraper -b/--bust-cache http://example.com > example.com.md
    • Clear cache rm -rf .mark-scraper/cache
  • Add log of pages scrapped

Development

uv sync
uv run pytest
uv run mark-scraper http://example.com

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mark_scraper-0.1.3.tar.gz (4.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mark_scraper-0.1.3-py3-none-any.whl (4.2 kB view details)

Uploaded Python 3

File details

Details for the file mark_scraper-0.1.3.tar.gz.

File metadata

  • Download URL: mark_scraper-0.1.3.tar.gz
  • Upload date:
  • Size: 4.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.5.13

File hashes

Hashes for mark_scraper-0.1.3.tar.gz
Algorithm Hash digest
SHA256 48208331b0df4a2fc802d5e138018ca63a9ab6c393508b63887df6b520f03d5b
MD5 12f0c9796c7e03c3ac5a321d3dcca353
BLAKE2b-256 bf49d495675188fccf048bdaf763a3a066978cf497f765108ef48a27a2c720e6

See more details on using hashes here.

File details

Details for the file mark_scraper-0.1.3-py3-none-any.whl.

File metadata

File hashes

Hashes for mark_scraper-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 b9d761477c012f757551b349cf63ef820efe97d593efaa6635d2391c77f4540d
MD5 1a8aefc6824fb73eb7109d59547e429b
BLAKE2b-256 6c1edfd697b77ee67595567440e09695189f1907adfca38f4b94acaf3d0801f4

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page