Mark Scraper

A CLI that scrapes a web page and prints it to standard output as Markdown.

mark-scraper http://example.com
# Example Domain
# ==============

# This domain is for use in illustrative examples in documents. You may use this
# domain in literature without prior coordination or asking for permission.

# [More information...](https://www.iana.org/domains/example)
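
Because the Markdown goes to standard output, the command can also be driven from a script. The following is a minimal sketch, assuming `mark-scraper` is installed and on the PATH; the helper names `build_command` and `scrape_to_file` are hypothetical and not part of this project.

```python
import subprocess
from pathlib import Path


def build_command(url: str, executable: str = "mark-scraper") -> list[str]:
    """Build the argument list for one scrape; only the URL argument is used."""
    return [executable, url]


def scrape_to_file(url: str, destination: Path, executable: str = "mark-scraper") -> Path:
    """Run the CLI and write whatever it prints on stdout to destination.

    Hypothetical helper: it assumes the CLI exits with a non-zero status on
    failure, in which case check=True raises CalledProcessError.
    """
    result = subprocess.run(
        build_command(url, executable),
        capture_output=True,
        encoding="utf-8",
        check=True,
    )
    destination.write_text(result.stdout, encoding="utf-8")
    return destination
```

Calling `scrape_to_file("http://example.com", Path("example.com.md"))` would then have the same effect as redirecting the command's output to `example.com.md` in a shell.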

Installation

pipx install mark-scraper
# or
uv tool install mark-scraper

TODO

Features I would like to add to this project:

  • Also save HTML
    • mark-scraper -h/--html example.com.html http://example.com > example.com.md
  • Take a screenshot of the web page
    • mark-scraper -s/--screenshot screenshot.png http://example.com > example.com.md
  • Add page caching
    • Return from cache if it exists: mark-scraper -c/--cache http://example.com > example.com.md
    • Bust cache: mark-scraper -b/--bust-cache http://example.com > example.com.md
    • Clear cache: rm -rf .mark-scraper/cache
  • Add a log of pages scraped
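
The page caching item is not implemented yet, so the following is only a sketch of one way it could behave. The cache directory `.mark-scraper/cache` comes from the list above; the SHA-256 file naming, the function names, and the `fetch` callback are all assumptions made for illustration.

```python
import hashlib
from pathlib import Path
from typing import Callable

# Directory named in the TODO list; everything else below is hypothetical.
CACHE_DIR = Path(".mark-scraper/cache")


def cache_path(url: str, cache_dir: Path = CACHE_DIR) -> Path:
    """Map a URL to a cache file by hashing it (an assumed naming scheme)."""
    digest = hashlib.sha256(url.encode("utf-8")).hexdigest()
    return cache_dir / f"{digest}.md"


def get_markdown(
    url: str,
    fetch: Callable[[str], str],
    use_cache: bool = False,
    bust_cache: bool = False,
    cache_dir: Path = CACHE_DIR,
) -> str:
    """Return Markdown for url, reading or refreshing the cache as requested.

    use_cache stands in for the proposed -c/--cache flag and
    bust_cache for the proposed -b/--bust-cache flag.
    """
    path = cache_path(url, cache_dir)
    # Serve the stored copy only when caching is on and no refresh was asked for.
    if use_cache and not bust_cache and path.exists():
        return path.read_text(encoding="utf-8")
    markdown = fetch(url)
    # Store the fresh result whenever either cache flag is set.
    if use_cache or bust_cache:
        path.parent.mkdir(parents=True, exist_ok=True)
        path.write_text(markdown, encoding="utf-8")
    return markdown
```

Passing the scraper in as `fetch` keeps the cache logic independent of how a page is actually downloaded and converted, which also makes it easy to exercise with a stub in tests.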

Development

uv sync
uv run pytest
uv run mark-scraper http://example.com
