Mark Scraper

A CLI that scrapes web pages and returns them in Markdown format.

mark-scraper http://example.com
# Example Domain
# ==============

# This domain is for use in illustrative examples in documents. You may use this
# domain in literature without prior coordination or asking for permission.

# [More information...](https://www.iana.org/domains/example)
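
To make the output shape above concrete, here is a minimal sketch of an HTML-to-Markdown conversion that uses only the Python standard library. It is not mark-scraper's actual implementation, which this README does not describe; the class `MarkdownSketch` and the function `html_to_markdown` are hypothetical names, and only `h1`, `p`, and `a` tags are handled.

```python
from html.parser import HTMLParser


class MarkdownSketch(HTMLParser):
    """Hypothetical converter for a tiny HTML subset: h1, p, and a."""

    def __init__(self):
        super().__init__()
        self.blocks = []       # finished Markdown blocks
        self.text = []         # fragments of the block being built
        self.in_block = False  # True while inside <h1> or <p>
        self.href = None       # target of the currently open link, if any
        self.link_text = []    # label fragments of the currently open link

    def handle_starttag(self, tag, attrs):
        if tag in ("h1", "p"):
            self.in_block = True
            self.text = []
        elif tag == "a":
            self.href = dict(attrs).get("href")
            self.link_text = []

    def handle_data(self, data):
        if not self.in_block:
            return  # ignore text outside the supported block tags
        target = self.link_text if self.href is not None else self.text
        target.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self.href is not None:
            if self.in_block:
                label = "".join(self.link_text)
                self.text.append(f"[{label}]({self.href})")
            self.href = None
        elif tag in ("h1", "p") and self.in_block:
            self.in_block = False
            # Collapse runs of whitespace, as a browser would when rendering.
            content = " ".join("".join(self.text).split())
            if not content:
                return
            if tag == "h1":
                # Setext-style heading: underline the title with "=".
                content += "\n" + "=" * len(content)
            self.blocks.append(content)


def html_to_markdown(html: str) -> str:
    """Return Markdown for the supported subset of the given HTML."""
    parser = MarkdownSketch()
    parser.feed(html)
    parser.close()
    return "\n\n".join(parser.blocks) + "\n"


if __name__ == "__main__":
    page = (
        "<h1>Example Domain</h1>"
        '<p><a href="https://www.iana.org/domains/example">More information...</a></p>'
    )
    print(html_to_markdown(page), end="")
```

A real converter has to cover far more of HTML (lists, emphasis, nested markup, entities in attributes), so treat this only as an illustration of the heading and link syntax in the sample output.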

Installation

pipx install mark-scraper
# or
uv tool install mark-scraper

TODO

Features I would like to add to this project:

  • Also save the HTML
    • mark-scraper -h/--html example.com.html http://example.com > example.com.md
  • Take a screenshot of the web page
    • mark-scraper -s/--screenshot screenshot.png http://example.com > example.com.md
  • Add page caching
    • Return the page from the cache if it exists: mark-scraper -c/--cache http://example.com > example.com.md
    • Bust the cache: mark-scraper -b/--bust-cache http://example.com > example.com.md
    • Clear the cache: rm -rf .mark-scraper/cache
  • Add a log of pages scraped
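
The planned page caching could be sketched roughly as follows. This is only one possible design, since the feature does not exist yet: the function names `cache_path` and `scrape_cached`, the SHA-256 file naming, and the `fetch` callback are all assumptions. Only the `.mark-scraper/cache` directory comes from the list above.

```python
import hashlib
from pathlib import Path
from typing import Callable

# Cache directory named in the TODO list; everything else here is hypothetical.
CACHE_DIR = Path(".mark-scraper/cache")


def cache_path(url: str, cache_dir: Path = CACHE_DIR) -> Path:
    """Map a URL to a cache file; hashing keeps the file name filesystem-safe."""
    digest = hashlib.sha256(url.encode("utf-8")).hexdigest()
    return cache_dir / f"{digest}.md"


def scrape_cached(
    url: str,
    fetch: Callable[[str], str],
    cache_dir: Path = CACHE_DIR,
    bust: bool = False,
) -> str:
    """Return Markdown for url, reusing the cached copy unless bust is set.

    fetch stands in for the real scraping step: it takes a URL and
    returns the page as Markdown.
    """
    path = cache_path(url, cache_dir)
    if path.exists() and not bust:
        return path.read_text(encoding="utf-8")
    markdown = fetch(url)
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(markdown, encoding="utf-8")
    return markdown
```

In this sketch, `bust=True` plays the role of the proposed `-b/--bust-cache` flag: it skips the lookup and overwrites the stored copy. Deleting the cache directory, as in the `rm -rf` line above, clears everything at once.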

Development

uv sync
uv run pytest
uv run mark-scraper http://example.com
