Skip to main content

Add your description here

Project description

Mark Scraper

CLI to scrape webpages and return them in markdown format

mark-scraper http://example.com
# Example Domain
# ==============

# This domain is for use in illustrative examples in documents. You may use this
# domain in literature without prior coordination or asking for permission.

# [More information...](https://www.iana.org/domains/example)

TODO

Here is a list of features I would like to add to this project

  • Also save html
    • mark-scraper -h/--html example.com.html http://example.com > example.com.md
  • Take a screen shot of web page
    • mark-scraper -s/--screenshot screenshot.png http://example.com > example.com.md
  • Add Page Caching
    • Will return from cache if exists mark-scraper -c/--cache http://example.com > example.com.md
    • Bust cache mark-scraper -b/--bust-cache http://example.com > example.com.md
    • Clear cache rm -rf .mark-scraper/cache
  • Add log of pages scrapped

Development

uv sync
uv run pytest
uv run mark-scraper http://example.com

Publishing

uv build

# UV_PUBLISH_TOKEN required
uv publish

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mark_scraper-0.1.0.tar.gz (3.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mark_scraper-0.1.0-py3-none-any.whl (4.0 kB view details)

Uploaded Python 3

File details

Details for the file mark_scraper-0.1.0.tar.gz.

File metadata

  • Download URL: mark_scraper-0.1.0.tar.gz
  • Upload date:
  • Size: 3.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.5.13

File hashes

Hashes for mark_scraper-0.1.0.tar.gz
Algorithm Hash digest
SHA256 5aab819bc614614a81e32c5b075cba431c8370c073b18d3652e8a7169700a290
MD5 7b6e99d3e7f741135a6fe06c2626d81f
BLAKE2b-256 ab1e00ca0acc8e298c4674ad5535fff4dcee79963a5ba10704a2a03028023cea

See more details on using hashes here.

File details

Details for the file mark_scraper-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for mark_scraper-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 97914956302d03c16fe3317666f3fb9a8b5eb16f1210e9d1781f8671491e824d
MD5 52ec127cfcf63aea75618c9d03f6f7c8
BLAKE2b-256 6d853241942106578f3a15fb2a5ab6c5f5b13aa5697865519d5eff478b1436c7

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page