# scrape
## a webpage scraping tool
## Installation

* pip install scrape
## Usage

```
usage: scrape.py [-h] [-c [CRAWL [CRAWL ...]]] [-ca] [-l LIMIT] [-t]
                 url [keywords [keywords ...]]

a webpage scraping tool

positional arguments:
  url                   url to scrape
  keywords              keywords to search

optional arguments:
  -h, --help            show this help message and exit
  -c [CRAWL [CRAWL ...]], --crawl [CRAWL [CRAWL ...]]
                        crawl links based on these keywords
  -ca, --crawl-all      crawl all links
  -l LIMIT, --limit LIMIT
                        crawl page limit
  -t, --text            write to text instead of pdf
```
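For example, a minimal invocation (the URL and keywords here are placeholders) saves the page as a PDF, while adding keywords together with the --text flag saves only the matching lines instead:

```
scrape.py https://example.com
scrape.py https://example.com python tutorial --text
```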
## Author

* Hunter Hammond (huntrar@gmail.com)
## Notes

* Unless the --text flag is specified, all webpages are saved as PDF files using pdfkit.
* Entering keyword arguments while using the --text flag allows users to save only lines matching one of the given keywords.
* You can crawl subsequent webpages by passing a substring of the URLs you wish to match using --crawl, or by using --crawl-all (see the example below).
* There is no limit to the number of pages crawled unless one is set using the --limit flag.
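For instance, a sketch of a crawl (the URL and the "blog" substring are placeholders) that follows links whose URLs contain "blog", stopping after 10 pages:

```
scrape.py https://example.com --crawl blog --limit 10
```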
## News

### 0.0.5

* Added --verbose argument for use with pdfkit
* Improved output file name processing

### 0.0.4

* Accepts 0 or 1 URLs, allowing a call with just --version

### 0.0.3

* Moved utils.py to scrape/

### 0.0.2

* First entry