Skip to main content

Misc python tools

Project description

# avoin Python code for data scraping (c) 2012 Ville Korhonen (ville@xd.fi), GPLv3

## Installation You may install directly from PyPI using:

pip install avoin

or latest HEAD via git:

pip install git+https://github.com/ypcs/avoin.git#egg=avoin

## Using (via Python Shell)
>>> from avoin.scraper.scraper import DefaultScraper, html_title_parser
>>> scraper = DefaultScraper()
>>> url = 'http://www.google.com/'
>>> result = scraper.parse(url, parser=html_title_parser)
>>> print result
Google
>>> result = scraper.parse(url, parser=html_title_parser, format='json')
>>> print result
"Google"

## Using (via command line) Fetch all URLs from webpage as plaintext

avoin xpath –url ‘http://ypcs.fi/’ –xpath ‘//a/@href’ –format text

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

avoin-0.0.9.dev-20121012.tar.gz (5.5 kB view details)

Uploaded Source

File details

Details for the file avoin-0.0.9.dev-20121012.tar.gz.

File metadata

File hashes

Hashes for avoin-0.0.9.dev-20121012.tar.gz
Algorithm Hash digest
SHA256 151eba5a9998714df68860d418668e1ceb987339fb447625886163e4e6ef3f2a
MD5 2048a28153d81c84ee6e4e4df6f3b6f4
BLAKE2b-256 97968de5ce8b76a3f549b6eb21deb0b70c1cc92fac39c40ad662b85f0fd60bb6

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page