a small web crawler for the pastebin.com website
Project description
Simple-Pastebin-Parser
this is a simpler parser for the pastebin.com website.
it will iterate posts and parse their elements using lxml
installation:
pip install simple-pastebin-parser
Release notes:
v0.1.0 - P.O.C
initial proof of concept. nothing special, just doing the dirty work of parsing the posts.
how to execute: 1. create a virtual env of python 3.6 2. install requirements 3. run python poc.py
v0.2.5 (2020-03-07)
integration with travis.ci
v0.2.6 (2020-03-07)
changing the POC code to work with installed pypi package
v0.3.0 (2020-03-07)
created the Post() object for pastebin posts
ability to stream data
v0.3.3 (2020-03-07)
small fixes
v0.3.5 (2020-03-07)
update README
History
0.1.0 (2020-03-07)
First release on PyPI.
0.2.5 (2020-03-07)
integration with travis.ci
0.2.6 (2020-03-07)
changing the POC code to work with installed pypi package
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for simple_pastebin_parser-0.4.0.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | c8dce3b781d5b052ad5ca70d14fa0edd2b690ff17446551af8a963b287d0b22e |
|
MD5 | 1a497f319785d9dc3197c8f3220f6c5c |
|
BLAKE2b-256 | d226d52c2438e6a7a469071eb292379774a8bcab46a2fe93c74f7d6b3fe4c064 |
Hashes for simple_pastebin_parser-0.4.0-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | ae4121352c8012fcd32b83c79a3741494f75ebac26610210cee64263de3f7aee |
|
MD5 | 274ca93ff6508bceae7500ba78398090 |
|
BLAKE2b-256 | 4af897f5d9f5ebc24b46b03d1d5859f23a42df92488a682c4fe0c318d8d05066 |