A small web crawler for the pastebin.com website.
Project description
Simple-Pastebin-Parser
This is a simple parser for the pastebin.com website.
It iterates over posts and parses their elements using lxml.
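As a rough illustration of iterating posts with lxml, here is a minimal sketch. It does not use this package's own API, which is not documented here. The sample markup, the `maintable` class, and the `iter_posts` helper are all assumptions made up for the example, and the real site's HTML may differ.

```python
from lxml import html

# Made-up markup standing in for a listing page; the real HTML may differ.
SAMPLE_PAGE = """
<html><body>
<table class="maintable">
  <tr><th>Name</th><th>Posted</th></tr>
  <tr><td><a href="/abc12345">first paste</a></td><td>1 min ago</td></tr>
  <tr><td><a href="/xyz67890">second paste</a></td><td>2 min ago</td></tr>
</table>
</body></html>
"""


def iter_posts(page_source):
    """Yield (paste_id, title) for each row whose first cell holds a link.

    Hypothetical helper, not part of simple-pastebin-parser.
    """
    tree = html.fromstring(page_source)
    # The header row uses <th> cells, so this XPath skips it.
    for link in tree.xpath('//table[@class="maintable"]//tr/td[1]/a'):
        paste_id = link.get("href").lstrip("/")
        title = link.text_content().strip()
        yield paste_id, title


posts = list(iter_posts(SAMPLE_PAGE))
for paste_id, title in posts:
    print(paste_id, title)
```

Writing the helper as a generator keeps the parsing lazy, so a caller can stop after the first few posts without processing the whole page.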
Installation:
pip install simple-pastebin-parser
Release notes:
v0.1.0 - P.O.C
Initial proof of concept. Nothing special, just doing the dirty work of parsing the posts.
How to execute:
1. Create a Python 3.6 virtual environment.
2. Install the requirements.
3. Run python poc.py
v0.2.5 (2020-03-07)
integration with Travis CI
v0.2.6 (2020-03-07)
changed the POC code to work with the installed PyPI package
v0.3.0 (2020-03-07)
created the Paste() object for pastebin posts
ability to stream data
v0.3.3 (2020-03-07)
small fixes
v0.3.5 (2020-03-07)
update README
v0.4.0 (2020-03-08)
added documentation
cleaned up most PEP 8 issues
some tests
v0.5.0 (2020-03-08)
parse date in UTC
add some logs
add id to Paste()
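The release notes mention a Paste() object with an id, dates parsed in UTC, and the ability to stream data. The sketch below shows one way such pieces could fit together. It is not the package's actual implementation: the field names, the `to_utc` and `stream_pastes` helpers, and the input date format are all assumptions. It uses `dataclasses`, which needs Python 3.7 or later.

```python
from dataclasses import dataclass
from datetime import datetime, timezone


@dataclass
class Paste:
    """Hypothetical stand-in for the package's Paste() object."""
    id: str
    title: str
    date: datetime  # timezone-aware, normalised to UTC


def to_utc(raw, fmt="%Y-%m-%d %H:%M:%S %z"):
    """Parse a timestamp with a UTC offset and convert it to UTC.

    The format string is an assumption, not the site's real format.
    """
    return datetime.strptime(raw, fmt).astimezone(timezone.utc)


def stream_pastes(rows):
    """Lazily yield Paste objects from (id, title, raw_date) tuples."""
    for paste_id, title, raw_date in rows:
        yield Paste(id=paste_id, title=title, date=to_utc(raw_date))


rows = [("abc12345", "first paste", "2020-03-08 10:30:00 -0500")]
pastes = list(stream_pastes(rows))
print(pastes[0].date.isoformat())  # 2020-03-08T15:30:00+00:00
```

Storing every date in UTC means pastes collected from different time zones sort and compare consistently.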
History
0.1.0 (2020-03-07)
First release on PyPI.
Hashes for simple_pastebin_parser-0.5.0.tar.gz
Algorithm | Hash digest
---|---
SHA256 | 705d42b51309382985789a08326cd2c774b001525f238aa7de718fd55129b59e
MD5 | 96f076812d9f158a823b1f5d511931d8
BLAKE2b-256 | b7a6527df2b1031ee8565e2b223c0ce72c4707a834f0a942030d19443a02566f
Hashes for simple_pastebin_parser-0.5.0-py2.py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | b4a4ede8da97da84d65a84dc62ac60ee454de55d7948338559dc26abbd0aac9d
MD5 | 890573c168f6b511af7c156a59a8e576
BLAKE2b-256 | c0bcc79d3e104bc02e05a382cdb8a986ce8d6bc175abd6942d26e0acdb0da8fc