This package extracts/parses information from source HTML.
# HTML Parser
Extracts and parses information from source HTML.
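The package's Python API is not documented here, so the following is only a minimal sketch of what "extracting information from source HTML" can look like. It deliberately uses the standard-library `html.parser` module instead of BeautifulSoup so that it runs without extra dependencies. The class `LinkAndTitleExtractor`, the function `extract`, and the `SAMPLE` markup are all hypothetical names made up for illustration, not part of this package.

```python
from html.parser import HTMLParser


class LinkAndTitleExtractor(HTMLParser):
    """Hypothetical helper: collects the page title and (href, text) pairs."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.links = []          # list of (href, link text) tuples
        self._in_title = False   # True while inside <title>...</title>
        self._href = None        # href of the <a> currently being read
        self._text = []          # text fragments of the current <a>

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._in_title:
            self.title += data
        elif self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False
        elif tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def extract(html):
    """Parse an HTML string and return its title and links."""
    parser = LinkAndTitleExtractor()
    parser.feed(html)
    parser.close()
    return {"title": parser.title.strip(), "links": parser.links}


# Made-up sample markup, used only to exercise the sketch.
SAMPLE = (
    "<html><head><title> Investors </title></head>"
    "<body><a href='/reports.html'>Annual <b>reports</b></a></body></html>"
)

if __name__ == "__main__":
    print(extract(SAMPLE))
```

Anchors without an `href` attribute are skipped in this sketch; a real parser would likely need to handle malformed markup more carefully, which is one reason to use BeautifulSoup.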
# build a PyPI package
python3 setup.py sdist bdist_wheel
twine upload dist/*
# install package
python3 -m pip install htmlparsingbs4based
# install the CLI from a local dist
python3 -m pip install /home/yaxiong/html_parsing/dist/htmlparsingbs4based-0.0.8.tar.gz
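After either install command, it can be useful to confirm which version ended up in the environment. A small sketch using the standard-library `importlib.metadata` module (Python 3.8+) follows; the helper name `installed_version` is hypothetical.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(distribution):
    """Return the installed version string, or None if it is not installed."""
    try:
        return version(distribution)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    # Prints the version if the package is installed, otherwise None.
    print(installed_version("htmlparsingbs4based"))
```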
# run CLI
mode1: elasticsearch
PARSE -i 'http://www.mineracamargo.com/MCA_Investors.html' -gpf elasticsearch -esusr readwrite -espw ''
mode2: local
PARSE -i 'http://www.mineracamargo.com/MCA_Investors.html' -f /home/yaxiong/crawled_websites2
# download files
Source distribution: htmlparsingbs4based-0.1.0.tar.gz (55.9 kB)
Built distribution: htmlparsingbs4based-0.1.0-py3-none-any.whl
Hashes for htmlparsingbs4based-0.1.0.tar.gz
Algorithm | Hash digest
---|---
SHA256 | 3259d5ec977d2729fb9325394154e99fe31b65dcb5cca73be2e639e2777b5de9
MD5 | 10b2390b2f5f5852921ea13f37f5775e
BLAKE2b-256 | 3ddd91948699e3d80d0e60b13cefc5acd844d8099ed25ea8f4f383585fd7db3c

Hashes for htmlparsingbs4based-0.1.0-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | 75989ad685396d48d70c6e892455b3a378d5dd8dc157fd8eb9f5072563d83f05
MD5 | d457c94d3dceeb0e5a9470fe2d7b13ac
BLAKE2b-256 | 533bfe8eae21ef41964e579c108496b580b90ebf23731adadf91cb4189c42049
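
The published digests can be used to check a downloaded file before installing it. The sketch below shows one way to do that with the standard-library `hashlib` module. The function names `file_digest` and `matches_published_digest` are hypothetical, and the sketch covers SHA256 and MD5 only; BLAKE2b-256 needs `hashlib.blake2b(digest_size=32)` rather than `hashlib.new`.

```python
import hashlib


def file_digest(path, algorithm="sha256", chunk_size=65536):
    """Return the hex digest of a file, read in chunks to limit memory use."""
    hasher = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest()


def matches_published_digest(path, expected_hex, algorithm="sha256"):
    """Compare a file's digest with a published one, ignoring case and spaces."""
    return file_digest(path, algorithm) == expected_hex.strip().lower()
```

For example, assuming the source distribution was downloaded into the current directory, `matches_published_digest("htmlparsingbs4based-0.1.0.tar.gz", "3259d5ec977d2729fb9325394154e99fe31b65dcb5cca73be2e639e2777b5de9")` should return `True` for an intact file.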