corona chan scraper for gob mx
Project description
corona_chan_gob_mx
corona chan scraper for gob mx
Free software: WTFPL
Documentation: https://corona-chan-gob-mx.readthedocs.io.
Features
le los pdf publicados en casos de covid-19 en mexico
transforma las tablas de los pdf en listas de dicionarios para poder ser procesadas en python de manera mas facil
How to use
el uso basico seria con
import corona_chan_gob_mx import get_today_cases()
table = get_today_cases()
for row in table:
assert isinstance( row, dict )
se puede adquirir la lista de pdfs con
import corona_chan_gob_mx.scraper import list_of_pdfs
links = list_of_pdfs.get()
print( links.native )
# [
# 'https://www.gob.mx/cms/uploads/attachment/file/544087/'
# 'Tabla_casos_sospechosos_COVID-19_2020.03.29.pdf',
# 'https://www.gob.mx/cms/uploads/attachment/file/544086/'
# 'Tabla_casos_positivos_COVID-19_resultado_InDRE_2020.03.29.pdf' ]
for link in links.native:
table = link.get()
for row in table.native:
assert isinstance( row, dict )
para leer pdf sin descargarlos
import corona_chan_gob_mx.scraper import pdf_to_dicts
tabla = pdf_to_dict( "/path/to/pdf/tabla.pdf" )
for row in table:
assert isinstance( row, dict )
History
0.0.1 (2020-03-28)
First release on PyPI.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
corona_chan_gob_mx-1.0.0.tar.gz
(13.7 MB
view hashes)
Built Distribution
Close
Hashes for corona_chan_gob_mx-1.0.0-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | fb651c75a273c7d3a205b55ab4b8eaf345a7bc5c9d3467f47d84301e6f3d7855 |
|
MD5 | 84b29551c20747c5ffa55cb02358fe29 |
|
BLAKE2b-256 | 56a697dd2d9224640d8566f0ccfe260de3ed831cc92ec005acf83518434f5f06 |