Skip to main content

corona chan scraper for gob mx

Project description

corona_chan_gob_mx

https://img.shields.io/pypi/v/corona_chan_gob_mx.svg https://img.shields.io/travis/dem4ply/corona_chan_gob_mx.svg Documentation Status

corona chan scraper for gob mx

Features

  • le los pdf publicados en casos de covid-19 en mexico

  • transforma las tablas de los pdf en listas de dicionarios para poder ser procesadas en python de manera mas facil

How to use

el uso basico seria con

import corona_chan_gob_mx import get_today_cases()
table = get_today_cases()
for row in table:
        assert isinstance( row, dict )

se puede adquirir la lista de pdfs con

import corona_chan_gob_mx.scraper import list_of_pdfs
links = list_of_pdfs.get()
print( links.native )
# [
#       'https://www.gob.mx/cms/uploads/attachment/file/544087/'
#       'Tabla_casos_sospechosos_COVID-19_2020.03.29.pdf',
#       'https://www.gob.mx/cms/uploads/attachment/file/544086/'
#       'Tabla_casos_positivos_COVID-19_resultado_InDRE_2020.03.29.pdf' ]
for link in links.native:
        table = link.get()
        for row in table.native:
                assert isinstance( row, dict )

para leer pdf sin descargarlos

import corona_chan_gob_mx.scraper import pdf_to_dicts
tabla = pdf_to_dict( "/path/to/pdf/tabla.pdf" )
for row in table:
        assert isinstance( row, dict )

History

0.0.1 (2020-03-28)

  • First release on PyPI.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

corona_chan_gob_mx-1.0.0.tar.gz (13.7 MB view hashes)

Uploaded Source

Built Distribution

corona_chan_gob_mx-1.0.0-py2.py3-none-any.whl (4.7 kB view hashes)

Uploaded Python 2 Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page