A small example package for scraping Argentine news portals
Project description
NSCRAPY: Scraper for the main Argentine news portals.
Library for downloading articles from Clarín, Página 12, Crónica, and El Cronista.
Installation:
pip install nscrapy
Classes for scraping the URL of an article:
clarin()
p12()
cronica()
cronista()
Usage example: scraping today's Clarín portal (for Página 12, replace clarin with p12), with comments:
from nscrapy import nscrap as ns
from nscrapy.clarin import get
notasclarin = ns.clarin()
# scrape the portal
notasclarin.hoy()
# the URLs of today's articles are now in .urls
notashoy = get(notasclarin.urls)
notashoy is a list of nota (article) objects:
nota.titulo : article title
nota.comm : reader comments
nota.com : all comments joined into one string, ' '.join(comm)
nota.volanta : kicker (the short line above the headline)
nota.bajada : summary or subtitle
nota.cuerpo : article body
nota.bolds : text fragments set in bold
nota.bold : all bold fragments joined into one string, ' '.join(bolds)
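The attribute list above can be put to work with a small summary helper. The sketch below is not part of nscrapy: `Nota` is a hypothetical stand-in that only mirrors the documented attribute names (assuming `comm` and `bolds` are lists of strings), and `summarize` is an illustrative helper. It lets the example run without network access.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Nota:
    """Hypothetical stand-in with the attribute names documented above."""
    titulo: str = ""
    volanta: str = ""
    bajada: str = ""
    cuerpo: str = ""
    comm: List[str] = field(default_factory=list)   # assumed: list of comment strings
    bolds: List[str] = field(default_factory=list)  # assumed: list of bold fragments

    @property
    def com(self) -> str:
        # Mirrors the documented ' '.join over the comments
        return " ".join(self.comm)

    @property
    def bold(self) -> str:
        # Mirrors the documented ' '.join over the bold fragments
        return " ".join(self.bolds)


def summarize(notas) -> List[Dict[str, object]]:
    """Build one summary row per article, reading only documented attributes."""
    return [
        {
            "titulo": n.titulo,
            "n_comments": len(n.comm),
            "cuerpo_words": len(n.cuerpo.split()),
            "bold": n.bold,
        }
        for n in notas
    ]


# Sample data standing in for the list returned by get(...)
sample = [
    Nota(
        titulo="Titulo de ejemplo",
        cuerpo="uno dos tres",
        comm=["hola", "chau"],
        bolds=["uno", "tres"],
    )
]
rows = summarize(sample)
```

If the real nota objects expose the attributes as listed, `summarize(notashoy)` should work on them unchanged, since the helper only reads `titulo`, `comm`, `cuerpo`, and `bold`.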
Download files
File details
Details for the file nscrapy-0.0.3.tar.gz.
File metadata
- Download URL: nscrapy-0.0.3.tar.gz
- Upload date:
- Size: 7.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/0.0.0 pkginfo/1.6.0 requests/2.24.0 setuptools/49.6.0.post20200814 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.7.4
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 58fb58d47169e87c4ad67f063f0a66c6c732a6e514518cd6197fb8160c5026e8 |
| MD5 | fb49a2f7168d3bd2d906303268c24444 |
| BLAKE2b-256 | 0db0ec0e11b80f86f9341c21b1c7fc0d392990c6a1b7aca0dbc9ef03c0def1a4 |
File details
Details for the file nscrapy-0.0.3-py3-none-any.whl.
File metadata
- Download URL: nscrapy-0.0.3-py3-none-any.whl
- Upload date:
- Size: 12.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/0.0.0 pkginfo/1.6.0 requests/2.24.0 setuptools/49.6.0.post20200814 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.7.4
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 18df76a4d85f348b5cb437eed518037d1c9bcd8f51098afa6e0563f884192014 |
| MD5 | 72f97c503f287491c7fa8f39ce2fe8f6 |
| BLAKE2b-256 | 803bd030ca496948855f5ba287314c5d20863e44f69ca96d457a9cb8309f4f58 |
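A downloaded file can be checked against the SHA256 digests listed above. This is a minimal sketch using only the standard library `hashlib`. The helper names `sha256_of` and `matches_published` are illustrative and are not part of nscrapy or pip.

```python
import hashlib


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of the file at `path`, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # iter() with a sentinel stops once read() returns empty bytes
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published(path, expected_hex):
    """Compare a local file's digest with a published hex digest."""
    return sha256_of(path) == expected_hex.strip().lower()
```

For the source distribution, this would be called as `matches_published("nscrapy-0.0.3.tar.gz", "58fb58d47169e87c4ad67f063f0a66c6c732a6e514518cd6197fb8160c5026e8")`, assuming the file has been downloaded to the current directory.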