Skip to main content

A small example package for scrapping argentinian newsletters

Project description

NSCRAPY: Scraper de principales portales de noticias argentinos.

Libreria para bajar notas de clarin , pagina12, cronica y cronista.

Instalacion:

pip install nscrapy

Clases para scrapear una url de una nota.

clarin()
p12()
cronica()
cronista()

Ejemplo de uso, scrapear el portal de clarin de hoy (para pagina 12 cambiar clarin por p12) con comentarios:

from nscrapy import nscrap as ns
from nscrapy.clarin import get

notasclarin=ns.clarin()

#scrapeo el portal
notasclarin.hoy()

#las urls a las notas de hoy estan en .urls

notashoy=get(notasclarin.urls)

notashoy es una lista de objetos nota:

nota.titulo : titulo de la nota
nota.comm : comentarios
nota.com : ' '.join(comentarios)
nota.volanta : volanta de la nota
nota.bajada : resumen o subtitulo
nota.cuerpo : cuerpo nota
nota.bolds : textos en negrita
nota.bold : ' '.join(bolds)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nscrapy-0.0.3.tar.gz (7.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

nscrapy-0.0.3-py3-none-any.whl (12.8 kB view details)

Uploaded Python 3

File details

Details for the file nscrapy-0.0.3.tar.gz.

File metadata

  • Download URL: nscrapy-0.0.3.tar.gz
  • Upload date:
  • Size: 7.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/0.0.0 pkginfo/1.6.0 requests/2.24.0 setuptools/49.6.0.post20200814 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.7.4

File hashes

Hashes for nscrapy-0.0.3.tar.gz
Algorithm Hash digest
SHA256 58fb58d47169e87c4ad67f063f0a66c6c732a6e514518cd6197fb8160c5026e8
MD5 fb49a2f7168d3bd2d906303268c24444
BLAKE2b-256 0db0ec0e11b80f86f9341c21b1c7fc0d392990c6a1b7aca0dbc9ef03c0def1a4

See more details on using hashes here.

File details

Details for the file nscrapy-0.0.3-py3-none-any.whl.

File metadata

  • Download URL: nscrapy-0.0.3-py3-none-any.whl
  • Upload date:
  • Size: 12.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/0.0.0 pkginfo/1.6.0 requests/2.24.0 setuptools/49.6.0.post20200814 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.7.4

File hashes

Hashes for nscrapy-0.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 18df76a4d85f348b5cb437eed518037d1c9bcd8f51098afa6e0563f884192014
MD5 72f97c503f287491c7fa8f39ce2fe8f6
BLAKE2b-256 803bd030ca496948855f5ba287314c5d20863e44f69ca96d457a9cb8309f4f58

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page