A Brazilian News Website Data Acquisition Library for Python
Project description
A Brazilian News Website Data Acquisition Library for Python
pyBrNews Project, made with :heart: by Lucas Rodrigues (@NepZR).
The pyBrNews project is a Python 3 library in development for tasks of data acquisition in Brazilian News Websites, capable for extracting news and comments from this platforms and with it's core utilizing the requests-HTML library.
:newspaper: Websites and capture groups supported
Website name | News | Comments | URL |
Portal G1 | :white_check_mark: Working | :keyboard: In progress | Link |
Folha de São Paulo | :white_check_mark: Working | :white_check_mark: Working | Link |
Exame | :white_check_mark: Working | :x: Not supported | Link |
Metrópoles | :keyboard: In progress | :keyboard: In progress | Link |
Database: using MongoDB (pyMongo), supported since October 28th, 2022.
Internal Module:pyBrNews.config.database.PyBrNewsDB
:keyboard: Available methods
Soon :tm:
:man_technologist: Project Developer
Lucas Darlindo Freitas Rodrigues Data Engineer | Backend Python Dev. LinkedIn (lucasdfr) |
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
pyBrNews-0.1.0.tar.gz
(25.6 kB
view hashes)
Built Distribution
pyBrNews-0.1.0-py3-none-any.whl
(29.0 kB
view hashes)