Web scraping API for Finnish websites
finscraper
The library provides an easy-to-use API for fetching data from various Finnish websites:
Website | URL | Type | Spider API class |
---|---|---|---|
IltaSanomat | https://www.is.fi | News article | finscraper.spiders.ISArticle |
Iltalehti | https://www.iltalehti.fi | News article | finscraper.spiders.ILArticle |
Documentation is available at https://www.finscraper.readthedocs.io.
Installation
pip install finscraper
Quickstart
Fetch 10 news articles as a pandas DataFrame from IltaSanomat:
from finscraper.spiders import ISArticle
spider = ISArticle().scrape(10)
articles = spider.get()
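Since spiders tend to break when a website changes its layout, it can help to sanity-check scraped results before using them. The following is a minimal sketch of such a check, not part of finscraper: the helper name `empty_fields`, the field names `url`, `title` and `content`, and the sample records are all hypothetical, chosen only for illustration. It works on plain dictionaries, so it runs without any network access.

```python
from typing import Iterable, List, Mapping


def _is_blank(value: object) -> bool:
    """True for None, NaN, or a string that is empty after stripping."""
    if value is None:
        return True
    if isinstance(value, float) and value != value:  # NaN is not equal to itself
        return True
    return isinstance(value, str) and value.strip() == ""


def empty_fields(
    records: Iterable[Mapping[str, object]], required: Iterable[str]
) -> List[str]:
    """Return the required fields that are missing or blank in every record.

    A field that is blank everywhere suggests that the part of the spider
    extracting it no longer matches the page. An empty result set flags
    every required field.
    """
    rows = list(records)
    return [
        field
        for field in required
        if all(_is_blank(row.get(field)) for row in rows)
    ]


# Hypothetical records standing in for scraped articles.
sample = [
    {"url": "https://www.is.fi", "title": "First headline", "content": ""},
    {"url": "https://www.is.fi", "title": "Second headline", "content": None},
]

print(empty_fields(sample, ["url", "title", "content"]))  # ['content']
```

With a real result, the records could be obtained through pandas' `articles.to_dict("records")`, assuming the DataFrame has columns with the names you pass as `required`.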
Contributing
When websites change, spiders tend to break. I can't promise to keep this repository up to date on my own, so pull requests are more than welcome!
Jesse Myrberg (jesse.myrberg@gmail.com)