Web scraping API for Finnish websites
Project description
finscraper
The library provides an easy-to-use API for fetching data from various Finnish websites:
Website | Type | Spider API class |
---|---|---|
Ilta-Sanomat | News article | ISArticle |
Iltalehti | News article | ILArticle |
YLE Uutiset | News article | YLEArticle |
Suomi24 | Discussion thread | Suomi24Page |
Vauva | Discussion thread | VauvaPage |
Oikotie Asunnot | Apartment ad | OikotieApartment |
Tori | Item deal | ToriDeal |
Documentation is available at https://finscraper.readthedocs.io and simple online demo here.
Installation
pip install finscraper
Quickstart
Fetch 10 news articles as a pandas DataFrame from Ilta-Sanomat:
from finscraper.spiders import ISArticle
spider = ISArticle().scrape(10)
articles = spider.get()
The API is similar for all the spiders:
Contributing
Please see CONTRIBUTING.md for more information.
Jesse Myrberg (jesse.myrberg@gmail.com)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
finscraper-0.2.2.tar.gz
(22.4 kB
view hashes)
Built Distribution
finscraper-0.2.2-py3-none-any.whl
(30.1 kB
view hashes)
Close
Hashes for finscraper-0.2.2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 0b66a6c8dfd37eab853496c24d5dc7de6aacaacd30ea15d6eac4945a9246fb28 |
|
MD5 | 8f9c22837afc3316e2f407cad88b57ee |
|
BLAKE2b-256 | 6d15e2708246ded835689684c42e7100c7fde3146b1e44a273e70593c2f49e42 |