Web Scraping Library
Project description
Title
pikiran-rakyat Scraper
Description
- With pikiran-rakyat Scraper you can scrape search results and extract the content of the pages those results point to.
- The scraper collects the data present on the website and returns JSON data containing the details of that content.
JSON sample data
{
  "Scraper_repeat_data_0": [
    {
      "column_1": "Editor: Puput Akad Ningtyas Pratiwi"
    },
    {
      "column_1": "Sumber: YouTube Najwa Shihab"
    }
  ],
  "Scraper_detail_data": {
    "name": "Jadi Bulan-bulanan Mahasiswa saat Hadir di Mata Najwa, Fadjroel Rachman: Ampun deh Ini! - Pikiran-Rakyat.com",
    "photo_caption": "Staf Khusus Presiden bidang Komunikasi, Fadjroel Rachman /Antara Foto/Wahyu Putro A",
    "read_info_author": "Eka Alisa Putri",
    "read_title": "Jadi Bulan-bulanan Mahasiswa saat Hadir di Mata Najwa, Fadjroel Rachman: Ampun deh Ini!",
    "time": "- 2 Oktober 2021, 08:53 WIB"
  },
  "Scraper_repeat_data_1": [
    {
      "column_4": ""
    }
Run Scraper
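# Import the scraper entry point by name instead of using a wildcard import.
from pikiran_rakyat_scraper import run_pikiran_rakyat_scraper

link = "https://www.pikiran-rakyat.com/nasional/pr-012715353/tokoh-nu-sebut-mensos-risma-seperti-kesurupan-kerjanya-cuma-marah-marah-ganti-saja"
data = run_pikiran_rakyat_scraper(link)  # scraped result as JSON data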
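Once the scraper has returned, the result can be reduced to the fields you care about. The sketch below is not part of the package: `summarize` and `parse_wib_time` are hypothetical helpers, and they assume the result is either a dict or a JSON string shaped like the sample data above. The `time` field is assumed to always follow the "- 2 Oktober 2021, 08:53 WIB" pattern seen in that sample.

```python
import json
from datetime import datetime, timedelta, timezone

# Indonesian month names, as used in the sample "time" field.
MONTHS_ID = {
    "Januari": 1, "Februari": 2, "Maret": 3, "April": 4,
    "Mei": 5, "Juni": 6, "Juli": 7, "Agustus": 8,
    "September": 9, "Oktober": 10, "November": 11, "Desember": 12,
}
WIB = timezone(timedelta(hours=7), "WIB")  # Western Indonesia Time, UTC+7


def parse_wib_time(raw):
    """Parse a string such as '- 2 Oktober 2021, 08:53 WIB' (assumed format)."""
    text = raw.lstrip("- ").strip()
    date_part, time_part = text.split(",")
    day, month_name, year = date_part.split()
    hour, minute = time_part.replace("WIB", "").strip().split(":")
    return datetime(int(year), MONTHS_ID[month_name], int(day),
                    int(hour), int(minute), tzinfo=WIB)


def summarize(data):
    """Pick title, author, publication time and credit lines from a result."""
    if isinstance(data, str):  # accept a JSON string as well as a dict
        data = json.loads(data)
    detail = data.get("Scraper_detail_data", {})
    credits = [row["column_1"]
               for row in data.get("Scraper_repeat_data_0", [])
               if row.get("column_1")]
    return {
        "title": detail.get("read_title"),
        "author": detail.get("read_info_author"),
        "published": parse_wib_time(detail["time"]) if detail.get("time") else None,
        "credits": credits,
    }


# A trimmed copy of the sample data, used here instead of a live request.
sample = {
    "Scraper_repeat_data_0": [
        {"column_1": "Editor: Puput Akad Ningtyas Pratiwi"},
        {"column_1": "Sumber: YouTube Najwa Shihab"},
    ],
    "Scraper_detail_data": {
        "read_info_author": "Eka Alisa Putri",
        "read_title": "Jadi Bulan-bulanan Mahasiswa saat Hadir di Mata Najwa, Fadjroel Rachman: Ampun deh Ini!",
        "time": "- 2 Oktober 2021, 08:53 WIB",
    },
}
summary = summarize(sample)
print(summary["author"], summary["published"].isoformat())
```

Using `.get` with defaults keeps the helper tolerant of pages where a section is missing, which seems likely given that the sample itself contains an empty `column_4`.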
How it works
- It takes the URL of a pikiran-rakyat page with a search keyword and scrapes the data from it.
- It generates JSON data containing the information from the pikiran-rakyat search result.
- It returns every detail present on the website in the form of JSON data.
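The URL-in, JSON-out flow described above can be wrapped with two small guards. This is a minimal sketch under stated assumptions, not the library's own behaviour: `is_pikiran_rakyat_url` and `to_json_text` are hypothetical helpers, and the list of accepted host names is an assumption based on the example link.

```python
import json
from urllib.parse import urlparse

# Assumed set of host names; adjust if the site uses other subdomains.
ALLOWED_HOSTS = {"pikiran-rakyat.com", "www.pikiran-rakyat.com"}


def is_pikiran_rakyat_url(url):
    """Return True for http(s) URLs whose host is a pikiran-rakyat domain."""
    parts = urlparse(url)
    return parts.scheme in ("http", "https") and parts.hostname in ALLOWED_HOSTS


def to_json_text(data):
    """Serialize a result dict, keeping non-ASCII characters readable."""
    return json.dumps(data, ensure_ascii=False, indent=2)


link = "https://www.pikiran-rakyat.com/nasional/pr-012715353/tokoh-nu-sebut-mensos-risma-seperti-kesurupan-kerjanya-cuma-marah-marah-ganti-saja"
if is_pikiran_rakyat_url(link):
    # With the package installed, the scrape itself would go here:
    # data = run_pikiran_rakyat_scraper(link)
    data = {"Scraper_detail_data": {"read_info_author": "Eka Alisa Putri"}}  # stand-in result
    print(to_json_text(data))
```

Checking the host before scraping avoids sending an unrelated URL to the scraper, and `ensure_ascii=False` keeps any non-ASCII characters in the scraped text readable in the saved output.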
Examples
Below are some examples of URLs that you can scrape:
Queries/Feedback
If you have any queries or feedback, please contact us at the following:
Telegram
Email
Source Distribution
File details
Details for the file pikiran_rakyat_scraper-1.0.0.tar.gz.
File metadata
- Download URL: pikiran_rakyat_scraper-1.0.0.tar.gz
- Upload date:
- Size: 3.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.5.0 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.9.6
File hashes
Algorithm | Hash digest
---|---
SHA256 | fc71de636970a84ea91fdcd377fa11e0f2a4fc28ce2330c0e2ec321cee49baf1
MD5 | ee292de56877552abb8988830be86f71
BLAKE2b-256 | 385750ab8fe2428fd594b481946bc6ee45a7e3e477f0c243e245f5d649335725
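The SHA256 digest listed above can be used to check a downloaded copy of the source distribution. The following is a minimal sketch using only the Python standard library; `sha256_of_file` and `matches_expected` are hypothetical helper names, and the local file path in the usage comment is an assumption about where the archive was saved.

```python
import hashlib
import hmac

# SHA256 digest published for pikiran_rakyat_scraper-1.0.0.tar.gz.
EXPECTED_SHA256 = "fc71de636970a84ea91fdcd377fa11e0f2a4fc28ce2330c0e2ec321cee49baf1"


def sha256_of_file(path, chunk_size=65536):
    """Hash a file in chunks so large archives need not fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Compare the file's digest with the expected hex string."""
    return hmac.compare_digest(sha256_of_file(path), expected.lower())


# Usage, assuming the archive sits in the current directory:
# print(matches_expected("pikiran_rakyat_scraper-1.0.0.tar.gz"))
```

SHA256 is the digest to prefer here; MD5 is listed for completeness but is not considered collision resistant.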