A Python package for web scraping Korean news articles

Project description

Korean News Scraper

Hello World! Korean News Scraper aims to be a Korean-language data collection tool for LLMs.

The package is usable after version 0.1.1. Please do not use earlier versions.

Build

Run setup.py to build the package.

Required Libraries

  • beautifulsoup4
  • selenium
  • pandas
  • requests

Version 0.1.2.

Test version.

This version provides a single service: saving links to Google News articles that match specific keywords.
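
The idea of saving article links for a keyword can be sketched as follows. This is not the package's own code or API: every function name here is hypothetical, and the sketch assumes Google News' public RSS search endpoint with Korean locale parameters (hl=ko, gl=KR, ceid=KR:ko) and the standard library in place of the selenium and beautifulsoup4 dependencies listed above. Fetching the feed is left out so the example runs without network access.

```python
import csv
import io
import xml.etree.ElementTree as ET
from urllib.parse import urlencode


def build_search_url(keyword):
    """Build a Google News RSS search URL (assumed endpoint and locale values)."""
    params = {"q": keyword, "hl": "ko", "gl": "KR", "ceid": "KR:ko"}
    return "https://news.google.com/rss/search?" + urlencode(params)


def extract_links(rss_xml):
    """Return the <link> text of every <item> in an RSS 2.0 document."""
    root = ET.fromstring(rss_xml)
    return [item.findtext("link", default="") for item in root.iter("item")]


def save_links(keyword, links, out):
    """Write one (keyword, link) row per article link to a file-like object."""
    writer = csv.writer(out)
    writer.writerow(["keyword", "link"])
    for link in links:
        writer.writerow([keyword, link])


# Offline demonstration with a hand-written feed instead of a live request.
sample_feed = (
    "<rss><channel>"
    "<item><title>a</title><link>https://example.com/a</link></item>"
    "<item><title>b</title><link>https://example.com/b</link></item>"
    "</channel></rss>"
)
buffer = io.StringIO()
save_links("AI", extract_links(sample_feed), buffer)
print(build_search_url("AI"))
print(buffer.getvalue())
```

In real use, the feed text would come from an HTTP request to the URL returned by build_search_url, and the output would be an opened .csv file rather than an in-memory buffer.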

Version 0.1.3.

Custom keywords to use for fetching data can now be provided in a .csv file.
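
A minimal sketch of loading keywords from a .csv file is shown below. The file layout is not documented here, so the single "keyword" header column is an assumption, and read_keywords is a hypothetical helper rather than part of the package. It uses the standard csv module; pandas, which the package lists as a dependency, could do the same job.

```python
import csv
import io


def read_keywords(csv_file, column="keyword"):
    """Read unique, non-empty keywords from a CSV with a header row.

    The column name is an assumption made for this sketch.
    """
    reader = csv.DictReader(csv_file)
    seen = set()
    keywords = []
    for row in reader:
        value = (row.get(column) or "").strip()
        # Skip blanks and duplicates while preserving the file order.
        if value and value not in seen:
            seen.add(value)
            keywords.append(value)
    return keywords


# Demonstration with in-memory data; a real call would pass an open file.
sample_csv = io.StringIO("keyword\n인공지능\n 반도체 \n인공지능\n")
print(read_keywords(sample_csv))
```

With a file on disk, opening it with encoding="utf-8" and newline="" before passing it to read_keywords keeps Korean text and line endings intact.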

Source Distribution

korean_news_scraper-0.1.3.tar.gz (3.8 kB)

Uploaded Source

Built Distribution

korean_news_scraper-0.1.3-py3-none-any.whl (4.7 kB)

Uploaded Python 3

File details

Details for the file korean_news_scraper-0.1.3.tar.gz.

File metadata

  • Download URL: korean_news_scraper-0.1.3.tar.gz
  • Upload date:
  • Size: 3.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.10.12

File hashes

Hashes for korean_news_scraper-0.1.3.tar.gz
Algorithm Hash digest
SHA256 77d36ee66d31faded440f5ca40057e80cb8463789ebdacc90e49730beb192173
MD5 fce607673eb96fc9c7d73309d6249ac2
BLAKE2b-256 9aee75ca8bd0ab80cd648fce765398c2e8a8c0e551332095dfc9c8ed2ccf262b

File details

Details for the file korean_news_scraper-0.1.3-py3-none-any.whl.

File hashes

Hashes for korean_news_scraper-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 bf8b502c48c2a564df41f5dcb1780ab7227b0b8633bf790a8b60a3deac33f511
MD5 518dd1e3f28e3e843ba31cb3709845bf
BLAKE2b-256 5ad7525ee57b089b0b3ce596604395ce33fd7dc0b4eb7b30c4a91a6c8d88932b
