About GNscraper
GNscraper (Google News Scraper) scrapes news articles from Google News for a given keyword and returns a DataFrame containing each article's publisher, title, URL, upload date, and upload datetime.
Installation
pip install GNscraper
Available Features
- Get News by Keyword
Example :
from GNscraper import news_by_keyword
tesla_news = news_by_keyword('tesla')
tesla_news.head(10)
- Available Countries
Example :
from GNscraper import available_countries
countries = available_countries()
print(countries)
Output :
country country_code
0 Australia AU
1 Botswana BW
2 Canada CA
3 Ethiopia ET
4 Ghana GH
5 India IN
6 Indonesia ID
7 Ireland IE
8 Israel IL
9 Kenya KE
10 Latvia LV
11 Malaysia MY
12 Namibia NA
13 New Zealand NZ
14 Nigeria NG
15 Pakistan PK
16 Philippines PH
17 Singapore SG
18 South Africa ZA
19 Tanzania TZ
20 Uganda UG
21 United Kingdom GB
22 United States US
23 Zimbabwe ZW
24 Czech Republic CZ
25 Germany DE
26 Austria AT
27 Switzerland CH
28 Argentina AR
29 Chile CL
30 Colombia CO
31 Cuba CU
32 Mexico MX
33 Peru PE
34 Venezuela VE
35 Belgium BE
36 France FR
37 Morocco MA
38 Senegal SN
39 Italy IT
40 Lithuania LT
41 Hungary HU
42 Netherlands NL
43 Norway NO
44 Poland PL
45 Brazil BR
46 Portugal PT
47 Romania RO
48 Slovakia SK
49 Slovenia SI
50 Sweden SE
51 Vietnam VN
52 Turkey TR
53 Greece GR
54 Bulgaria BG
55 Russia RU
56 Ukraine UA
57 Serbia RS
58 United Arab Emirates AE
59 Saudi Arabia SA
60 Lebanon LB
61 Egypt EG
62 Bangladesh BD
63 Thailand TH
64 China CN
65 Taiwan TW
66 Hong Kong HK
67 Japan JP
68 Republic of Korea KR
- Available Languages
Example :
from GNscraper import available_languages
languages = available_languages()
print(languages)
Output :
language language_code
0 english en
1 indonesian id
2 czech cs
3 german de
4 spanish es-419
5 french fr
6 italian it
7 latvian lv
8 lithuanian lt
9 hungarian hu
10 dutch nl
11 norwegian no
12 polish pl
13 portuguese brasil pt-419
14 portuguese portugal pt-150
15 romanian ro
16 slovak sk
17 slovenian sl
18 swedish sv
19 vietnamese vi
20 turkish tr
21 greek el
22 bulgarian bg
23 russian ru
24 serbian sr
25 ukrainian uk
26 hebrew he
27 arabic ar
28 marathi mr
29 hindi hi
30 bengali bn
31 tamil ta
32 telugu te
33 malyalam ml
34 thai th
35 chinese simplified zh-Hans
36 chinese traditional zh-Hant
37 japanese ja
38 korean ko
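The language-specific functions later in this README take a code such as 'ta' rather than a language name. A minimal sketch of a name-to-code lookup follows, using a few rows copied from the table above. The helper `language_code` and the `LANGUAGE_CODES` dictionary are hypothetical and not part of GNscraper.

```python
# Subset of the language table printed above (names are lower-case as shown).
# LANGUAGE_CODES and language_code are illustrative, not GNscraper API.
LANGUAGE_CODES = {
    "english": "en",
    "hindi": "hi",
    "tamil": "ta",
    "telugu": "te",
    "japanese": "ja",
}


def language_code(name):
    """Return the language code for a language name, ignoring case and padding."""
    key = name.strip().lower()
    if key not in LANGUAGE_CODES:
        raise ValueError(f"unknown language: {name!r}")
    return LANGUAGE_CODES[key]


print(language_code("Tamil"))  # ta
```

Failing early with a ValueError keeps a misspelled language name from being passed on as a code.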
- Available Date Ranges
Example :
from GNscraper import available_date_range
date_range = available_date_range()
print(date_range)
- Available Categories
Example :
from GNscraper import available_new_categories
news_categories = available_new_categories()
print(news_categories)
Output :
['Health',
'Science',
'Sports',
'Entertainment',
'Technology',
'Business',
'World',
'National',
'Top stories']
- Get News Articles in a Specific Language Based on a Keyword
Example :
from GNscraper import news_by_keyword_lang
covid_df = news_by_keyword_lang('covid-19','ta')
covid_df.head(6)
- Get News Articles Based on a Keyword within a Specific Date Range
Example :
from GNscraper import news_by_keyword_date
latest_tesla_df = news_by_keyword_date('tesla','1d')
latest_tesla_df.head(10)
- Get News Articles Based on a Keyword from a Specific Website
Example :
from GNscraper import news_by_keyword_web
bbc_tesla_df = news_by_keyword_web('tesla','www.BBC.com')
bbc_tesla_df.head(10)
- Get News Articles Based on Location
Example :
from GNscraper import news_by_loaction
chennai_news = news_by_loaction('chennai')
chennai_news.head(10)
- Get News Articles Based on Location in a Specific Language
Example :
from GNscraper import news_by_location_lang
chennai_news_df = news_by_location_lang('chennai','te')
chennai_news_df.head(6)
- Get News Articles That Include One Keyword and Exclude Another
Example :
from GNscraper import news_by_keyword_exclude
tesla_df = news_by_keyword_exclude('tesla','Elon Musk')
tesla_df.head(6)
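This README does not show how GNscraper builds its requests. As an illustration only, the sketch below shows one way keyword, exclusion, website, and date-range filters can be combined into a single Google News RSS search URL using the `-`, `site:`, and `when:` search operators. The helper `build_search_url` and its parameters are hypothetical, and GNscraper may work differently.

```python
from urllib.parse import urlencode


def build_search_url(keyword, exclude=None, site=None, when=None,
                     lang="en", country="US"):
    """Build a Google News RSS search URL (illustrative helper, not GNscraper API)."""
    terms = [keyword]
    if exclude:
        # A leading minus asks the search to drop results with this phrase.
        terms.append(f'-"{exclude}"')
    if site:
        # Restrict results to one publisher's domain.
        terms.append(f"site:{site}")
    if when:
        # Relative date range, for example "1d" for the last day.
        terms.append(f"when:{when}")
    params = {
        "q": " ".join(terms),
        "hl": lang,                    # interface language
        "gl": country,                 # country edition
        "ceid": f"{country}:{lang}",   # edition identifier
    }
    return "https://news.google.com/rss/search?" + urlencode(params)


url = build_search_url("tesla", exclude="Elon Musk", when="1d")
print(url)
```

Only the URL is constructed here; no request is sent, so the sketch runs without network access.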
- Get National Headlines Based on a Country Code and News Category
Example :
from GNscraper import national_headlines_by_categories
IN_sports_df = national_headlines_by_categories('IN','Sports')
IN_sports_df.head(6)
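The country code and category arguments are expected to match the values listed under Available Countries and Available Categories. Whether GNscraper itself tolerates different casing is not stated here, so a small pre-check such as the sketch below may help. The helper `normalize_headline_args` is hypothetical, and the country set is only a subset of the table.

```python
# Categories as listed under Available Categories.
CATEGORIES = ['Health', 'Science', 'Sports', 'Entertainment', 'Technology',
              'Business', 'World', 'National', 'Top stories']

# Subset of the codes listed under Available Countries (illustrative only).
COUNTRY_CODES = {"IN", "US", "GB", "AU", "JP"}


def normalize_headline_args(country_code, category):
    """Return (code, category) in the listed spelling, or raise ValueError."""
    code = country_code.strip().upper()
    if code not in COUNTRY_CODES:
        raise ValueError(f"unsupported country code: {country_code!r}")
    # Map lower-cased names back to the exact spelling in CATEGORIES.
    by_lower = {c.lower(): c for c in CATEGORIES}
    key = category.strip().lower()
    if key not in by_lower:
        raise ValueError(f"unsupported category: {category!r}")
    return code, by_lower[key]


print(normalize_headline_args("in", "sports"))  # ('IN', 'Sports')
```

The normalized pair can then be passed to national_headlines_by_categories in the same order as in the example above.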
License
Distributed under the MIT License. See LICENSE for more information.
Contact
Mugilan Deiveegan - mugilan.deiveegan98@gmail.com
Hashes for GNscraper-0.0.1-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | 975e6a3e5a3b090f0bd714365ca87ac0611e97086234626de88ad524ee4f82ab
MD5 | bed03b7305a33daf52046edca0af6142
BLAKE2b-256 | b2c94c5a56d8e2cc346ba0f62776b7023f304f2fee07a5a9d837da0a66325754