Scrapping altnews|Google images|Instagram|facebook|Twitter|Tradingview chartImage
Project description
Author: Shivam Vishwakarma
Scrapping
1 Altnews.com scrapping 2 Google Image Scrapping 3 Instagram user profile scrapping based on username 4 Instagram single post scrapping based on link 5 Facebook post scrapping based on postlink 6 Twitter post scrapping based on link 7 Tradingview chart image scrapping 8 Whatsapp messaging using whatsappweb
1 ALTNEWS
This is a Indian fact news checking site. this package is helpful to scrape all data and create a dataframe of every article which is available on site . data includes: Text Author name Date Title Link
2 Google Image Downloader
In this module you can scrape all images based on given image name . Basically it takes a few minutes to scrape all imaes . it iterates over all image sub categories and scrape them . This is very useful for making dataset of difficult images dataset . Required only name of image
3 Instagram user profile scrapping
Scrape a whole user profile images based on instagram userid just need userid .
4 Scrapping instagram posts
This is helpful to scrape data related to a instagram post . shared link of post.
5 Scrapping Facebook post
This is related to scrape post of the faceebook based on shared link of facebook post .
6 Scrapping Twitter post
Just get the shareable link of twitter post . Use that as a url to the function and it will return data related to posts .
7 Scrapping trading view image chart
This is to save image of the tradingview chart based on symbol,timeinterval . THis is helpful to make dataset for trading projects .
8 Whatsapp messaging
Here we can send messages to existing friends in the user accout . This is based on whatsapp web . Just need to scan whatsapp QR code and start to message to the friends . You can send a messages n no. of times . Ex: target='varsha' message='Hi' n=10 Then it will send message Hi 10 times to varsha .
Importing scrapperShivam
***************************************************************
from scrapper import scrape
***************************************************************
1 # Altnews.com
***************************************************************
#1 Scrapping full Fake News dataset from altnews.com
scrape.altnews()
***************************************************************
2 # scrape fb_post
***************************************************************
# Scrapping facebook post based on fb_url
url="""http://www.facebook.com/story.php?story_fbid=1465359780314951&id=100005228298088&scmts=
scwspsdd&extid=zIROvP0jkc3UkapK"""
text,image,url=scrape.fb_post(url)
# Facebook shared link of post
***************************************************************
('मेरे द्वारा एक निजी इंजीनियरिंग कॉलेज के फेसबुक पेज पर खुद के वेतन और संस्था की सच्चाई पूर्ण विनम्रता के साथ कॉमेंट बॉक्स में लिखी थी तो कॉलेज प्रशासन ने उसे हटा दिया और सभी स्टाफ मेंबर से बोला है कि...',
'https://scontent.fixc7-1.fna.fbcdn.net/v/t1.0-9/116582208_1465359653648297_6414823970251708469_o.jpg?_nc_cat=103&_nc_sid=110474&_nc_ohc=dAOYaJ52M2oAX-GH2UF&_nc_ht=scontent.fixc7-1.fna&oh=a5e88e7ad2018bb1764f2c97b2caa9f9&oe=5F9ACA07',
'https://www.facebook.com/permalink.php?story_fbid=1465359780314951&id=100005228298088')
3 # scrape image from google
***************************************************************
# scrapping all images from google based on image name
scrape.google_image('./chromedriver','football')
# param1 : chromedriver_path
# param 2 : image name to download
***************************************************************
4 # Instagram scrape post
***************************************************************
# Instagram post scrapping based on link
insta_post_url='https://www.instagram.com/p/CFy6m47lFJi/?utm_source=ig_web_button_share_sheet'
text,image=scrape.insta_post(insta_post_url)
# param 1 : instagram shared post url
***************************************************************
Trying with : https://www.instagram.com/p/CFy6m47lFJi/?utm_source=ig_web_button_share_sheet
('\nAmbivert परिंदा 🦋 (@curlicious_memer) posted on Instagram: “Paisa barbaad bc ..ab lagta Whitehat jr. p hi registration krna pdega 😕😂 Follow @curlicious_memer for more . . . . . . #softwareengineer…” • Oct 1, 2020 at 9:16am UTC\n',
'https://instagram.fixc7-1.fna.fbcdn.net/v/t51.2885-15/e35/120477874_330305014963377_8160288912634310107_n.jpg?_nc_ht=instagram.fixc7-1.fna.fbcdn.net&_nc_cat=109&_nc_ohc=lesd4rVgfMAAX_z-3yo&_nc_tp=18&oh=5fcfd512ef4c3dbb88fbbe181f2bea3b&oe=5FA1AFA0')
5 # Scrape Insta profile
***************************************************************
# scrapping all images of a insta user profile
scrape.insta_profile('./chromedriver','shivamthevirus')
# param 1 : Chromedriver_path
# param 2 : instagram profile username
***************************************************************
Retrieving ..............
Started downloading .........
6 # Scrape chart image from tradingview
***************************************************************
# scrapping chart image from tradingview.com based on symbol and time interval
scrape.trade('./chromedriver','reliance',t='month') #t=month/minute/hour/week/
# param1 : chromedriver_path
#param2 : stock symbol name
# param3 : time_interval
***************************************************************
image_url : https://www.tradingview.com/x/WVWuMkqj/
'images/reliance-2.png'
7 # Scrapping Post from twitter link
***************************************************************
twit_url='https://twitter.com/rashtrapatibhvn/status/1311647175747592198?s=20'
text.image,url=scrape.twit_post(twit_url)
# param1 : shared link of facebook_post
***************************************************************
Trying with : https://mobile.twitter.com/rashtrapatibhvn/status/1311647175747592198?s=20
(' आज मेरे जन्मदिन पर मुझे अनेक शुभकामना सन्देश प्राप्त हुए हैं। इसके लिए मैं हृदय से आभार व्यक्त करता हूं। आप सभी देशवासियों के स्नेह और सौहार्द से मुझे राष्ट्र की सेवा में तत्पर रहने की प्रेरणा व ऊर्जा प्राप्त होती है। आप सभी का बहुत-बहुत धन्यवाद!\n',
'',
'')
8 # messaging with web.whatsapp.com
***************************************************************
# scan web.whatsapp qrcode and give input y to continue
scrape.whatsapp('./chromedriver')
#param 1 : chromedriverpath
***************************************************************
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file scrapeShivam-2.0.0.tar.gz
.
File metadata
- Download URL: scrapeShivam-2.0.0.tar.gz
- Upload date:
- Size: 12.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.6.0 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.6.9
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 4cabbdf7442d8ccfdb9c6d24c610f088c010b9e52c3c095ead2e961e28cf4d2c |
|
MD5 | f40213dc06125888f72a17973b70f6e0 |
|
BLAKE2b-256 | e561a7850665bc7a348f9c5e5701246fdf99e2d9b96f13141f1e8c5bd51373ab |
File details
Details for the file scrapeShivam-2.0.0-py3-none-any.whl
.
File metadata
- Download URL: scrapeShivam-2.0.0-py3-none-any.whl
- Upload date:
- Size: 33.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.6.0 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.6.9
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | f0cf876e36634509b357e658894cb5a93e46a2bbbff8c038a6acc60d9480fce3 |
|
MD5 | 4b47d509e04ced0ca94099eb5215eda8 |
|
BLAKE2b-256 | 2e4dff952c9313612e2ac52c601ca404e7599b716b53e1fe7f5ce06137eac398 |