
Super lightweight Instagram web scraper for data analysis

instascrape: super lightweight Instagram scraping toolkit

What is it?

instascrape is an incredibly lightweight set of tools geared towards scraping Instagram data. It makes no assumptions about your project and is instead designed for flexibility and developer productivity. It is excellent for the seasoned data scientist trying to quickly get an idea of a page's engagement as well as beginners looking to explore web scraping and the beauty of Python for the very first time.


Graph of instagram data (this repository and its author(s) are not affiliated with Real Python)


:computer: Installation

Minimum Python version

This library currently requires Python 3.7 or higher.
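
If you are unsure which interpreter you are running, a minimal sketch like the following can confirm the requirement before installing. The helper name `meets_minimum` and the constant `MIN_VERSION` are made up for this illustration and are not part of instascrape.

```python
import sys

# Minimum version stated in this README; the constant name is illustrative.
MIN_VERSION = (3, 7)


def meets_minimum(version_info=sys.version_info, minimum=MIN_VERSION) -> bool:
    """Return True if the (major, minor) pair is at least the minimum."""
    return tuple(version_info[:2]) >= minimum


# Report on the interpreter that is running this script.
current = ".".join(str(part) for part in sys.version_info[:3])
if meets_minimum():
    print(f"Python {current} is supported")
else:
    print(f"Python {current} is too old; 3.7 or higher is required")
```

Comparing tuples rather than version strings avoids the classic mistake of treating "3.10" as older than "3.7".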

pip

Install from PyPI using

$ pip3 install insta-scrape

Clone

Clone right from GitHub to your local machine using

$ git clone https://github.com/chris-greening/instascrape.git

Install required dependencies using

$ pip3 install -r requirements.txt

:books: Documentation

The official documentation can be found on Read the Docs :newspaper:


:newspaper: Blog Posts

Check out blog posts on DEV for ideas and tutorials!


:pray: Contributing

All contributions, bug reports, bug fixes, documentation improvements, enhancements, and ideas are welcome!

Feel free to open an Issue or look at existing Issues to get a dialogue going on what you want to see added/changed/fixed!


:spider_web: Dependencies

instascrape primarily relies on two third-party libraries for requesting and scraping Instagram HTML content:

  1. Requests: HTTP requests
  2. BeautifulSoup: scraping and parsing HTML data

The rest of its functionality comes directly from Python 3's standard library, which keeps the code under the hood clear and concise.
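
To give a rough idea of how these two libraries divide the work, here is a minimal sketch that is not taken from instascrape's source. The function names `fetch_html` and `extract_og_description`, the sample markup, and the choice of the `og:description` meta tag are all assumptions made for illustration. The parsing half runs offline against an inline string, so nothing here depends on what Instagram actually serves.

```python
from typing import Optional

from bs4 import BeautifulSoup

# Hypothetical stand-in for a downloaded page; real markup will differ.
SAMPLE_HTML = """
<html>
  <head>
    <title>Example profile</title>
    <meta property="og:description" content="120 Followers, 45 Following, 8 Posts">
  </head>
  <body></body>
</html>
"""


def fetch_html(url: str) -> str:
    """Download a page with Requests and return its body as text."""
    # Imported lazily so the parsing half below can be run offline.
    import requests

    response = requests.get(url, timeout=10)
    response.raise_for_status()  # surface HTTP errors instead of parsing them
    return response.text


def extract_og_description(html: str) -> Optional[str]:
    """Return the og:description meta content, or None if the tag is absent."""
    soup = BeautifulSoup(html, "html.parser")
    tag = soup.find("meta", attrs={"property": "og:description"})
    if tag is None:
        return None
    return tag.get("content")


print(extract_og_description(SAMPLE_HTML))
```

Keeping the download and the parsing in separate functions makes the parsing logic easy to test against saved HTML without touching the network.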


:jack_o_lantern: Hacktoberfest 2020

This repo is participating in Hacktoberfest 2020! I would love for this repo to be a resource for absolute beginners looking to make some of their first contributions. Check out Issues for some easy ideas or open your own with something you want to work on! Please see the official Hacktoberfest FAQ for rules/questions.

Happy hacking!


:credit_card: License

MIT


:grey_question: Support

Reach out to me if you have questions or ideas!


Background

The inspiration for this project began a long time ago in a galaxy far, far away (a.k.a. Summer 2019 on Long Island). I was mindlessly scrolling Instagram for the 1000th hour that week and thought, "How could I access this data programmatically?". After 30 seconds of searching it became clear that Instagram's API was not going to be of any use, so I was going to have to figure it out myself, and thus instascrape was born.

