Skip to main content

CLI tool for collecting and archiving data from Instagram users with Instaloader.

This project has been archived.

The maintainers of this project have marked this project as archived. No new releases are expected.

Project description

Instarchive

Instarchive is a command-line utility for collecting and organizing data from specific Instagram profiles using the Instaloader Python module.

Features

  • Downloads items only from a given list of users to track (instead of enumerating all followees or downloading the whole feed).

  • Sorts items from :stories and :feeds into directories based on username.

  • Responds to username changes and updates the tracking list file automatically.

Usage

Demo assuming a Unix environment:

# Set up the archive. Do re-run this command if you've changed your username.
# The username can also be omitted for using Instarchive anonymously (private
# profiles won't be accessible in those cases).
./instarchive.py init 'my_username'

# The file containing a list of users to track. Only data and metadata
# associated with these users will ever be downloaded.
cat << EOF > ~/instarchive/tracking.txt
target_username

# this is a comment
another_target_username
EOF

# Login to Instagram; your password and authentication code, if necessary, will
# be prompted for. Also, re-run this command if your session has expired, as is
# typically the case when HTTP 401 errors prevent content download. Currently,
# Instarchive cannot be used without logging in.
./instarchive.py login

# Download all accessible data (profile info, profile pic, posts, highlights,
# stories etc.) from all of the users named in the tracking list. Optional;
# intended to be run only once or occasionally since it takes a lot of web
# requests to Instagram.
./instarchive.py everything

# Download data (stories and posts) from the users named in the tracking list,
# only based on what is visible in your feed. Note that this is much faster
# than the 'everything' command. To specify the number of posts to go through in
# the feed, pass the '-p <number of posts>' option; the default is 200.
./instarchive.py feed

By default, the archive is located at the instarchive directory under the home directory of the current OS user. This location can be changed by passing the -d <path to archive directory> option to instarchive.py.

Disclaimer

As with Instaloader itself, Instarchive is independent of and unsupported by Instagram. Use at your own risk, and be wary of ratelimits.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

instarchive-1.0.0.tar.gz (6.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

instarchive-1.0.0-py3-none-any.whl (6.9 kB view details)

Uploaded Python 3

File details

Details for the file instarchive-1.0.0.tar.gz.

File metadata

  • Download URL: instarchive-1.0.0.tar.gz
  • Upload date:
  • Size: 6.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.3

File hashes

Hashes for instarchive-1.0.0.tar.gz
Algorithm Hash digest
SHA256 bd42c8a8ad66f76effcda80eaa5a81a0d53c2f0c02faa9d153fac61e0422374b
MD5 cd40f7320e240766023b5c5e42f61235
BLAKE2b-256 af7e50f9f63056ee87bf3d1d9931538e5252a91b0983f9ebbbb54e0c66ed672e

See more details on using hashes here.

File details

Details for the file instarchive-1.0.0-py3-none-any.whl.

File metadata

  • Download URL: instarchive-1.0.0-py3-none-any.whl
  • Upload date:
  • Size: 6.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.3

File hashes

Hashes for instarchive-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 46b478b80dab60ceea974cc15a37ee5062738286adda5dc9c626ca90308e8a1e
MD5 d4af33855772df6ba6724cb09f669e19
BLAKE2b-256 32ba884a68cf8a8af9ae7f1b6f2b71c9e40c0da6a2d60e1bd36dd3dffdf255de

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page