Python module for ingesting Twitter exports into Preservica
Project description
preserva-tweet
Ingest Tweets from a Twitter Export into Preservica
This library provides a Python module which will ingest a Twitter export zip file into Preservica as individual tweets with any attached media files such as images or video. The tweets can then be rendered directly from within Preservica.
Contributing
Bug reports and pull requests are welcome on GitHub at https://github.com/carj/preserva-tweet
Support
preserva-tweet is 3rd party open source client and is not affiliated or supported by Preservica Ltd. There is no support for use of the library by Preservica Ltd. Bug reports can be raised directly on GitHub.
Users of preserva-tweet should make sure they are licensed to use the Preservica REST APIs.
License
The package is available as open source under the terms of the Apache License 2.0
Installation
preserva-tweet is available from the Python Package Index (PyPI)
https://pypi.org/project/preserva-tweet/
To install IngestTweets, simply run this simple command in your terminal of choice:
$ pip install preserva-tweet
Downloading your Twitter Archive
Step 1
Log in to your X account and open the Settings and Privacy panel. Go to the “Your Account” tab and select “Download an Archive of Your Data.
Step 2
For security purposes, you’ll need to re-enter your password. You’ll also need to provide a verification code.
Step 3
Once you’ve successfully completed these steps, you’ll see an option to request your archive. Click the “Request Archive” button to begin processing.
Step 4
The button will change to “Requesting Archive” and you’ll see a notice that your request is pending. Now it’s time to wait. It can take 24hrs for the export to be ready.
Step 5
When your archive is ready to download, you’ll get both an email in your inbox and a notification in your X account. Since Twitter archives are only available for a limited time, pay attention to the expiration date.
Ingesting Tweets
To run the module specify the location of the twitter export using the -a or --archive flag. The parent Preservica collection for the tweets must be specified using the -c --collection flag as a UUID
$ python -m preserva-tweet -a twitter-2024-10-17.zip -c a7ad52e3-2cb3-4cb5-af2a-3ab08829a2a8
usage: preserva-tweet [-h] -a ARCHIVE -c COLLECTION [-v] [-d] [-u USERNAME] [-p PASSWORD] [-s SERVER] [-t SECURITY_TAG]
Ingest a Twitter Account History into Preservica
options:
-h, --help show this help message and exit
-a ARCHIVE, --archive ARCHIVE
Twitter export ZIP archive path
-c COLLECTION, --collection COLLECTION
The Preservica parent collection uuid
-v, --verbose Print information as tweets are ingested
-d, --dry-run Validate the twitter export without ingesting
-u USERNAME, --username USERNAME
Your Preservica username if not using credentials.properties
-p PASSWORD, --password PASSWORD
Your Preservica password f not using credentials.properties
-s SERVER, --server SERVER
Your Preservica server domain name if not using credentials.properties
-t SECURITY_TAG, --security-tag SECURITY_TAG
The Preservica security tag of the ingested tweets (default is "open")
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file preserva_tweet-0.2.0.tar.gz
.
File metadata
- Download URL: preserva_tweet-0.2.0.tar.gz
- Upload date:
- Size: 13.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | f6476b5abef80e1f18a5305d21497643c1ec9a41b7f725151d1af9ec598f3979 |
|
MD5 | e052f1f31c0462c430536b1ebabd3299 |
|
BLAKE2b-256 | b3849cecaaf27fe6357faee3f2810cb4e7a77ae8254cf66c30fe3935e2c707b4 |
File details
Details for the file preserva_tweet-0.2.0-py3-none-any.whl
.
File metadata
- Download URL: preserva_tweet-0.2.0-py3-none-any.whl
- Upload date:
- Size: 12.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 62bb104614eb5b7c4155603fb00db9bd9d724447f8b31aa7c0382a8df84b764d |
|
MD5 | 1d7e0bb4d2b1c102106c0e21a734b8cc |
|
BLAKE2b-256 | 715a8f919338920e9f605c87a272059cdea25d0c083e6c30779bd65d050a86c8 |