A Python package for spatio-temporal analysis of social media content
geoso
A Python package for collecting social media content and analysing it in space and time
- GitHub repo: https://github.com/MahdiFarnaghi/geoso
- PyPI: https://pypi.org/project/geoso/
- Documentation: https://MahdiFarnaghi.github.io/geoso
- Free software: MIT license
Introduction
geoso is a Python library under development that facilitates the collection, cleansing, and spatial and spatio-temporal analysis of social media data.
The vision is for the library to make it possible to download geo-tagged social media content into a database (e.g., PostgreSQL), preprocess the stored data, and then retrieve and analyse it.
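The store-then-retrieve workflow described above can be illustrated with a minimal sketch. It does not use geoso's API: it swaps in the standard-library `sqlite3` module for PostgreSQL, and the table name `tweet`, the column names, the sample records, and the helper functions `store_tweets` and `query_tweets` are all assumptions made for illustration.

```python
import sqlite3

# Hypothetical sample records; the field names are illustrative, not geoso's schema.
SAMPLE_TWEETS = [
    {"id": 1, "text": "Flooding on Main St", "lon": -77.9, "lat": 34.2,
     "created_at": "2018-09-14T10:00:00"},
    {"id": 2, "text": "Sunny day", "lon": 13.4, "lat": 52.5,
     "created_at": "2018-09-14T11:00:00"},
    {"id": 3, "text": "Power is out", "lon": -78.1, "lat": 34.0,
     "created_at": "2018-09-16T09:30:00"},
]


def store_tweets(conn, tweets):
    """Create the (hypothetical) tweet table if needed and insert the records."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS tweet ("
        "id INTEGER PRIMARY KEY, text TEXT, lon REAL, lat REAL, created_at TEXT)"
    )
    conn.executemany(
        "INSERT OR REPLACE INTO tweet VALUES (:id, :text, :lon, :lat, :created_at)",
        tweets,
    )
    conn.commit()


def query_tweets(conn, bbox, start, end):
    """Return (id, text) rows inside a bounding box and a half-open time window."""
    min_lon, min_lat, max_lon, max_lat = bbox
    rows = conn.execute(
        "SELECT id, text FROM tweet "
        "WHERE lon BETWEEN ? AND ? AND lat BETWEEN ? AND ? "
        "AND created_at >= ? AND created_at < ? "
        "ORDER BY created_at",
        (min_lon, max_lon, min_lat, max_lat, start, end),
    )
    return rows.fetchall()


conn = sqlite3.connect(":memory:")
store_tweets(conn, SAMPLE_TWEETS)
# Bounding box is (min_lon, min_lat, max_lon, max_lat); timestamps are ISO 8601 strings.
result = query_tweets(
    conn, (-80.0, 33.0, -76.0, 36.0), "2018-09-14T00:00:00", "2018-09-15T00:00:00"
)
print(result)
```

ISO 8601 timestamps stored as text sort chronologically, which is why the time filter can use plain string comparison here. A production setup on PostgreSQL would more likely use a spatial extension and native timestamp columns.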
Features
- Twitter
  - Download tweets from the Twitter Streaming API and save them to either a database or JSON Lines text files.
  - Import tweets from JSON Lines text files into the database.
  - Export tweets to a CSV file.
  - Clean tweet text in the database.
  - Retrieve tweets from the database as a pandas DataFrame.
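To make the JSON Lines and CSV features above more concrete, here is a minimal sketch of the two file formats using only the Python standard library. It is not geoso's implementation; the helper names `write_jsonl`, `read_jsonl`, and `export_csv` and the sample tweet fields are hypothetical.

```python
import csv
import io
import json


def write_jsonl(tweets, stream):
    """Write one JSON object per line (the JSON Lines convention)."""
    for tweet in tweets:
        stream.write(json.dumps(tweet) + "\n")


def read_jsonl(stream):
    """Yield one dict per non-empty line of a JSON Lines stream."""
    for line in stream:
        line = line.strip()
        if line:
            yield json.loads(line)


def export_csv(tweets, stream, fields):
    """Write the selected fields of each tweet as CSV, ignoring other keys."""
    writer = csv.DictWriter(stream, fieldnames=fields, extrasaction="ignore")
    writer.writeheader()
    for tweet in tweets:
        writer.writerow(tweet)


# Hypothetical sample data; real tweet objects carry many more fields.
tweets = [
    {"id": 1, "text": "Flooding on Main St", "lang": "en"},
    {"id": 2, "text": "Power is out", "lang": "en"},
]

# In-memory buffers stand in for files on disk.
jsonl_buffer = io.StringIO()
write_jsonl(tweets, jsonl_buffer)
jsonl_buffer.seek(0)
loaded = list(read_jsonl(jsonl_buffer))

csv_buffer = io.StringIO()
export_csv(loaded, csv_buffer, fields=["id", "text"])
csv_text = csv_buffer.getvalue()
print(csv_text)
```

JSON Lines suits streamed data because each tweet can be appended as it arrives without rewriting the file, while CSV flattens a chosen subset of fields for spreadsheet-style tools.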
Features under development
- Twitter
  - Clean tweet text in the database.
  - Detect tweets that were published by bots.
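As an illustration of what cleaning tweet text can involve, the sketch below strips URLs, user mentions, and hash signs, then normalises whitespace and case. This is an assumption about typical cleaning steps, not geoso's actual pipeline: the package credits spaCy for text cleaning, whereas this example uses only the standard-library `re` module, and the function name `clean_tweet_text` is hypothetical.

```python
import re

URL_RE = re.compile(r"https?://\S+")
MENTION_RE = re.compile(r"@\w+")
WHITESPACE_RE = re.compile(r"\s+")


def clean_tweet_text(text):
    """Remove URLs and @mentions, drop '#' signs, collapse whitespace, lowercase."""
    text = URL_RE.sub(" ", text)
    text = MENTION_RE.sub(" ", text)
    text = text.replace("#", "")  # keep the hashtag word, drop only the sign
    return WHITESPACE_RE.sub(" ", text).strip().lower()


cleaned = clean_tweet_text(
    "@NWS Flooding in #Wilmington   right now https://t.co/abc123"
)
print(cleaned)  # flooding in wilmington right now
```

Keeping the hashtag word while dropping the sign preserves place names and topics that later spatial or event analysis may rely on.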
Scientific publications
If you are using this library, the following scientific publications may be of interest to you.
- Farnaghi, M., Ghaemi, Z., & Mansourian, A. (2020). Dynamic Spatio-Temporal Tweet Mining for Event Detection: A Case Study of Hurricane Florence. International Journal of Disaster Risk Science, 11, 378-393.
- Ostermann, F. O. (2021). Linking geosocial sensing with the socio-demographic fabric of smart cities. ISPRS International Journal of Geo-Information, 10(2), 1-22.
- Zahra, K., Imran, M., & Ostermann, F. O. (2020). Automatic identification of eyewitness messages on Twitter during disasters. Information Processing & Management, 57(1), 102107.
- Ghaemi, Z., & Farnaghi, M. (2019). A Varied Density-based Clustering Approach for Event Detection from Heterogeneous Twitter Data. ISPRS International Journal of Geo-Information, 8(2).
Credits
- spaCy is used for cleaning text.
- Tweepy is used to implement the Twitter data retrieval functionality.
- This package was created with Cookiecutter and the giswqs/pypackage project template, following instructions from the Python Packages book.
Download files
File details
Details for the file geoso-0.0.8.tar.gz.
File metadata
- Download URL: geoso-0.0.8.tar.gz
- Upload date:
- Size: 23.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.5.0 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.10.0
File hashes
Algorithm | Hash digest
---|---
SHA256 | 0ad3c3845fd755cc74f748cc6c1ed3dd083fcc7e1cdd970fa7175a402e258831
MD5 | b0aa3fc0ef2f4bcf42afd68bba3b579d
BLAKE2b-256 | dbaa1a7d138e259b06174b1dea77d49a3b5101aa673d19ada3919a3a3c31c725
File details
Details for the file geoso-0.0.8-py2.py3-none-any.whl.
File metadata
- Download URL: geoso-0.0.8-py2.py3-none-any.whl
- Upload date:
- Size: 23.7 kB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.5.0 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.10.0
File hashes
Algorithm | Hash digest
---|---
SHA256 | 4c85eb8efdd500f71f216d0835194b13637eb24848d2bf3a28692c51c1baac60
MD5 | 8cd0ad03eea94b98f4a10059b6dc063c
BLAKE2b-256 | 22bce3312d6f49b2761b747679774344f98d24815526773570f8faffa6b2210b