A twarc plugin to extract hashtags from Twitter data
This module is extends twarc with a
hashtags command that will extract and
count the hashtags in a tweet dataset.
pip install twarc-hashtags
Collect some Twitter data, for example:
twarc2 search blacklivesmatter tweets.jsonl
Because you installed the plugin you have a new subcommand
twarc2 hashtags tweets.jsonl hashtags.csv
hashtags.csv in your favourite spreadsheet program or
Behind the scenes twarc-hashtags uses Python's native support for SQLite to
create a database and then insert/query it. You can see this database after the
program finishes as
hashtags.db in your current working directory.
--group: group results by day, week, month, year
--limit: limit to this number of hashtags (per group if --group is used)
--db: if you would like to name the database something other than
--no-insert: use an existing database instead of inserting (useful for large numbers of tweets)
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.