Skip to main content

cli utility for data mining https://nitter.net/serach

Project description

about

  • cli scraping application for nitter a popular *twitter mirror
  • works for debain/ubuntu/wsl2 and python3

recommended installation

pip install nitter-miner

# now you can use the nitter cli app 
# default installed to ~/.local/bin/nitter
nitter -q <query> or <query_file> 

manual installation

# clone and cd to the nitter_stuff repository
git clone 'https://github.com/hashirkz/nitter_scraper'
cd nitter_scraper

python3 -m venv .venv
. ./.venv/bin/activate
python3 -m pip install -r 'requirements.txt'

# for td bank tweets queries defaults to 50pgs
python3 -m __nitter__.nitter_scraper

# for cli application
python3 -m __nitter__.nitter

cli usage

nitter -q <query: str> xor -qf <query_file: str> OPTIONS

-q:   string query to search nitter for *for query formats see /query_info.txt

-qf: 
      if ur constantly searching the same set of queries make a queryfile.csv
      *note the header/first line is ignored.
      ex format of queryfile:
      nasa,spacex,...
      nasa,spacex,...
      @NASA,@SpaceX,

      the actual query which will be searched is 'nasa OR spacex OR @NASA OR @SpaceX'

*note do not use -q and -qf flags together it will not work also theres no reason to do this

OPTIONS                                                        ? DEFAULT
------------------------------------------------------------------------
-p / --pgs: int, max number of pgs to search for query         ? 50
-m / --mirror: str -> which mirror to search                   ? https://nitter.net/search
--retweets: flag -> if present wont filter out retweets        ? false
--no-sentiments: flag -> if present wont append sentiments     ? false
--no-save: flag -> if present wont save to a file              ? false

example usage *see "/tweets/231120_mcgill or mcgill university_262.csv":

>> nitter -q '"mcgill" OR "mcgill university"' -p 50

[nltk_data] Downloading package stopwords to
[nltk_data]     /home/sleepyzzz/nltk_data...
[nltk_data]   Package stopwords is already up-to-date!
[nltk_data] Downloading package punkt to /home/sleepyzzz/nltk_data...
[nltk_data]   Package punkt is already up-to-date!
[nltk_data] Downloading package wordnet to
[nltk_data]     /home/sleepyzzz/nltk_data...
[nltk_data]   Package wordnet is already up-to-date!
pgs 1/50
searching https://nitter.net/search?f=tweets&q="mcgill" OR "mcgill university"&since=&until=&near=

pgs 2/50
searching https://nitter.net/search/?f=tweets&q=%22mcgill%22+OR+%22mcgill+university%22&cursor=DAADDAABCgABF_Wn_9hW8WQKAAIX9abby9rQJAAIAAIAAAACCAADAAAAAAgABAAAAAAKAAUX9agjk8AnEAoABhf1qCOTv9jwAAA

pgs 3/50
searching https://nitter.net/search/?f=tweets&q=%22mcgill%22+OR+%22mcgill+university%22&cursor=DAADDAABCgABF_Wn_9hW8WQKAAIX9aVlDpsgwgAIAAIAAAACCAADAAAAAAgABAAAAAEKAAUX9agjk8AnEAoABhf1qCOTv7HgAAA

... **skipping for readme.md but it would go through pgs 4-46 aswell**

pgs 47/50
searching https://nitter.net/search/?f=tweets&q=%22mcgill%22+OR+%22mcgill+university%22&cursor=DAADDAABCgABF_Wn_9hW8WQKAAIX9XJLTpaRRgAIAAIAAAACCAADAAAAAAgABAAAAC0KAAUX9agjk8AnEAoABhf1qCOTuPsgAAA

pgs 48/50
searching https://nitter.net/search/?f=tweets&q=%22mcgill%22+OR+%22mcgill+university%22&cursor=DAADDAABCgABF_Wn_9hW8WQKAAIX9XFuEpcAQwAIAAIAAAACCAADAAAAAAgABAAAAC4KAAUX9agjk8AnEAoABhf1qCOTuNQQAAA

ity%22&cursor=DAADDAABCgABF_Wn_9hW8WQKAAIX9XCDc9eRFgAIAAIAAAACCAADAAAAAAgABAAAAC8KAAUX9agjk8AnEAoABhf1qCOTuK0AAAA       

pgs 50/50
searching https://nitter.net/search/?f=tweets&q=%22mcgill%22+OR+%22mcgill+university%22&cursor=DAADDAABCgABF_Wn_9hW8WQKAAIX9W-MNVawWQAIAAIAAAACCAADAAAAAAgABAAAADAKAAUX9agjk8AnEAoABhf1qCOTuIXwAAA

results: 262

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nitter-miner-0.0.5.tar.gz (128.4 kB view details)

Uploaded Source

Built Distribution

nitter_miner-0.0.5-py3-none-any.whl (127.2 kB view details)

Uploaded Python 3

File details

Details for the file nitter-miner-0.0.5.tar.gz.

File metadata

  • Download URL: nitter-miner-0.0.5.tar.gz
  • Upload date:
  • Size: 128.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.10

File hashes

Hashes for nitter-miner-0.0.5.tar.gz
Algorithm Hash digest
SHA256 1bac29c7c0a1c55d3a1ca5464be6aceb8aa924cb050448aa5bec15d456c081bf
MD5 0da28c214fea90a6efb62ae3936bcdd5
BLAKE2b-256 d217347a2d7fba697656e092a7fcec460940f5b391dc4acffd49c10b67be2e3d

See more details on using hashes here.

File details

Details for the file nitter_miner-0.0.5-py3-none-any.whl.

File metadata

  • Download URL: nitter_miner-0.0.5-py3-none-any.whl
  • Upload date:
  • Size: 127.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.10

File hashes

Hashes for nitter_miner-0.0.5-py3-none-any.whl
Algorithm Hash digest
SHA256 6c5521ebadccbbbf0d38cd6f7d0f0dd27332fbf0fdd454edda29406d95f5d90b
MD5 fe9e1fe9ce7428e26c0c33b235c7296c
BLAKE2b-256 adcb5d5268e715b6634b994e5f7a1ef7cf348c6e2c0c11cf2268bfa3edcedb05

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page