Skip to main content

cli utility for data mining https://nitter.net/serach

Project description

about

  • cli scraping application for nitter a popular *twitter mirror
  • works for debain/ubuntu/wsl2 and python3

installation

# install nitter cli application
pip install nitter-miner

# if youve never used nltk you have to do this its silly but yea
python3 -c "import nltk; nltk.download('all')"

# now you can use the nitter cli app 
# default installed to ~/.local/bin/nitter
nitter -q <query> or <query_file> 

cli usage

#nitter-miner: 0.1.0 nitter -q <query: str> xor -qf <query_file: str>

REQUIRED:

  • q: string query to search nitter for *for query formats see /query_info.txt

  • qf:
    if ur constantly searching the same set of queries make a queryfile.csv
    *note the header/first line is ignored.
    ex format of queryfile:
    nasa,spacex,...
    nasa,spacex,...
    @NASA,@SpaceX,...
    the actual query which will be searched is nasa OR spacex OR @NASA OR @SpaceX

*note do not use -q and -qf flags together it will not work also theres no reason to do this

OPTIONS                                                          ? DEFAULT  
--------------------------------------------------------------------------------------------    
- -p / --pgs: int, max number of pgs to search for query         ? 50  
- -m / --mirror: str -> which mirror to search                   ? https://nitter.net/search  
- --retweets: flag -> if present wont filter out retweets        ? false   
- --no-sentiments: flag -> if present wont append sentiments     ? false   
- --no-save: flag -> if present wont save to a file              ? false   

example usage see /tweets/231120_mcgill or mcgill university_262.csv:

>> nitter -q '"mcgill" OR "mcgill university"' -p 50

[nltk_data] Downloading package stopwords to
[nltk_data]     /home/sleepyzzz/nltk_data...
[nltk_data]   Package stopwords is already up-to-date!
[nltk_data] Downloading package punkt to /home/sleepyzzz/nltk_data...
[nltk_data]   Package punkt is already up-to-date!
[nltk_data] Downloading package wordnet to
[nltk_data]     /home/sleepyzzz/nltk_data...
[nltk_data]   Package wordnet is already up-to-date!
pgs 1/50
searching https://nitter.net/search?f=tweets&q="mcgill" OR "mcgill university"&since=&until=&near=

pgs 2/50
searching https://nitter.net/search/?f=tweets&q=%22mcgill%22+OR+%22mcgill+university%22&cursor=DAADDAABCgABF_Wn_9hW8WQKAAIX9abby9rQJAAIAAIAAAACCAADAAAAAAgABAAAAAAKAAUX9agjk8AnEAoABhf1qCOTv9jwAAA

pgs 3/50
searching https://nitter.net/search/?f=tweets&q=%22mcgill%22+OR+%22mcgill+university%22&cursor=DAADDAABCgABF_Wn_9hW8WQKAAIX9aVlDpsgwgAIAAIAAAACCAADAAAAAAgABAAAAAEKAAUX9agjk8AnEAoABhf1qCOTv7HgAAA

... **skipping for readme.md but it would go through pgs 4-46 aswell**

pgs 47/50
searching https://nitter.net/search/?f=tweets&q=%22mcgill%22+OR+%22mcgill+university%22&cursor=DAADDAABCgABF_Wn_9hW8WQKAAIX9XJLTpaRRgAIAAIAAAACCAADAAAAAAgABAAAAC0KAAUX9agjk8AnEAoABhf1qCOTuPsgAAA

pgs 48/50
searching https://nitter.net/search/?f=tweets&q=%22mcgill%22+OR+%22mcgill+university%22&cursor=DAADDAABCgABF_Wn_9hW8WQKAAIX9XFuEpcAQwAIAAIAAAACCAADAAAAAAgABAAAAC4KAAUX9agjk8AnEAoABhf1qCOTuNQQAAA

ity%22&cursor=DAADDAABCgABF_Wn_9hW8WQKAAIX9XCDc9eRFgAIAAIAAAACCAADAAAAAAgABAAAAC8KAAUX9agjk8AnEAoABhf1qCOTuK0AAAA       

pgs 50/50
searching https://nitter.net/search/?f=tweets&q=%22mcgill%22+OR+%22mcgill+university%22&cursor=DAADDAABCgABF_Wn_9hW8WQKAAIX9W-MNVawWQAIAAIAAAACCAADAAAAAAgABAAAADAKAAUX9agjk8AnEAoABhf1qCOTuIXwAAA

results: 262

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nitter-miner-0.1.0.tar.gz (128.2 kB view details)

Uploaded Source

Built Distribution

nitter_miner-0.1.0-py3-none-any.whl (127.2 kB view details)

Uploaded Python 3

File details

Details for the file nitter-miner-0.1.0.tar.gz.

File metadata

  • Download URL: nitter-miner-0.1.0.tar.gz
  • Upload date:
  • Size: 128.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.10

File hashes

Hashes for nitter-miner-0.1.0.tar.gz
Algorithm Hash digest
SHA256 19a3e332414b79793483294df35463518c52ebb71dc5e501b4b44e411f21f1c9
MD5 b44b273b4c38ababe1bcc290fb43094f
BLAKE2b-256 cc715475d5e3d3b26da990d4ea8081eec5337b7ca3270ca46b21cf5592f620df

See more details on using hashes here.

File details

Details for the file nitter_miner-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: nitter_miner-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 127.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.10

File hashes

Hashes for nitter_miner-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 71cd9b8697f7f8f35d9054f4a47acb2deb424b9a449fdff78f0934c9b6a8fffe
MD5 4765dc735b3c9a6434fdce265970cbf1
BLAKE2b-256 2efd4202c7138fe00e51dd8311d0e89d47f0ee492fcc9eac7e1dcef9df8549bc

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page