Skip to main content

Tools to pull text data from github, slack, website, stackoverflow, etc.

Project description

spider-nest

Tools to pull text data from github, slack, website, stackoverflow, etc.

Components

gitpump

Fetch github data, including repos, issues, discussions, trendings. Requires github account tokens.

slackpump

Fetch slack messages of a workspace. Requires slack bot token.

stackpump

Fetch questions/answers from stackoverflow by web parser

web2markdown

Get web pages and parse text content to markdown content.

From source code

git clone git@github.com:yhmo/spider-nest.git
cd spider-nest
pip3 install -r ./requirements.txt

Installation

pip3 install spider-nest

Requirement

python >= 3.7

How to run examples

To run the examples of gitpump, follow the steps:

  1. create a github personal token by following this guide
  2. create a text file token_github.txt under the examples folder
  3. put some github tokens into the token_github.txt, no delimiter, one line for each token
  4. now you can run examples example_github_xxx.py

To run the example of example_slack_xxx.py

  1. create a slack bot, bind the bot to a workspace, get the token
  2. create a text file token_slack.txt under the examples folder
  3. put the bot token into the token_slack.txt
  4. now you can run examples example_slack_xxx.py

You can run other examples without specific setting

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

spider-nest-0.2.0.tar.gz (67.3 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page