Skip to main content

No project description provided

Project description

nodestream-plugin-github

Overview

This plugin provides a way to scrape GitHub data from the REST api and ingest them as extractors in nodestream pipelines.

Setup Neo4j

  1. Download and install Neo4j: https://neo4j.com/docs/desktop-manual/current/installation/download-installation/
  2. Create and start database (version 5.7.0: https://neo4j.com/docs/desktop-manual/current/operations/create-dbms/
  3. Install APOC: https://neo4j.com/docs/apoc/5/installation/

Create GitHub credentials

  1. Create and GitHub access codes: https://docs.github.com/en/enterprise-server@3.12/apps/creating-github-apps/authenticating-with-a-github-app/generating-a-user-access-token-for-a-github-app NOTE: These values will be used in your .env

Install and run the app

  1. Install python3: https://www.python.org/downloads/
  2. Install poetry: https://python-poetry.org/docs/#installation
  3. Install nodestream: https://nodestream-proj.github.io/nodestream/0.5/docs/tutorial/
  4. Generate a new nodestream project
  5. Add nodestream-github to your project dependencies in your nodestream projects pyproject.toml file.
  6. Install necessary dependencies: poetry install
  7. In nodestream.yaml add the following:
plugins:
  - name: github
    config:
      github_hostname: github.example.com
      auth_token: !env GITHUB_ACCESS_TOKEN
      user_agent: skip-jbristow-test
      per_page: 100
      collecting:
        all_public: True
      rate_limit_per_minute: 225
    targets:
      - my-db:
    pipelines:
      - name: github_repos
      - name: github_teams
targets:
  database: neo4j
  uri: bolt://localhost:7687
  username: neo4j
  password: neo4j123
  1. Set environment variables in your terminal session for: GITHUB_ACCESS_TOKEN.
  2. Verify nodestream has loaded the pipelines: poetry run nodestream show
  3. Use nodestream to run the pipelines: poetry run nodestream run <pipeline-name> --target my-db

Using make

  1. Install make (ie. brew install make)
  2. Run make run

Contributing

When contributing, make sure to sign your commits. To find out more about how to do this, refer to this GitHub documentation.

Authors

  • Jon Bristow
  • Zach Probst
  • Rohith Reddy

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nodestream_plugin_github-0.14.1b6.tar.gz (16.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

nodestream_plugin_github-0.14.1b6-py3-none-any.whl (24.8 kB view details)

Uploaded Python 3

File details

Details for the file nodestream_plugin_github-0.14.1b6.tar.gz.

File metadata

File hashes

Hashes for nodestream_plugin_github-0.14.1b6.tar.gz
Algorithm Hash digest
SHA256 c2f54c4be93e4a3dbfdc5edd5f35606365a95866dab2b28c592dd6db6740c0db
MD5 4d763c004c42e3e39833009f171b89d2
BLAKE2b-256 0c41479dab77b8507f65dc93d6d67fa2eac4815370dc265f7de9768239366b7a

See more details on using hashes here.

Provenance

The following attestation bundles were made for nodestream_plugin_github-0.14.1b6.tar.gz:

Publisher: release.yaml on nodestream-proj/nodestream-plugin-github

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file nodestream_plugin_github-0.14.1b6-py3-none-any.whl.

File metadata

File hashes

Hashes for nodestream_plugin_github-0.14.1b6-py3-none-any.whl
Algorithm Hash digest
SHA256 1354aca32fa0d36363575035e488ce23c7a11fb6d8a2f10ce9baf74a6180fe4a
MD5 7215db9425f35108a0b13fd93f2a264b
BLAKE2b-256 d603b31dc05b1da3c719ff4f705053f0e0f7cf2363978893322459a4efa82e8a

See more details on using hashes here.

Provenance

The following attestation bundles were made for nodestream_plugin_github-0.14.1b6-py3-none-any.whl:

Publisher: release.yaml on nodestream-proj/nodestream-plugin-github

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page