Skip to main content

PySpark JIRA Data Source

Project description

PySpark JIRA Data Source

pyspark-jira is a simple yet powerful Python package designed to integrate JIRA with Apache Spark. This package allows users to seamlessly read JIRA tickets and create Spark DataFrames, enabling efficient data processing and analysis within the Spark ecosystem. Whether you are managing agile workflows or performing detailed project analytics, pyspark-jira makes it easy to leverage the power of PySpark for your JIRA data.

Getting Started

Installation

pip install pyspark-jira

Usage

jira_username = "Your full email address that you used for logging in"
jira_api_token = "See https://id.atlassian.com/manage-profile/security"
jql_query = "project = 'ES'"
df = (
    spark.read.format("jira")
        .option("JIRA_USERNAME", jira_username)
        .option("JIRA_API_TOKEN", jira_api_token)
        .load(jql_query)
)
df.show()

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyspark-jira-0.0.2.tar.gz (6.6 kB view hashes)

Uploaded Source

Built Distribution

pyspark_jira-0.0.2-py2.py3-none-any.whl (7.8 kB view hashes)

Uploaded Python 2 Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page