Skip to main content

Utilities for ready-made Databricks, to shorten and facilitate the development of notebooks and jobs

Project description

PhoenixDataTools

made-with-python made-with-python for-use-in-Databricks

Documentation Status PyPI version

PhoenixDataTools is a Python library specifically designed to optimize development in Databricks environments using PySpark. This library is ideal for integration into Databricks notebooks and jobs, offering a simplified and efficient interface for data manipulation.

The primary goal of PhoenixDataTools is to provide an abstraction layer that simplifies and accelerates working with small, medium, and large volumes of data, eliminating the need to write extensive lines of repetitive code. The library facilitates complex operations with DataFrames, Delta table manipulation, and management of storage and reading connections. This results in a significant improvement in productivity and code maintenance.


Topics

1 - Applied technologies

2 - Install library

3 - Utilization

4 - Library documentation

5 - License

6 - Local Development


Applied technologies

Poetry

Poetry is a dependency and package management tool for Python. It simplifies the creation, management, and publication of Python projects, ensuring a consistent development environment.

pytest

pytest is a powerful and flexible testing framework for Python. It makes writing simple and complex tests easy, allowing for robust and scalable test suites.

pytest-cov

pytest-cov is a plugin for pytest that generates code coverage reports. It helps ensure that all parts of your code are tested by providing detailed coverage metrics.

taskipy

taskipy is a tool that allows you to run scripts defined in pyproject.toml. It simplifies managing common project tasks by automating repetitive processes.

flake8

flake8 is a linting tool for Python that combines PyFlakes, pycodestyle, and mccabe. It checks your code for style errors and potential issues, helping maintain clean and consistent code.

isort

isort is a tool to automatically sort imports in your Python files. It organizes your imports according to style conventions, improving code readability and maintainability.

MkDocs

MkDocs is a static site generator geared towards project documentation. With it, you can create elegant and easy-to-navigate documentation websites from Markdown files.

Read the Docs

Read the Docs is a documentation hosting service that automates the building and versioning of your project's documentation. It integrates easily with code repositories, making documentation maintenance and access straightforward.

Install library

You can install phoenixdatatools directly from PyPI by running the following command:

pip install phoenixdatatools

This will install the latest version of the library.

Utilization

After installation, you can use the library directly in your code with the following syntax:

from phoenixdatatools import module

Each module provides specific functions relevant to its purpose. You can find detailed information about the available modules and functions in the following section: Library Documentation.

Example:

from phoenixdatatools import data_quality

data_quality.not_null_test(dataframe, columns)

Library documentation

The library has comprehensive documentation covering all modules, usage instructions, and detailed examples. For complete information and guidance on how to use the library, visit our official documentation on Read the Docs.

License

This project is licensed under the MIT License - see the LICENSE file for more details.

Local Development

For development and contribution to the project, access the repository and go to section Development

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

phoenixdatatools-0.1.0.tar.gz (8.0 kB view details)

Uploaded Source

Built Distribution

phoenixdatatools-0.1.0-py3-none-any.whl (8.2 kB view details)

Uploaded Python 3

File details

Details for the file phoenixdatatools-0.1.0.tar.gz.

File metadata

  • Download URL: phoenixdatatools-0.1.0.tar.gz
  • Upload date:
  • Size: 8.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.3 CPython/3.9.19 Linux/5.15.153.1-microsoft-standard-WSL2

File hashes

Hashes for phoenixdatatools-0.1.0.tar.gz
Algorithm Hash digest
SHA256 8cf8e3b09166f8f56df849fa45cea1bd379bb52a386bb3b0d29ae485b8a96a64
MD5 3eb3e49c33320be02a8557670ba12f5e
BLAKE2b-256 336964407e6dc17ea70964d71280fdaac3858056ae110ba8f634ab6de720ac90

See more details on using hashes here.

File details

Details for the file phoenixdatatools-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: phoenixdatatools-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 8.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.3 CPython/3.9.19 Linux/5.15.153.1-microsoft-standard-WSL2

File hashes

Hashes for phoenixdatatools-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 9710d246d9d677e94521bdbee4ebd027d02a1945f354a6801016f19c41f1c11f
MD5 a4938516507454e63459d90bc76c5a55
BLAKE2b-256 27a12305a60abe1a22def4622c41c753eb5bfffb144e92a637d7ef9907cd5a83

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page