Skip to main content

Diff-it: Spark Dataframe Differ

Project description

Diff-it: Data Differ

Overview

diffit will report differences between two data sets with similar schema.

Refer to Diffit's documentation for detailed instructions.

Prerequisites

Getting Started

Makester is used as the Integrated Developer Platform.

(macOS Users only) Upgrading GNU Make

Follow these notes to get GNU make.

Creating the Local Environment

Get the code and change into the top level git project directory:

git clone git@github.com:loum/diffit.git && cd diffit

NOTE: Run all commands from the top-level directory of the git repository.

For first-time setup, get the Makester project:

git submodule update --init

Initialise the environment:

make init-dev

Local Environment Maintenance

Keep Makester project up-to-date with:

git submodule update --remote --merge

Help

There should be a make target to get most things done. Check the help for more information:

make help

Running the Test Harness

We use pytest. To run the tests:

make tests

FAQs

Q. Why do I get WARNING: An illegal reflective access operation has occurred? Seems to be related to the JVM version being used. Java 8 will suppress the warning. To check available Java versions on your Mac try /usr/libexec/java_home -V. Then:

export JAVA_HOME=$(/usr/libexec/java_home -v <java_version>)

top

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

diffit-0.1.5.tar.gz (14.1 kB view details)

Uploaded Source

Built Distribution

diffit-0.1.5-py3-none-any.whl (14.8 kB view details)

Uploaded Python 3

File details

Details for the file diffit-0.1.5.tar.gz.

File metadata

  • Download URL: diffit-0.1.5.tar.gz
  • Upload date:
  • Size: 14.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.5

File hashes

Hashes for diffit-0.1.5.tar.gz
Algorithm Hash digest
SHA256 aadfaf263a29a2b973bc1ea0ef35ea3c514c40d026551ce44b060152275d334e
MD5 72eb2ef7d0dfdaf8cb50b1ded066b595
BLAKE2b-256 cbadc6d258eae79bf0c368e50604e0f7c747b6c1c356723ac9b2e3f72ea661b6

See more details on using hashes here.

File details

Details for the file diffit-0.1.5-py3-none-any.whl.

File metadata

  • Download URL: diffit-0.1.5-py3-none-any.whl
  • Upload date:
  • Size: 14.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.5

File hashes

Hashes for diffit-0.1.5-py3-none-any.whl
Algorithm Hash digest
SHA256 cf391ab6c137a63e7020c0efc94561b1bad586ab9775d4df6c6695415122664e
MD5 46f4a9a9363a7fa5a4a71d716e991179
BLAKE2b-256 20f1d48e0081d05c490569d0e22c5b21ffa57a574e15df200d13405dc7615c15

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page