Diff-it: Spark Dataframe Differ
Project description
Diff-it: Data Differ
- Overview
- Prerequisites
- Getting Started
- (macOS Users) Upgrading GNU Make
- Help
- Running the Test Harness
- FAQs
Overview
diffit
will report differences between two data sets with similar schema.
Refer to Diffit's documentation for detailed instructions.
Prerequisites
- GNU make
- Python 3 Interpreter. We recommend installing pyenv
- Docker
Getting Started
Makester is used as the Integrated Developer Platform.
(macOS Users only) Upgrading GNU Make
Follow these notes to get GNU make.
Creating the Local Environment
Get the code and change into the top level git
project directory:
git clone git@github.com:loum/diffit.git && cd diffit
NOTE: Run all commands from the top-level directory of the
git
repository.
For first-time setup, get the Makester project:
git submodule update --init
Initialise the environment:
make init-dev
Local Environment Maintenance
Keep Makester project up-to-date with:
git submodule update --remote --merge
Help
There should be a make
target to get most things done. Check the help for more information:
make help
Running the Test Harness
We use pytest. To run the tests:
make tests
FAQs
Q. Why do I get WARNING: An illegal reflective access operation has occurred
?
Seems to be related to the JVM version being used. Java 8 will suppress the warning. To check available Java versions on your Mac try /usr/libexec/java_home -V
. Then:
export JAVA_HOME=$(/usr/libexec/java_home -v <java_version>)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file diffit-0.1.5.tar.gz
.
File metadata
- Download URL: diffit-0.1.5.tar.gz
- Upload date:
- Size: 14.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | aadfaf263a29a2b973bc1ea0ef35ea3c514c40d026551ce44b060152275d334e |
|
MD5 | 72eb2ef7d0dfdaf8cb50b1ded066b595 |
|
BLAKE2b-256 | cbadc6d258eae79bf0c368e50604e0f7c747b6c1c356723ac9b2e3f72ea661b6 |
File details
Details for the file diffit-0.1.5-py3-none-any.whl
.
File metadata
- Download URL: diffit-0.1.5-py3-none-any.whl
- Upload date:
- Size: 14.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | cf391ab6c137a63e7020c0efc94561b1bad586ab9775d4df6c6695415122664e |
|
MD5 | 46f4a9a9363a7fa5a4a71d716e991179 |
|
BLAKE2b-256 | 20f1d48e0081d05c490569d0e22c5b21ffa57a574e15df200d13405dc7615c15 |