Diff-it: Spark Dataframe Differ
Project description
Diff-it: Data Differ
- Overview
- Prerequisites
- Getting Started
- (macOS Users) Upgrading GNU Make
- Help
- Running the Test Harness
- FAQs
Overview
diffit will report differences between two data sets with similar schema.
Refer to Diffit's documentation for detailed instructions.
Prerequisites
- GNU make
- Python 3 Interpreter. We recommend installing pyenv
- Docker
Getting Started
Makester is used as the Integrated Developer Platform.
(macOS Users only) Upgrading GNU Make
Follow these notes to get GNU make.
Creating the Local Environment
Get the code and change into the top level git project directory:
git clone git@github.com:loum/diffit.git && cd diffit
NOTE: Run all commands from the top-level directory of the
gitrepository.
For first-time setup, get the Makester project:
git submodule update --init
Initialise the environment:
make init-dev
Local Environment Maintenance
Keep Makester project up-to-date with:
git submodule update --remote --merge
Help
There should be a make target to get most things done. Check the help for more information:
make help
Running the Test Harness
We use pytest. To run the tests:
make tests
FAQs
Q. Why do I get WARNING: An illegal reflective access operation has occurred?
Seems to be related to the JVM version being used. Java 8 will suppress the warning. To check available Java versions on your Mac try /usr/libexec/java_home -V. Then:
export JAVA_HOME=$(/usr/libexec/java_home -v <java_version>)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file diffit-0.1.5.tar.gz.
File metadata
- Download URL: diffit-0.1.5.tar.gz
- Upload date:
- Size: 14.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
aadfaf263a29a2b973bc1ea0ef35ea3c514c40d026551ce44b060152275d334e
|
|
| MD5 |
72eb2ef7d0dfdaf8cb50b1ded066b595
|
|
| BLAKE2b-256 |
cbadc6d258eae79bf0c368e50604e0f7c747b6c1c356723ac9b2e3f72ea661b6
|
File details
Details for the file diffit-0.1.5-py3-none-any.whl.
File metadata
- Download URL: diffit-0.1.5-py3-none-any.whl
- Upload date:
- Size: 14.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
cf391ab6c137a63e7020c0efc94561b1bad586ab9775d4df6c6695415122664e
|
|
| MD5 |
46f4a9a9363a7fa5a4a71d716e991179
|
|
| BLAKE2b-256 |
20f1d48e0081d05c490569d0e22c5b21ffa57a574e15df200d13405dc7615c15
|