Skip to main content

Research repository for the EasyLink ER ecosystem project.

Project description

EasyLink is a framework that allows users to build and run highly configurable entity resolution (ER) pipelines.

Installation

There are a few things to install in order to use this package:

  • Install singularity. If this is not already installed on your system, you will likely need to request it from your system admin. Refer to https://docs.sylabs.io/guides/4.1/admin-guide/installation.html

  • Install graphviz via

    > conda install graphviz

  • Install EasyLink via

    > pip install easylink

    OR

    > cd <path/to/repositories/>

    > git clone git@github.com:ihmeuw/easylink.git

    > # OR git clone https://github.com/ihmeuw/easylink.git

    > cd easylink

    > pip install .

Quickstart

To run a pipeline, use easylink run from the command line and pass in the paths to both a pipeline specification and an input data specification:

> easylink run -p <PIPELINE-SPECIFICATION> -i <INPUT-DATA-SPECIFICATION>

There are several other optional arguments to easylink run as well; for help, use easylink run –help

Note that a schematic of the pipeline’s directed acyclic graph (DAG) that is run is automatically generated. If this schematic is desired _without_ actually running the pipeline, use easylink generate-dag:

> easylink generate-dag -p <PIPELINE-SPECIFICATION> -i <INPUT-DATA-SPECIFICATION>

As before, refer to easylink generate-dag –help for information on other options.

Requirements

TBD

Documentation

TBD

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

easylink-0.1.1.tar.gz (162.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

easylink-0.1.1-py3-none-any.whl (136.7 kB view details)

Uploaded Python 3

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page