Research repository for the EasyLink ER ecosystem project.

Project description

EasyLink is a framework that allows users to build and run highly configurable entity resolution (ER) pipelines.

Installation

There are a few things to install in order to use this package:

  • Install Singularity. If it is not already installed on your system, you will likely need to request it from your system administrator. For installation instructions, refer to https://docs.sylabs.io/guides/4.1/admin-guide/installation.html

  • Install graphviz via

    > conda install graphviz

  • Install EasyLink via

    > pip install easylink

    OR

    > cd <path/to/repositories/>

    > git clone git@github.com:ihmeuw/easylink.git

    > # OR git clone https://github.com/ihmeuw/easylink.git

    > cd easylink

    > pip install .
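
After installing, it can be useful to confirm that the prerequisites are visible on your PATH. The following is a minimal sketch, not part of EasyLink: check_tools is a hypothetical helper name, and the executable names passed to it (singularity, dot for graphviz, and easylink) are assumptions about how the tools are exposed on your system.

```shell
# Hypothetical helper (not provided by EasyLink): report which of the
# given executables can be found on PATH.
check_tools() {
    missing=0
    for tool in "$@"; do
        # `command -v` is the POSIX way to look up an executable.
        if command -v "$tool" >/dev/null 2>&1; then
            echo "found: $tool"
        else
            echo "missing: $tool"
            missing=1
        fi
    done
    # Return non-zero if at least one tool was not found.
    return "$missing"
}
```

For example, check_tools singularity dot easylink prints one line per tool and returns a non-zero status if any of them is missing.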

Quickstart

To run a pipeline, use easylink run from the command line and pass in the paths to both a pipeline specification and an input data specification:

> easylink run -p <PIPELINE-SPECIFICATION> -i <INPUT-DATA-SPECIFICATION>

There are several other optional arguments to easylink run as well; for help, use easylink run --help.

Note that a schematic of the pipeline's directed acyclic graph (DAG) is generated automatically whenever a pipeline is run. To generate this schematic without actually running the pipeline, use easylink generate-dag:

> easylink generate-dag -p <PIPELINE-SPECIFICATION> -i <INPUT-DATA-SPECIFICATION>

As before, refer to easylink generate-dag --help for information on other options.
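
Both commands take the same two specification paths, so a small wrapper can check that the files exist before handing them to EasyLink. This is only a sketch under stated assumptions: easylink_checked and the EASYLINK_BIN variable are hypothetical names introduced here for illustration (neither is part of EasyLink), and only the -p and -i options described above are used.

```shell
# Hypothetical wrapper (not provided by EasyLink): validate both
# specification files, then call the requested easylink subcommand.
# EASYLINK_BIN is an illustrative override for the executable name;
# it defaults to `easylink`.
easylink_checked() {
    subcommand="$1"      # "run" or "generate-dag"
    pipeline_spec="$2"   # path to the pipeline specification
    input_spec="$3"      # path to the input data specification

    # Only the two subcommands described in this README are accepted.
    case "$subcommand" in
        run|generate-dag) ;;
        *) echo "unsupported subcommand: $subcommand" >&2; return 2 ;;
    esac

    # Fail early if either specification file does not exist.
    for spec in "$pipeline_spec" "$input_spec"; do
        if [ ! -f "$spec" ]; then
            echo "specification file not found: $spec" >&2
            return 1
        fi
    done

    "${EASYLINK_BIN:-easylink}" "$subcommand" -p "$pipeline_spec" -i "$input_spec"
}
```

For example, easylink_checked generate-dag <PIPELINE-SPECIFICATION> <INPUT-DATA-SPECIFICATION> behaves like the command shown above, but stops with an error message if either file is missing.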

Requirements

TBD

Documentation

TBD

Download files

Source Distribution

easylink-0.1.0.tar.gz (159.1 kB)

Built Distribution

easylink-0.1.0-py3-none-any.whl (135.6 kB)
