Skip to main content

Singer tap for ClinicalTrials.gov, built with the Meltano SDK for Singer Taps.

Project description

tap-clinicaltrials

Singer tap for ClinicalTrials.gov study records data.

Built with the Meltano Tap SDK for Singer Taps.

Capabilities

  • catalog
  • state
  • discover
  • activate-version
  • about
  • stream-maps
  • schema-flattening
  • batch
  • structured-logging

Settings

Setting Required Default Description
start_date False None Earliest datetime to get data from
condition False None Conditions or disease query
sponsor False None Sponsor query
stream_maps False None Config object for stream maps capability. For more information check out Stream Maps.
stream_map_config False None User-defined config values to be used within map expressions.
flattening_enabled False None 'True' to enable schema flattening and automatically expand nested properties.
flattening_max_depth False None The max depth to flatten schemas.
batch_config False None

A full list of supported settings and capabilities is available by running: tap-clinicaltrials --about

Installation

In a Meltano project

Using a direct reference

meltano add extractor tap-clinicaltrials --from-ref=https://raw.githubusercontent.com/edgarrmondragon/tap-clinicaltrials/main/plugin.yaml

Requires Meltano v3.1.0+.

From MeltanoHub

Not yet available.

From PyPI

python3 -m pip install --upgrade tap-clinicaltrials

With pipx

pipx install tap-clinicaltrials

From source

git clone https://github.com/edgarrmondragon/tap-clinicaltrials
cd tap-clinicaltrials
python3 -m pip install .

Usage

You can easily run tap-clinicaltrials by itself or in a pipeline using Meltano.

With Meltano

  1. Clone the repo and cd into it:

    git clone https://github.com/edgarrmondragon/tap-clinicaltrials.git
    cd tap-clinicaltrials
    
  2. Make sure you have Meltano installed

  3. Install all plugins

    meltano install
    
  4. Configure the tap-clinicaltrials tap:

    meltano config tap-clinicaltrials set start_date '2020-01-01'
    meltano config tap-clinicaltrials set condition 'COVID-19'
    meltano config tap-clinicaltrials set sponsor 'Pfizer'
    
  5. Run a test tap-clinicaltrials extraction

    meltano run tap-clinicaltrials target-duckdb
    
  6. That's it! Check the data

    $ duckdb output/warehouse.duckdb -c "select nctid, lastUpdateSubmitDate, protocolsection->>'$.identificationModule.briefTitle' from clinicaltrials.studies limit 5;
    ┌─────────────┬──────────────────────┬─────────────────────────────────────────────────────────────────────────────────────────────────────┐
    │    nctid    │ lastupdatesubmitdate │                      (protocolsection ->> '$.identificationModule.briefTitle')                      │
    │   varchar   │       varchar        │                                               varchar                                               │
    ├─────────────┼──────────────────────┼─────────────────────────────────────────────────────────────────────────────────────────────────────┤
    │ NCT06156215 │ 2023-12-06           │ PROmotion of COVID-19 BOOSTer VA(X)Ccination in the Emergency Department - PROBOOSTVAXED            │
    │ NCT05487040 │ 2023-12-06           │ A Study to Measure the Amount of Study Medicine in Blood in Adult Participants With COVID-19 and …  │
    │ NCT06163677 │ 2023-12-07           │ A Study to Look at the Health Outcomes of Patients With COVID-19 and Influenza.                     │
    │ NCT05032976 │ 2023-12-07           │ Korea Comirnaty Post-marketing Surveillance                                                         │
    │ NCT05596734 │ 2023-12-11           │ A Study to Evaluate the Safety, Tolerability, and Immunogenicity of Combined Modified RNA Vaccine…  │
    └─────────────┴──────────────────────┴─────────────────────────────────────────────────────────────────────────────────────────────────────┘
    

Executing the Tap Directly

tap-clinicaltrials --version
tap-clinicaltrials --help
tap-clinicaltrials --config CONFIG --discover > ./catalog.json

Developer Resources

Initialize your Development Environment

pipx install hatch

Create and Run Tests

Run integration tests:

hatch run test:integration

You can also test the tap-clinicaltrials CLI interface directly:

hatch run sync:console -- --about --format=json

SDK Dev Guide

See the dev guide for more instructions on how to use the SDK to develop your own taps and targets.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tap_clinicaltrials-0.3.0.tar.gz (78.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

tap_clinicaltrials-0.3.0-py3-none-any.whl (11.1 kB view details)

Uploaded Python 3

File details

Details for the file tap_clinicaltrials-0.3.0.tar.gz.

File metadata

  • Download URL: tap_clinicaltrials-0.3.0.tar.gz
  • Upload date:
  • Size: 78.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for tap_clinicaltrials-0.3.0.tar.gz
Algorithm Hash digest
SHA256 50d52e53d00aa04ae14eebd753eac07ac7b0eb18ab267b86ae3d4641e49cb956
MD5 b2f62e71db0436d7aa3171eb7d001b70
BLAKE2b-256 f9aab29ec452ecedf96b42a169424c17c03ea5de95438ba762fb7ae5b000ce31

See more details on using hashes here.

Provenance

The following attestation bundles were made for tap_clinicaltrials-0.3.0.tar.gz:

Publisher: build.yaml on reservoir-data/tap-clinicaltrials

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file tap_clinicaltrials-0.3.0-py3-none-any.whl.

File metadata

File hashes

Hashes for tap_clinicaltrials-0.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 83c29269bd4da9e0367b50d2c606305c4f8e24af56d83a480b8a331442402904
MD5 2b14eef853e7caaf58e03e7124275441
BLAKE2b-256 d1f18a37b770ec6062899eadd0ced818d0e15e69aada46765bccf5a814f5d5fc

See more details on using hashes here.

Provenance

The following attestation bundles were made for tap_clinicaltrials-0.3.0-py3-none-any.whl:

Publisher: build.yaml on reservoir-data/tap-clinicaltrials

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page