Skip to main content

A toolkit for accessing and working with data from the CRITT Translation Process Research Database (TPR-DB).

Project description

tprdb-utilities

PyPI version Python License: MIT

A Python toolkit for downloading and reading data tables from the CRITT Translation Process Research Database (TPR-DB).

Two functions cover the full workflow:

Function What it does
fetch_TPRDB_tables Downloads study tables from the CRITT API and saves them to a local directory structure
read_TPRDB_tables Reads those tables (locally or on the CRITT server) into a single pandas.DataFrame

Installation

# pip
pip install tprdb-utilities

# uv
uv add tprdb-utilities

# poetry
poetry add tprdb-utilities

Quick Start

1 — Download data (fetcher)

Public study (no credentials needed):

from tprdb_utilities import fetch_TPRDB_tables

fetch_TPRDB_tables(
    path="/path/to/local/data",
    StudyID="DG21",
    extension=["kd", "ss"],
    public=True,
)

Private study (requires your TPR-DB username and API token):

from tprdb_utilities import fetch_TPRDB_tables

fetch_TPRDB_tables(
    path="/path/to/local/data",
    StudyID="MYSTUDY",
    extension=["kd"],
    public=False,
    username="myTPRDBusername",   # case-sensitive, must match your account
    token="my-api-token",
)

After downloading, the function always prints a summary like this:

=== fetch_TPRDB_tables Summary ===
StudyID  : DG21
Clone dir: /path/to/local/data/tprdb-mothership-clone
User dir : TPRDB

Extension  Status      Time
---------  ----------  ------
kd         Downloaded  1.23s
ss         Downloaded  0.98s

To read these files with read_TPRDB_tables:
  path      = "/path/to/local/data/tprdb-mothership-clone"
  user      = "TPRDB"
  studies   = ["DG21"]

Copy those argument values directly into read_TPRDB_tables.


2 — Read data (reader)

From a local clone (mothership=False) — after running fetch_TPRDB_tables:

from tprdb_utilities import read_TPRDB_tables

df = read_TPRDB_tables(
    studies=["DG21", "AR22"],
    extension="kd",
    mothership=False,
    path="/path/to/local/data/tprdb-mothership-clone",
    user="TPRDB",
)

Directly on the CRITT TPR-DB server (mothership=True):

from tprdb_utilities import read_TPRDB_tables

df = read_TPRDB_tables(
    studies=["DG21", "AR22"],
    extension="kd",
    mothership=True,   # path is set automatically; no path argument needed
)

Directory Structure

fetch_TPRDB_tables creates the following layout under path:

<path>/
└── tprdb-mothership-clone/
    ├── TPRDB/                  ← public studies
    │   └── <StudyID>/
    │       └── Tables/
    │           ├── session1.kd
    │           └── ...
    └── <username>/             ← private studies
        └── <StudyID>/
            └── Tables/
                ├── session1.kd
                └── ...

read_TPRDB_tables with mothership=False expects this exact layout, so the two functions are designed to work together seamlessly.


Supported Table Extensions

ss, sg, st, tt, kd, fd, au, pu, hof, pol


License

MIT — see LICENSE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tprdb_utilities-0.1.0.tar.gz (6.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

tprdb_utilities-0.1.0-py3-none-any.whl (8.6 kB view details)

Uploaded Python 3

File details

Details for the file tprdb_utilities-0.1.0.tar.gz.

File metadata

  • Download URL: tprdb_utilities-0.1.0.tar.gz
  • Upload date:
  • Size: 6.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for tprdb_utilities-0.1.0.tar.gz
Algorithm Hash digest
SHA256 cd9a3e758dad80932089a83d5a3c06c6e522b2c579a19636426adf1a1d87ab69
MD5 19f620efd29195c447834f9a7c6763e1
BLAKE2b-256 11414702841c8e644e8bd3362bb2f77f2703964daf30a71b829aac4ccc318fc9

See more details on using hashes here.

File details

Details for the file tprdb_utilities-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for tprdb_utilities-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 b16c073a0f3a15a9f9d3bd80c35170114606dddf510b3aec4ec0e81a5ac99795
MD5 88842166aa6a841484a25f52d7233011
BLAKE2b-256 41530f83f43d13802401de12513ccef1e27e9aed572060d8143f4cd824abccdf

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page