Any-to-Any Terminology Mapping (AATM) is a framework for creating mappings between different terminologies or vocabularies. It uses OHDSI standard vocabularies and data models as a basis for mapping, and can be used to create mappings between any two terminologies or vocabularies. AATM is designed to be flexible and extensible, allowing users to create mappings for a wide range of use cases.

These details have not been verified by PyPI

Project links

Project description

Any-to-Any Terminology Mapping

Any-to-Any Terminology Mapping is an open-source Python framework designed to facilitate terminology mapping tasks. The library organizes this process in a modular and extensible way to support multiple use cases and incorporate new techniques as they emerge.

In simple terms, mapping a new expression to a specific terminology involves considering many possible expressions, retrieving the best candidate target terms, and selecting them manually, which can be effortful and time-consuming.

AATM leverages the OMOP vocabularies to facilitate this task. These vocabularies reflect large, community-driven mapping efforts that connect many different health-related terminologies and classifications worldwide and organize them around standard terminologies, which serve as the central connecting nodes in the system. As these mapping efforts continue, healthcare-related concepts become increasingly well represented in these vocabularies, creating a virtuous cycle and increasing the chances of finding a strong correspondence for a new unmapped expression.

Standard vocabularies

To accomplish this, AATM organizes the mapping process into very simple steps:

translation, which can be optional;
retrieval, which explores what is available from prior mapping efforts; and
selection, which connects a standard concept to the new expression being mapped.

Once this connection is made, every link associated with that standard concept becomes immediately available, enabling mapping to many different terminologies and classifications that are already connected to that concept, effectively breaking down barriers to interoperability in healthcare.

Terminology mapping pipeline

Documentation

The full documentation is available at: https://precisiondata.github.io/aatm

Installation

Install the package in your environment.

pip install aatm

If you want to build from the source, clone the repository and install it locally:

git clone https://github.com/precisiondata/aatm.git
pip install -e .

git clone https://github.com/precisiondata/aatm.git
uv sync

1. Prepare your OMOP vocabularies directory

Before running aatm init, download the OMOP vocabularies you want to use and place them in a directory. You can find them at https://athena.ohdsi.org/vocabulary/list

By default, the CLI expects it at the root directory:

./vocabularies

If you do not use that location, you can point the CLI to a different directory with the option --vocab-dir or -vd. The CLI validates this path during initialization.

2. Run the initialization command

The init command is the main CLI setup workflow.

It does all of the following for you:

creates the local .aatm helper directory where the local databases and aatm config files will be stored
ensures .aatm is added to .gitignore
builds the local OMOP SQLite database
lets you choose an embedding model
lets you choose the standard vocabularies
builds the mapping datasets
builds the local vector database

That means you do not need to call Python setup functions manually for the normal setup flow. At the end, you will be ready to run terminology mapping tasks.

Simplest setup

aatm init

This uses the default vocab directory and interactively asks you to choose the embedding model, standard vocabularies and other options.

Setup with a custom vocab directory

aatm init --vocab-dir ./my_vocabularies

Setup with an explicit embedding model

aatm init --embedding-model embeddinggemma-300M

Setup with explicit standard vocabularies

aatm init --standard-vocabs LOINC --standard-vocabs SNOMED --standard-vocabs RxNorm

Fully explicit setup

aatm init \
  --vocab-dir ./vocabularies \
  --embedding-model embeddinggemma-300M \
  --standard-vocabs LOINC \
  --standard-vocabs SNOMED \
  --standard-vocabs RxNorm

3. Prepare your input CSV

After initialization, prepare the CSV you want to map.

The mapper expects an OMOP-style SOURCE_TO_CONCEPT_MAP input structure, including these columns:

source_code
source_concept_id
source_vocabulary_id
source_code_description
valid_start_date
valid_end_date
invalid_reason

Example:

source_code,source_concept_id,source_vocabulary_id,source_code_description,valid_start_date,valid_end_date,invalid_reason
A01,,LOCAL,"Dor no peito",2020-01-01,2099-12-31,
B02,,LOCAL,"Diabetes mellitus tipo 2",2020-01-01,2099-12-31,

4. Run mapping directly from the CLI

The map command runs a terminology mapping task. You can use it in two ways:

with a task config file
with explicit CLI options

Both paths are supported directly by the CLI implementation.

Option A: run from explicit CLI arguments

This is the most direct fully-CLI workflow.

aatm map \
  --input-file data/source_to_concept_map.csv \
  --output-dir output \
  --translator-id empty-translator \
  --retriever-id embeddinggemma-300M \
  --reranker-id bm25-reranker \
  --selector-id first-result-selector \
  --batch-size 100

Run a small test job

Use --limit-to when you want to test with only a few rows.

aatm map \
  --input-file data/source_to_concept_map.csv \
  --output-dir output \
  --translator-id empty-translator \
  --retriever-id embeddinggemma-300M \
  --reranker-id bm25-reranker \
  --selector-id first-result-selector \
  --limit-to 20

Apply rate limiting

If needed, you can also pass a rate limit:

aatm map \
  --input-file data/source_to_concept_map.csv \
  --output-dir output \
  --translator-id gemini-2.5-flash \
  --retriever-id embeddinggemma-300M \
  --reranker-id bm25-reranker \
  --selector-id first-result-selector \
  --batch-size 50 \
  --rate-limit 100

The CLI accepts all of these options directly.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.1.6

Jun 6, 2026

0.1.5

Jun 6, 2026

0.1.4

Jun 6, 2026

This version

0.1.3

Jun 6, 2026

0.1.2

Jun 6, 2026

0.1.1

Jun 6, 2026

0.1.0

Jun 6, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

aatm-0.1.3.tar.gz (58.3 kB view details)

Uploaded Jun 6, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

aatm-0.1.3-py3-none-any.whl (75.9 kB view details)

Uploaded Jun 6, 2026 Python 3

File details

Details for the file aatm-0.1.3.tar.gz.

File metadata

Download URL: aatm-0.1.3.tar.gz
Upload date: Jun 6, 2026
Size: 58.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.9

File hashes

Hashes for aatm-0.1.3.tar.gz
Algorithm	Hash digest
SHA256	`c466924666bb63c1c814d7fb806891047786defdc4d04f6e0964a4f7f5ed2dca`
MD5	`2f5ec7b7bc7ba925b91bfa6106a3e943`
BLAKE2b-256	`637e26b3eb629505dd6d8465239f046b0ce448e2c43631b5be9e97c75a167b9d`

See more details on using hashes here.

File details

Details for the file aatm-0.1.3-py3-none-any.whl.

File metadata

Download URL: aatm-0.1.3-py3-none-any.whl
Upload date: Jun 6, 2026
Size: 75.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.9

File hashes

Hashes for aatm-0.1.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`e310ca9d3ad91c7d271fccddddc285b6d9b6da4b981809b4e6b7c98093b1efdd`
MD5	`1c4fa5844a87f9a5eb1298ebcbbc1daa`
BLAKE2b-256	`315b2e46a4985109340ab5758098289bcea6399a2e657fc6d011e20bc4855e0f`

See more details on using hashes here.

aatm 0.1.3

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

Any-to-Any Terminology Mapping

Documentation

Installation

1. Prepare your OMOP vocabularies directory

2. Run the initialization command

Simplest setup

Setup with a custom vocab directory

Setup with an explicit embedding model

Setup with explicit standard vocabularies

Fully explicit setup

3. Prepare your input CSV

4. Run mapping directly from the CLI

Option A: run from explicit CLI arguments

Run a small test job

Apply rate limiting

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes