Skip to main content

Create artificial data for the MEx project.

Project description

MEx artificial

Create artificial data for the MEx project.

cookiecutter cve-scan documentation linting open-code testing

Project

The Metadata Exchange (MEx) project is committed to improve the retrieval of RKI research data and projects. How? By focusing on metadata: instead of providing the actual research data directly, the MEx metadata catalog captures descriptive information about research data and activities. On this basis, we want to make the data FAIR[^1] so that it can be shared with others.

Via MEx, metadata will be made findable, accessible and shareable, as well as available for further research. The goal is to get an overview of what research data is available, understand its context, and know what needs to be considered for subsequent use.

RKI cooperated with D4L data4life gGmbH for a pilot phase where the vision of a FAIR metadata catalog was explored and concepts and prototypes were developed. The partnership has ended with the successful conclusion of the pilot phase.

After an internal launch, the metadata will also be made publicly available and thus be available to external researchers as well as the interested (professional) public to find research data from the RKI.

For further details, please consult our project page.

[^1]: FAIR is referencing the so-called FAIR data principles – guidelines to make data Findable, Accessible, Interoperable and Reusable.

Contact
For more information, please feel free to email us at mex@rki.de.

Publisher

Robert Koch-Institut
Nordufer 20
13353 Berlin
Germany

Package

Create artificial extracted items, transform them into merged items and write the results into a configured sink.

License

This package is licensed under the MIT license. All other software components of the MEx project are open-sourced under the same license as well.

Development

Installation

  • install python on your system
  • on unix, run make install
  • on windows, run .\mex.bat install

Linting and testing

  • run all linters with make lint or .\mex.bat lint
  • run unit and integration tests with make test or .\mex.bat test
  • run just the unit tests with make unit or .\mex.bat unit

Updating dependencies

  • update boilerplate files with cruft update
  • update global requirements in requirements.txt manually
  • update git hooks with pre-commit autoupdate
  • update package dependencies using uv sync --upgrade
  • update github actions in .github/workflows/*.yml manually

Creating release

  • run mex release RULE to release a new version where RULE determines which part of the version to update and is one of major, minor, patch.

Container workflow

  • build image with make image
  • run local version using docker make run

Pre-built workflow

  • you can run the latest artificial data generator without building it locally
  • just pull it from the container registry and configure using cli arguments
  • docker run -it -u $(id -u):$(id -g) -v $(pwd):/out ghcr.io/robert-koch-institut/mex-artificial:latest --count=1000 --chattiness=10
  • using -u $(id -u):$(id -g) to run the process using your local user
  • using -v $(pwd):/out to specify an output directory for the resulting ndjson
  • --count controls the number of items to generate
  • --chattiness controls the number of words in textual fields

Commands

  • run uv run artificial --help to print instructions

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mex_artificial-1.3.1.tar.gz (11.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mex_artificial-1.3.1-py3-none-any.whl (13.6 kB view details)

Uploaded Python 3

File details

Details for the file mex_artificial-1.3.1.tar.gz.

File metadata

  • Download URL: mex_artificial-1.3.1.tar.gz
  • Upload date:
  • Size: 11.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for mex_artificial-1.3.1.tar.gz
Algorithm Hash digest
SHA256 0cf8027d45dcfa7c8cc8dd96da3fd0c31de622a2b2c6d8db737c1f9665c3b756
MD5 8e1211a639865e262a7b3436a085f4e1
BLAKE2b-256 af792c8bb0039b7f9f58955fdc05ce3e49e0de9dbc162e87731917a9c9fc8bd1

See more details on using hashes here.

Provenance

The following attestation bundles were made for mex_artificial-1.3.1.tar.gz:

Publisher: release.yml on robert-koch-institut/mex-artificial

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file mex_artificial-1.3.1-py3-none-any.whl.

File metadata

  • Download URL: mex_artificial-1.3.1-py3-none-any.whl
  • Upload date:
  • Size: 13.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for mex_artificial-1.3.1-py3-none-any.whl
Algorithm Hash digest
SHA256 62700e3bd6cb79467304bcbe20e84d0f3938415aa95d35f5119e7c1e058b2db2
MD5 23ac04018c1bda86a2b85a4a7371042d
BLAKE2b-256 f3ca0d013397e8377e05cdb729a902cd9a5246fcdae189acf747cab617ec8756

See more details on using hashes here.

Provenance

The following attestation bundles were made for mex_artificial-1.3.1-py3-none-any.whl:

Publisher: release.yml on robert-koch-institut/mex-artificial

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page