Skip to main content

Health data faker

Project description

Headfake

What is Headfake?

It is Health data faker: a Python-based package which allows the user to create fake or test data sets from a YAML template file.

The package can be embedded directly into Python scripts, or it can be used through a command-line script.

It builds upon some of the ideas in pydbgen and improves on it with support for a number of additional features including statistically distributed random values; dependent fields and custom fields.

Why would I use Headfake?

Headfake makes it simple and straightforward to generate fake or test data. It has a number of features which make this easier:

  • Support for shareable template-based config or direct Python implementation to setup and perform the data generation.

  • Embeddable data generation into projects (either using a YAML config or constructing the classes manually).

  • Generation of names and contact details through use of the Python package Faker.

  • Randomised names can be output based on a gender field.

  • More realistic simulated data uses statistical distributions to create date of birth and also probability-based option values. Other approaches to simulate real data are also being investigated.

  • Clinical data supported includes random NHS numbers and deceased flags/date of death based on age-based odds of death.

  • Dependent fields (e.g. one field's values are dependent on the values from one or more)

  • Field data can be looked up from another file using a key field, allowing re-use of patient details in a different field set.

  • A selection of fields to handle generation of different types of data.

  • Ability to create and use custom fields to generate your own data types and values

  • Support for transformers which pre or post-process data once it's been generated

How do I install and use Headfake?

To install Headfake please go to the Installation page and then start with the Tutorials or the usage page.

Is Headfake being actively maintained?

Yes, we are using Headfake is our own projects and as result are keep it maintained and adding new features when we need them.

Is Headfake suitable for my project?

The library has been released under an MIT license so can be embedded into your own tools with minimal restrictions on use.

If I use Headfake to generate data in my research project which source should I cite?

We are working on a journal paper, for now please cite the project URL (https://rcfgroup.github.io/headfake).

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

headfake-1.0.2.tar.gz (47.9 kB view hashes)

Uploaded Source

Built Distribution

headfake-1.0.2-py3-none-any.whl (42.5 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page