Build analytical tables for clinicedc/edc projects
Project description
edc-analytics
Build analytic tables from EDC data
Overview
Read your data into a dataframe, for example an EDC screening table:
import pandas as pd
from django_pandas.io import read_frame
from meta_screening.models import SubjectScreening
from edc_analytics.custom_tables import AgeTable, GenderTable
qs_screening = SubjectScreening.objects.all()
df = read_frame(qs_screening)
Convert all numerics to pandas numerics:
cols = [
"age_in_years",
"dia_blood_pressure_avg",
"fbg_value",
"hba1c_value",
"ogtt_value",
"sys_blood_pressure_avg",
]
df[cols] = df[cols].apply(pd.to_numeric)
Pass the dataframe to each Table class
gender_tbl = GenderTable(main_df=df)
age_tbl = AgeTable(main_df=df)
bp_table = BpTable(main_df=df)
In the Table instance,
data_df is the supporting dataframe
table_df is the dataframe to display. The table_df displays formatted data in the first 5 columns (“Characteristic”, “Statistic”, “F”, “M”, “All”). The table_df has additional columns that contain the statistics used for the statistics displayed in columns [“F”, “M”, “All”].
From above, gender_tbl.table_df is just a dataframe and can be combined with other table_df dataframes using pd.concat() to make a single table_df.
table_df = pd.concat(
[gender_tbl.table_df, age_tbl.table_df, bp_table.table_df]
)
Show just the first 5 columns:
table_df.iloc[:, :5]
Like any dataframe, you can export to csv:
path = "my/path/to/csv/folder/table_df.csv"
table_df.to_csv(path_or_buf=path, encoding="utf-8", index=0, sep="|")
Details
Assumptions
The default table assumes:
you have gender for all observations.
gender is “M”, “F” or from edc.constants MALE, FEMALE
A Table presents data by characteristic per row (such as age, bp, glucose, …). It is a dataframe where the first columns are formatted for presentation and the remining columns are the descriptive statistics used to render the formatted columns (mean, median, sd, range, IQR, proportions).
If a table is stratified by gender, then the formatted row for “Age” might be like this:
| Characteristic | Statistic | F | M | All |
======================================================
| Age (years) | n | 1175 | 1000 | 2175 |
| | 18-34 | 70 | 64 | 134 |
| | ...etc | | | |
contains a collection of RowDefinitions
Stratification
Putting together a table
RowDefinitions
RowDefinitions are a collection of RowDefinition.
To build a table use the Table class and override the build_defs method. For example:
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file edc_analytics-1.0.2.tar.gz.
File metadata
- Download URL: edc_analytics-1.0.2.tar.gz
- Upload date:
- Size: 25.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.8.4
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
0c71a5e80744fc9c3a419173a3ff9ae28b576146f345581940b6a62498999955
|
|
| MD5 |
a4195f3a064f8424b1e37ce32260313c
|
|
| BLAKE2b-256 |
77fb0d1e42e7830b1bfa0847f374c31ec6f9ca7cf23334ad125aebe20f400af3
|
File details
Details for the file edc_analytics-1.0.2-py3-none-any.whl.
File metadata
- Download URL: edc_analytics-1.0.2-py3-none-any.whl
- Upload date:
- Size: 35.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.8.4
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7d96d458c5d2cf8a5915b15da11cff80d13212ed5ff77a70a4ba747d43d13c55
|
|
| MD5 |
0048d2edc66ad756cb0563f5411718ca
|
|
| BLAKE2b-256 |
6c3629f7f7b5da8eeb55ede71445765864ec448fc6fbd4250fdac1c5517dcbaa
|