A Python library for Healthcare Data Analytics and Revenue Cycle Management.
Project description
Carelytics – Healthcare Data Analytics Library
A modular Python package for healthcare data cleaning, validation, and revenue cycle insights.
Overview
Carelytics is a Python library designed to simplify data analytics and automation in the healthcare domain, especially focusing on Revenue Cycle Management (RCM) workflows.
It provides functions to:
- Validate and clean large healthcare datasets
- Analyze patient encounters, lab data, and vitals
- Standardize formats for interoperability (FHIR-ready)
- Support predictive modeling such as readmission and denial prediction
Built with Pandas, NumPy, and Scikit-learn, Carelytics empowers analysts, researchers, and developers to derive actionable insights from healthcare data quickly and efficiently.
Package Architecture
carelytics/
│
├── data/ # (Placeholder for sample data or CSVs)
│
├── fhir/ # Handles healthcare interoperability (FHIR parsing)
│ ├── parser.py
│ └── validator.py
│
├── models/ # Predictive models for healthcare analytics
│ ├── denial_prediction.py
│ └── readmission.py
│
├── utils/ # Utility functions for data processing
│ ├── __init__.py
│ ├── cleaner.py
│ ├── deid.py # De-identification utilities for PHI data
│ ├── validator.py # Schema and datatype validation
│ └── __init__.py
│
├── claims.py # Claim-level metrics and KPIs
├── encounter.py # Patient encounter analytics
├── lab.py # Lab result standardization
├── patient.py # Patient-level summaries
└── vitals.py # Vital signs normalization and aggregation
Each module is reusable and can be independently imported.
⚙️ Core Functionalities
1. carelytics.utils.validator
Provides schema and datatype validation for healthcare datasets.
Example:
from carelytics.utils.validator import validate_columns, validate_datatypes
validate_columns(df, ["patient_id", "age", "diagnosis"])
validate_datatypes(df, {"age": "int64", "diagnosis": "object"})
Output:
all required columns validated.
Column data types validated successfully.
2. carelytics.utils.cleaner
Includes data cleaning utilities like missing value handling, standardization, and column renaming.
from carelytics.utils.cleaner import fill_missing
df = fill_missing(df, strategy="median")
3. carelytics.models.denial_prediction
Predicts claim denial probabilities based on payer data, CPT/ICD codes, and historical denials.
from carelytics.models.denial_prediction import predict_denials
pred = predict_denials(df)
print(pred.head())
4. carelytics.models.readmission
Predicts hospital readmission likelihood using patient demographics and vitals.
5. carelytics.claims
Analyzes RCM claim metrics such as:
- Average AR days
- Net collection rate
- Denial rates
6. carelytics.lab & carelytics.vitals
Helps in normalizing patient lab values and vitals for statistical analysis.
7. carelytics.utils.deid
Supports data anonymization to remove or mask PHI (Protected Health Information) before analysis.
Example Workflow
import pandas as pd
from carelytics.utils import validator, cleaner
from carelytics.models import denial_prediction
# Load your healthcare dataset
df = pd.read_csv("claims.csv")
# Validate structure
validator.validate_columns(df, ["claim_id", "payer", "amount", "denial_flag"])
# Clean and prepare
df = cleaner.fill_missing(df, "median")
# Run prediction
pred = denial_prediction.predict_denials(df)
print(pred.head())
Key Use Cases
| Use Case | Description |
|---|---|
| Hospital Analytics | Clean and validate EHR data for performance dashboards |
| RCM Optimization | Predict denials, track collection efficiency |
| Clinical Research | Analyze patient lab results and vitals |
| Data Interoperability | FHIR parser ensures standard formats for sharing |
| PHI Handling | Built-in data de-identification ensures HIPAA compliance |
Dependencies
- Python ≥ 3.7
- pandas
- numpy
- scikit-learn
Install all dependencies with:
pip install carelytics
Authors & Contributors
Rohan Desai Dallas, Texas, USA Email: rohan.acme@gmail.com GitHub: https://github.com/rohan-desai LinkedIn: https://www.linkedin.com/in/rohandesai07/
Vaishnavi Sanjay Gadve Irving, Texas, USA Email: vaishnavigadve143@gmail.com GitHub: https://github.com/vaish2412 LinkedIn: https://www.linkedin.com/in/vaishnavi-gadve-4b577512a/
License
MIT License © 2025 Rohan Desai & Vaishnavi Sanjay Gadve
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file carelytics-0.1.3.tar.gz.
File metadata
- Download URL: carelytics-0.1.3.tar.gz
- Upload date:
- Size: 13.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.8
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
6fe69eb1cd13c77342db86963d14eb84df5ef7798ee3863cca590d7a621b64b8
|
|
| MD5 |
7d00709d53dc6dc0a2d5e2a1bff09485
|
|
| BLAKE2b-256 |
40dc76962328f321eb557081463435882461bb7595036800bf4907edba8a3bc0
|
File details
Details for the file carelytics-0.1.3-py3-none-any.whl.
File metadata
- Download URL: carelytics-0.1.3-py3-none-any.whl
- Upload date:
- Size: 16.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.8
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
1c6be359d33bcabeb919cce649638f210b64e8f348564ced3aeecaa217eb931b
|
|
| MD5 |
39046fffbe8f1ee92e1a318cdbfd1357
|
|
| BLAKE2b-256 |
21043b230cbd56c5067fea5b449c40c5e8a0a23df9f01ea47cc242f0885b265a
|