A collection of Machine Learning techniques for data management, engineering and augmentation.
Project description
DeepCoreML is a collection of Machine Learning techniques for data management, engineering, and augmentation. More specifically, DeepCoreML includes modules for:
- Data management
- Text data preprocessing
- Text representation, vectorization, embeddings
- Dimensionality reduction
- Generative modeling
- Imbalanced datasets
Licence: Apache License, 2.0 (Apache-2.0)
Dependencies:NumPy, pandas, Matplotlib, seaborn, joblib, Synthetic Data Vault (sdv), pytorch,scikit-learn, xgboost, imblearn, Reversible Data Transforms(RDT), tqdm.
GitHub repository: https://github.com/lakritidis/DeepCoreML
Publications:
- L. Akritidis, P. Bozanis, "A Clustering-Based Resampling Technique with Cluster Structure Analysis forSoftware Defect Detection in Imbalanced Datasets", Information Sciences, vol. 674, pp. 120724, 2024.
- L. Akritidis, A. Fevgas, M. Alamaniotis, P. Bozanis, "Conditional Data Synthesis with Deep Generative Models for Imbalanced Dataset Oversampling", In Proceedings of the 35th IEEE International Conference on Tools with Artificial Intelligence (ICTAI), pp. 444-451, 2023, 2023.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
deepcoreml-0.4.4.tar.gz
(68.6 kB
view details)
File details
Details for the file deepcoreml-0.4.4.tar.gz.
File metadata
- Download URL: deepcoreml-0.4.4.tar.gz
- Upload date:
- Size: 68.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.4.2 requests/2.22.0 setuptools/45.2.0 requests-toolbelt/0.8.0 tqdm/4.30.0 CPython/3.8.10
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
cafe2f04ca12f71d5fd1de2dfed27ca4771605b782021afd4a74e178696c9cbf
|
|
| MD5 |
8b56f2908ed848d1d03fe1d094427367
|
|
| BLAKE2b-256 |
ab753cbad522a0fcd49c1dab1f83125f0f79ed2add21e957823b59fe352fd8cd
|