A collection of Machine Learning techniques for data management, engineering, and augmentation.
Project description
DeepCoreML is a collection of Machine Learning techniques for data management, engineering, and augmentation. More specifically, DeepCoreML includes modules for:
- Data management
- Text data preprocessing
- Text representation, vectorization, embeddings
- Dimensionality reduction
- Generative modeling
- Imbalanced datasets
Licence: Apache License, Version 2.0 (Apache-2.0)
Dependencies: NumPy, Pandas, Matplotlib, Seaborn, joblib, Synthetic Data Vault (SDV), PyTorch, scikit-learn, XGBoost, imbalanced-learn (imblearn), Reversible Data Transforms (RDT), tqdm.
GitHub repository: https://github.com/lakritidis/DeepCoreML
Publications:
- L. Akritidis, P. Bozanis, "A Clustering-Based Resampling Technique with Cluster Structure Analysis for Software Defect Detection in Imbalanced Datasets", Information Sciences, vol. 674, pp. 120724, 2024.
- L. Akritidis, A. Fevgas, M. Alamaniotis, P. Bozanis, "Conditional Data Synthesis with Deep Generative Models for Imbalanced Dataset Oversampling", in Proceedings of the 35th IEEE International Conference on Tools with Artificial Intelligence (ICTAI), pp. 444-451, 2023.
Download files
Source Distribution
deepcoreml-0.4.10.tar.gz (107.7 kB)
File details
Details for the file deepcoreml-0.4.10.tar.gz.
File metadata
- Download URL: deepcoreml-0.4.10.tar.gz
- Upload date:
- Size: 107.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.4.2 requests/2.22.0 setuptools/45.2.0 requests-toolbelt/0.8.0 tqdm/4.30.0 CPython/3.8.10
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 55da8c0fc1b146c60bf374565d3cfbf37bfd4d739c007dc1afa217de9b141380 |
| MD5 | e902215f0982aa042acadeb7881f24d5 |
| BLAKE2b-256 | f14e33efeb4cfd0c37fddd0e2b4593c43b50111f86df275c3f2c9ee92aa6c4a9 |
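
The digests listed under File hashes can be used to check that a downloaded copy of deepcoreml-0.4.10.tar.gz is intact. The following is a minimal sketch using only Python's standard hashlib module. The helper names `file_digests` and `verify` are hypothetical and are not part of DeepCoreML, and the sketch assumes the archive has already been downloaded to a local path.

```python
import hashlib

# Digests copied from the "File hashes" table for deepcoreml-0.4.10.tar.gz.
EXPECTED = {
    "sha256": "55da8c0fc1b146c60bf374565d3cfbf37bfd4d739c007dc1afa217de9b141380",
    "md5": "e902215f0982aa042acadeb7881f24d5",
    "blake2b-256": "f14e33efeb4cfd0c37fddd0e2b4593c43b50111f86df275c3f2c9ee92aa6c4a9",
}


def file_digests(path, chunk_size=1 << 20):
    """Return the SHA256, MD5 and BLAKE2b-256 hex digests of a file.

    Hypothetical helper (not part of DeepCoreML). The file is read in
    chunks so that large archives do not need to fit in memory.
    """
    hashers = {
        "sha256": hashlib.sha256(),
        "md5": hashlib.md5(),
        # BLAKE2b-256 is BLAKE2b with a 32-byte (256-bit) digest.
        "blake2b-256": hashlib.blake2b(digest_size=32),
    }
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            for hasher in hashers.values():
                hasher.update(chunk)
    return {name: hasher.hexdigest() for name, hasher in hashers.items()}


def verify(path, expected=EXPECTED):
    """Map each algorithm name to True if the file's digest matches."""
    actual = file_digests(path)
    return {name: actual[name] == digest for name, digest in expected.items()}
```

Calling `verify("deepcoreml-0.4.10.tar.gz")` on an unmodified download should report a match for every algorithm. Any mismatch suggests a corrupted or altered file.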