BigFlowPrototypes
A package for preprocessing data for AI models.
Modules
Image.py
Converts image data to feature vectors using VGG16 and stores the vectors in a database.
linearEmbed.py
Prepares a standard ("normal") dataset and converts the selected non-numeric columns to embedding vectors using OpenAI embeddings.
Linear.py
Removes non-numeric data and prepares the remaining numeric data as input for a model.
Installation
pip install BigFlowPrototypes
Usage
Embedding text columns with linearEmbed:
from data_preprocessor import linearEmbed

if __name__ == "__main__":
    input_data_path = 'titanic.csv'         # Path to the input data file
    label_column = 'Survived'               # Name of the label column
    columns_to_embed = ['Name']             # Columns to embed
    openai_api_key = 'your_openai_api_key'  # OpenAI API key
    X_train, X_test, y_train, y_test = linearEmbed.main(
        input_data_path, label_column, columns_to_embed, openai_api_key
    )
    print("Data processing complete.")
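To make the embedding step more concrete, here is a minimal sketch of how text columns could be expanded into numeric feature columns. It is not the package's actual implementation: `embed_columns`, `toy_embed`, and the sample rows are hypothetical names and data invented for illustration, and `toy_embed` is a trivial stand-in for a real call to an embedding API.

```python
from typing import Callable, Dict, List, Sequence

Row = Dict[str, str]
EmbedFn = Callable[[str], Sequence[float]]


def embed_columns(rows: List[Row], columns: List[str], embed_fn: EmbedFn) -> List[Dict[str, object]]:
    """Replace each text column in `columns` with numbered numeric features."""
    cache: Dict[str, Sequence[float]] = {}  # avoid embedding the same text twice
    out: List[Dict[str, object]] = []
    for row in rows:
        # Copy every column that is not being embedded.
        new_row: Dict[str, object] = {k: v for k, v in row.items() if k not in columns}
        for col in columns:
            text = row[col]
            if text not in cache:
                cache[text] = embed_fn(text)
            # One new column per embedding dimension, e.g. Name_emb_0, Name_emb_1.
            for i, value in enumerate(cache[text]):
                new_row[f"{col}_emb_{i}"] = float(value)
        out.append(new_row)
    return out


def toy_embed(text: str) -> List[float]:
    """Hypothetical stand-in for an embedding API call: [length, vowel count]."""
    return [float(len(text)), float(sum(ch in "aeiou" for ch in text.lower()))]


# Made-up sample rows, shaped like a CSV read with csv.DictReader.
rows = [
    {"Name": "Braund, Mr. Owen", "Age": "22", "Survived": "0"},
    {"Name": "Heikkinen, Miss. Laina", "Age": "26", "Survived": "1"},
]
embedded = embed_columns(rows, ["Name"], toy_embed)
print(embedded[0])
```

Passing the embedding function in as a parameter keeps the column-expansion logic testable without network access or an API key. A real embedding function would return far more dimensions than this two-value toy.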
Numeric-only preprocessing with Linear:
from data_preprocessor import Linear

if __name__ == "__main__":
    input_file = 'movies.csv'  # Path to the input data file
    label = 'Popularity'       # Name of the label column
    X_train, X_test, y_train, y_test = Linear.main(input_file, label)
    print("Data processing complete.")
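As a rough illustration of what "removing non-numeric data" and producing a train/test split can look like, the sketch below uses only the standard library. It is an assumption about the general technique, not the code inside Linear.py; `numeric_split`, `is_number`, the 80/20 split, and the sample movie rows are all hypothetical.

```python
import random
from typing import Dict, List, Tuple

Row = Dict[str, str]


def is_number(value: str) -> bool:
    """Return True if the string parses as a float."""
    try:
        float(value)
        return True
    except ValueError:
        return False


def numeric_split(
    rows: List[Row], label: str, test_fraction: float = 0.2, seed: int = 0
) -> Tuple[List[str], List[List[float]], List[List[float]], List[float], List[float]]:
    """Drop non-numeric feature columns, then shuffle and split into train/test."""
    # Keep a feature column only if every row holds a parseable number in it.
    features = [c for c in rows[0] if c != label and all(is_number(r[c]) for r in rows)]
    X = [[float(r[c]) for c in features] for r in rows]
    y = [float(r[label]) for r in rows]

    # Shuffle row indices reproducibly, then carve off the test portion.
    order = list(range(len(rows)))
    random.Random(seed).shuffle(order)
    n_test = max(1, int(len(rows) * test_fraction))
    test_idx, train_idx = order[:n_test], order[n_test:]

    def pick(data, idx):
        return [data[i] for i in idx]

    return features, pick(X, train_idx), pick(X, test_idx), pick(y, train_idx), pick(y, test_idx)


# Made-up rows for illustration; "Title" is text and should be dropped.
movies = [
    {"Title": "A", "Budget": "10.0", "Runtime": "90", "Popularity": "3.5"},
    {"Title": "B", "Budget": "25.5", "Runtime": "120", "Popularity": "7.1"},
    {"Title": "C", "Budget": "5.0", "Runtime": "85", "Popularity": "2.0"},
    {"Title": "D", "Budget": "40.0", "Runtime": "140", "Popularity": "8.8"},
    {"Title": "E", "Budget": "12.0", "Runtime": "100", "Popularity": "4.2"},
]
features, X_train, X_test, y_train, y_test = numeric_split(movies, "Popularity")
print(features, len(X_train), len(X_test))
```

A column is kept only when every value parses as a number, so a single missing or malformed cell removes the whole column. A real pipeline would more likely impute or drop the offending rows instead.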
File details
Details for the file bigflowprototypes-0.1.tar.gz.
File metadata
- Download URL: bigflowprototypes-0.1.tar.gz
- Upload date:
- Size: 5.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.0 CPython/3.10.10
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 147b819546ef043cd3e5aff7fd2ae8f9bb46cdb6f70566798ab31e5fc97ff6a9 |
| MD5 | 44db1167c9b4ffb4556115eb5a52ec87 |
| BLAKE2b-256 | 069d54d0d4a56c369a1d6ff3a79535d2852a8de140798579ce1eb52aff2b834b |
File details
Details for the file BigFlowPrototypes-0.1-py3-none-any.whl.
File metadata
- Download URL: BigFlowPrototypes-0.1-py3-none-any.whl
- Upload date:
- Size: 7.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.0 CPython/3.10.10
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | acdb41bed95c6cbae040842f170c98345dfb141f0c775ac0a86387ba97b72a34 |
| MD5 | b04a99dcc9311a1c5e33763f17c4ebd4 |
| BLAKE2b-256 | 799cc665fe61c22dc33f4b2b62f9a8d5f9ecd44feaf758868d8f25339a9ae7db |
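The SHA256 digests listed for the two files can be used to check a manual download. The following is a small sketch using Python's standard `hashlib` module; the helper names `sha256_of` and `verify` are illustrative, and the script assumes the files were saved to the current directory under the file names shown in the file details.

```python
import hashlib
from pathlib import Path

# SHA256 digests copied from the file details of each distribution.
EXPECTED_SHA256 = {
    "bigflowprototypes-0.1.tar.gz":
        "147b819546ef043cd3e5aff7fd2ae8f9bb46cdb6f70566798ab31e5fc97ff6a9",
    "BigFlowPrototypes-0.1-py3-none-any.whl":
        "acdb41bed95c6cbae040842f170c98345dfb141f0c775ac0a86387ba97b72a34",
}


def sha256_of(path: Path, chunk_size: int = 1 << 16) -> str:
    """Hash a file in chunks so large files need not fit in memory."""
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path: Path) -> bool:
    """Return True only if the file name is known and its digest matches."""
    expected = EXPECTED_SHA256.get(path.name)
    return expected is not None and sha256_of(path) == expected


if __name__ == "__main__":
    for name in EXPECTED_SHA256:
        candidate = Path(name)
        if candidate.exists():
            print(name, "OK" if verify(candidate) else "MISMATCH")
        else:
            print(name, "not found in the current directory")
```

When installing with pip rather than downloading by hand, pip's hash-checking mode (`--require-hashes` with a requirements file) performs a comparable check automatically.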