Skip to main content

This is a Python package that automates your data preprocessing

Project description

DataFit

DataFit is a python package developed for automating data preprocessing.

Note This Package is under development and is open source.

This package is developed by Syed Syab and Hamza Rustam for the purpose of Final Year Project at University of Swat. our information is given below

About Project:
    DataFit is a python package developed for automating data preprocessing.
    Project initilization data: 01/OCT/2023
    Project Finilization Data: 01/Dec/2023 (Expected)

Team Member:
    Syed Syab: Student (Me) [github.com/SyabAhmad] [linkedin.com/SyedSyab]
    Hamza Rustam: Student
    ================================
    Professor Naeem Ullah: Supervisor 

This Package is desinged in a user-friendly manner which means every one can use it.

The main functionality of the package is to just automate the data pre-processing step, and make it easy for machine learning engineers or data scientist.

Current Functionality of the package is:

    Function:
        displaying information
        Handling Null Value
        Delete Multiple Columns
        Handling Categorical Values
        Normalization
        Standardization
        Extract Numeric Values
        Tokenization

To use the package

pip install datafit

To use this package it's quit simple, just import it like pandas and then use it.

import datafit as df
# to check information of the data
df.information(data)

To categorize the data

import datafit as df

df.handleCategoricalValues(data,["column1","column2"])

if you want to not mention the columns name an do it for all columns then simply type None inplace of columns names.

import datafit as df

df.handleCategoricalValues(data,None)

To Extract numerical values from the columns

import datafit as df

df.extractValues(data,["columns1", "columns2"])

Note Again: This package is uder development. if it touches your heart do share it and follow me on github [github.com/SyabAhmad] and linkedin [linkedin.com/in/SyedSyab]for mote intersting updates

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

datafit-0.2023.2.tar.gz (6.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

datafit-0.2023.2-py3-none-any.whl (7.3 kB view details)

Uploaded Python 3

File details

Details for the file datafit-0.2023.2.tar.gz.

File metadata

  • Download URL: datafit-0.2023.2.tar.gz
  • Upload date:
  • Size: 6.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.4

File hashes

Hashes for datafit-0.2023.2.tar.gz
Algorithm Hash digest
SHA256 f1f58850840058c660d7c9235c76ef202a1cb5acc7b61d234ed8eb6709d44a2e
MD5 30915fed0a905daf7446df4e55bcf139
BLAKE2b-256 732d547daca39b44768f09cd594b88a0c818f09ec6a038fa1c8794ff3d40bb89

See more details on using hashes here.

File details

Details for the file datafit-0.2023.2-py3-none-any.whl.

File metadata

  • Download URL: datafit-0.2023.2-py3-none-any.whl
  • Upload date:
  • Size: 7.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.4

File hashes

Hashes for datafit-0.2023.2-py3-none-any.whl
Algorithm Hash digest
SHA256 352907f24a38821181eed1d21a6b0c57addd37cef9fc5708e027845e187e4bdd
MD5 e8197047ecbd50e39a8d5b8595003161
BLAKE2b-256 aa054f6f732f47dec5211e83c4433542340582934e02c68e785f2e4b5974857c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page