Skip to main content

Data Cleaning Tool

Project description

PureFlow - README

This tool is designed to clean datasets by removing null values, special characters, links, hashtags, and mentions from specific columns. It also performs various text cleaning operations to preprocess the data.

Usage:

  1. Prepare your dataset in a supported format (CSV, Excel, etc.).
  2. Place the dataset file in the same directory as this cleaning tool.
  3. Open a command prompt or terminal.
  4. Navigate to the directory containing the cleaning tool and dataset file.
  5. Install the cleaning tool using the this commands: pip install PureFlow
  6. Import The PureFlow Package and start using It: import PureFlow as pf pf.clean_dataset() pf.remove_nulls()

Parameters:

  • columns_name: Provide the names of the columns to clean. Use ['all'] to clean all columns or ['column1', 'column2'] to clean specific columns.
  • remove_nan (Optional): Specify if null values should be removed. Default is False.
  • save_it_as_csv (Optional): Specify if the cleaned dataset should be saved as a new CSV file. Default is True.

Output:

  • The cleaned dataset will be displayed on the console.
  • If save_it_as_csv is True, the cleaned dataset will be saved as "Cleaned_Dataset.csv" in the same directory.

Note:

  • Make sure you have Python and the necessary dependencies (Pandas) installed.
  • Ensure that your dataset file is not open or locked by any other program during the cleaning process.

Example Usage:

pip install PureFlow import PureFlow as pf pf.clean_dataset('dataset name', ['columns name']) Cleaning in progress... Cleaned dataset: ... [Display a sample of cleaned data here] ... Cleaning completed successfully. The cleaned dataset has been saved as "Cleaned_Dataset.csv".

For any issues or inquiries, please contact me:

E-Mail: iMoHd8@hotmail.com LinkedIn: https://www.linkedin.com/in/mohammed-mahameed GitHub: https://github.com/iMoHd8 Instagram: https://www.instagram.com/i.mohd.8/

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

PureFlow-1.0.1.tar.gz (4.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

PureFlow-1.0.1-py3-none-any.whl (4.8 kB view details)

Uploaded Python 3

File details

Details for the file PureFlow-1.0.1.tar.gz.

File metadata

  • Download URL: PureFlow-1.0.1.tar.gz
  • Upload date:
  • Size: 4.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.11

File hashes

Hashes for PureFlow-1.0.1.tar.gz
Algorithm Hash digest
SHA256 f8aba5501aafabc0c82f1383b9968881c44207ebe4d1000a3b2998d09e359b59
MD5 6dcd213f83a66465812e6315925d21e6
BLAKE2b-256 e00e6fe7bbf26d6f3c71417429199ca1540abd833159f7d65e59abd383c96ae9

See more details on using hashes here.

File details

Details for the file PureFlow-1.0.1-py3-none-any.whl.

File metadata

  • Download URL: PureFlow-1.0.1-py3-none-any.whl
  • Upload date:
  • Size: 4.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.11

File hashes

Hashes for PureFlow-1.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 b4653ee6acbca67a0834f809910c64dcfd3e6b856cc79d633bcfaf874dde3ca8
MD5 974244837c78222eb0258ef7c242a39f
BLAKE2b-256 604addfe145894949d3371ab696badef14fe34b467802c1bebdf56579931aa88

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page