Skip to main content

Supplementary library for pandas that processes dataframes derived from CSV files.

Project description

datasoap

What is it?

datasoap is a supplementary library for pandas that processes dataframes derived from CSV files. The module checks cell data for correct numerical formatting and converts mismatched data to the correct data type (ex. str > float64).

Main Features

  • Strips unnecessary characters from numerical data fields in pandas dataframes to ensure consistent data formatting
  • Provides before and after representations of dataframes to allow for comparison

Repository

Source code is hosted on: github.com/snake-fingers/data-soap

Dependencies

pandas - Python package that provides fast, flexible, and expressive data structures designed to make working with “relational” or “labeled” data both easy and intuitive.

Installation

poetry add datasoap

Documentation

User Docs can be found at datasoap docs

Background

datasoap originated from a Code Fellows 401 Python midterm project. The project team includes Alex Angelico, Grace Choi, Robert Carter, Mason Fryberger, and Jae Choi. After working with a few painful datasets using, we wanted to create a library that allows users to more efficiently manipulate clean datasets extracted from CSVs that may have inconsistent formatting.

Future Features in Planning

  • Additional functionality that re-names columns with the common unit of measure.
  • Automation of identifying columns that require reformatting, removing the need for user input on instantiation
  • Extend useablity to work with other virtual environments such as pip, and conda

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

datasoap-1.1.2.tar.gz (5.2 kB view details)

Uploaded Source

Built Distribution

datasoap-1.1.2-py3-none-any.whl (6.2 kB view details)

Uploaded Python 3

File details

Details for the file datasoap-1.1.2.tar.gz.

File metadata

  • Download URL: datasoap-1.1.2.tar.gz
  • Upload date:
  • Size: 5.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.1.4 CPython/3.9.0 Linux/4.19.128-microsoft-standard

File hashes

Hashes for datasoap-1.1.2.tar.gz
Algorithm Hash digest
SHA256 54185523a669f006531ce674e358967a0ce23b691138db5a226194d18e81f92f
MD5 e64d8a5a276d5a8c3e900f2ee87c2f93
BLAKE2b-256 840b8b11e29da171890b53e5864333b8e207289938dd46c9e182490c20dd02c3

See more details on using hashes here.

File details

Details for the file datasoap-1.1.2-py3-none-any.whl.

File metadata

  • Download URL: datasoap-1.1.2-py3-none-any.whl
  • Upload date:
  • Size: 6.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.1.4 CPython/3.9.0 Linux/4.19.128-microsoft-standard

File hashes

Hashes for datasoap-1.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 06cbab1ea942f216f2bdc568b778dbf851386cad8e3724a367b091962bfb61b0
MD5 efe9459a8b7e13c8cb0d9fd70a90d9a5
BLAKE2b-256 3fe88bd415ddc41854b2d0e7442ca8d1223b8f829568f5dd56fc50da98a4d154

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page