Skip to main content

Supplementary library for pandas that processes dataframes derived from CSV files.

Project description

datasoap

What is it?

datasoap is a supplementary library for pandas that processes dataframes derived from CSV files. The module checks cell data for correct numerical formatting and converts mismatched data to the correct data type (ex. str > float64).

Main Features

  • Strips unnecessary characters from numerical data fields in pandas dataframes to ensure consistent data formatting
  • Provides before and after representations of dataframes to allow for comparison

Repository

Source code is hosted on: github.com/snake-fingers/data-soap

Dependencies

pandas - Python package that provides fast, flexible, and expressive data structures designed to make working with “relational” or “labeled” data both easy and intuitive.

Installation

poetry add datasoap

Documentation

Documentation to come.

Background

datasoap originated from a Code Fellows 401 Python midterm project. The project team includes Alex Angelico, Grace Choi, Robert Carter, Mason Fryberger, and Jae Choi. After working with a few painful datasets using, we wanted to create a library that allows users to more efficiently manipulate clean datasets extracted from CSVs that may have inconsistent formatting.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

datasoap-1.0.3.tar.gz (4.8 kB view details)

Uploaded Source

Built Distribution

datasoap-1.0.3-py3-none-any.whl (5.7 kB view details)

Uploaded Python 3

File details

Details for the file datasoap-1.0.3.tar.gz.

File metadata

  • Download URL: datasoap-1.0.3.tar.gz
  • Upload date:
  • Size: 4.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.1.4 CPython/3.9.0 Linux/4.19.128-microsoft-standard

File hashes

Hashes for datasoap-1.0.3.tar.gz
Algorithm Hash digest
SHA256 e6bcc9f87c3cd29228f4b8f19a2faa502c45f52d968f5855f4eee9d81b2a4f6c
MD5 602643a20c37f51389dce2618ddd19b5
BLAKE2b-256 089341f928c5130943f94c9716f9fa4c085d47f33942cfdc4d75b3c644f773a4

See more details on using hashes here.

File details

Details for the file datasoap-1.0.3-py3-none-any.whl.

File metadata

  • Download URL: datasoap-1.0.3-py3-none-any.whl
  • Upload date:
  • Size: 5.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.1.4 CPython/3.9.0 Linux/4.19.128-microsoft-standard

File hashes

Hashes for datasoap-1.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 5a237741f1cccba26467b6a66ff9aed2f5e888edc1f1c1aed518679fff79c40e
MD5 9b0990c3958315847d22029c7c79cbad
BLAKE2b-256 24430f8838a4fbe8ed3bd011f8fb048e2464b5a4b61db01e25820394e489e420

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page