Skip to main content

Decrease DataFrame size with 2 lines of code.

Project description

pandazip

GitHub license Maintenance made-with-python

Go minimal, go green, go pandazip.

Cut memory footprint by forth in just 2 lines of code. Compress Pandas DataFrames without losing information or go smaller in expense of losing a bit of information. Swift parallel execution makes pandazip very fast.

"A 2018 blog post from OpenAI revealed that the amount of compute required for the largest AI training runs has increased by 300,000 times since 2012. And while that post didn’t calculate the carbon emissions of such training runs, others have done so. According to a paper by Emma Strubel and colleagues, an average American is responsible for about 36,000 tons of CO2 emissions per year; training and developing one machine translation model that uses a technique called neural architecture search was responsible for an estimated 626,000 tons of CO2."

"According to the American Council for an Energy-Efficient Economy it takes 5.12 kWh of electricity per gigabyte of transferred data. And according to the Department of Energy the average US power plant expends 600 grams of carbon dioxide for every kWh generated. That means that transferring 1GB of data produces 3kg of CO2."

plus storage and processing = Huge impact on environment without even noticing.

Take action now!

Install

pandazip can be installed from PyPI:

pip install pandazip

Compressing Pandas DataFrame using Pandazip

from pandazip import Pandazip
zipper = Pandazip()
x_train = zipper.fit_transform(x_train)
x_test = zipper.transform(x_test)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pandazip-0.1.0.tar.gz (4.1 kB view details)

Uploaded Source

Built Distribution

pandazip-0.1.0-py3-none-any.whl (4.4 kB view details)

Uploaded Python 3

File details

Details for the file pandazip-0.1.0.tar.gz.

File metadata

  • Download URL: pandazip-0.1.0.tar.gz
  • Upload date:
  • Size: 4.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.7.12

File hashes

Hashes for pandazip-0.1.0.tar.gz
Algorithm Hash digest
SHA256 ae6603030d8d53853c59c9d37753ca1e8bd27fd8785083617c818a464c3114ef
MD5 b18117b1456d9d533eddcc9821e49749
BLAKE2b-256 92e1a4862a0433cda18960c964d656f53da0a4e2128965dec61c679e1a76f0ac

See more details on using hashes here.

File details

Details for the file pandazip-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: pandazip-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 4.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.7.12

File hashes

Hashes for pandazip-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 d0b39fd068935ad4518774524b0628443e762beeb72f318037d29eda8ebc7500
MD5 63ca337740aeb3e86123b94ebc7d3d43
BLAKE2b-256 2a3fe71828ede751aa8ffe77ab9df833f13593acfc5872ac646cca219c337e0e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page