Pandas dataframes with an object-oriented programming style

These details have not been verified by PyPI


Project description

Pandas-Oop

Pandas-Oop (also known as Poop) is a package for working with Pandas dataframes in an object-oriented programming style.

Installation:

  pip install pandas-oop

Some examples

from pandas_oop import models
from pandas_oop.fields import StringColumn, IntegerColumn, FloatColumn, DateColumn, BoolColumn

DB_CONNECTION = models.Connection('sqlite:///pandas_oop.db') # same connection string you would pass to a sqlalchemy engine

@models.sql(table='people', con=DB_CONNECTION) # use this decorator if you want to connect your class to a database
@models.Data
class People(models.DataFrame):
    name = StringColumn(unique=True)
    age = IntegerColumn()
    money = FloatColumn(target_name="coins") # use target_name when the column in the csv or table is called coins but you want a different attribute name
    insertion_date = DateColumn(format='%d-%m-%Y')
    is_staff = BoolColumn(true='yes', false='no')

Instantiating this class returns a custom dataframe with all the functionality of a Pandas dataframe, plus some extras:

people = People()
or
people = People(from_csv=DATA_FILE, delimiter=";")
or
people = People(from_sql_query='select * from people')
or
people = People(from_df=some_dataframe)
or
people = People(from_iterator=some_function_that_yield_values)

Example of a function that yields values:

def some_function_that_yield_values():
    while something:
        ...
        yield name, age, money, insertion_date, is_staff
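
To make the placeholder above concrete, here is a minimal sketch of such a generator function. The sample records and the name people_rows are made up for illustration, and the assumption that each yielded tuple follows the order in which the columns are declared on the class comes from the example above, not from the package documentation.

```python
from datetime import date


def people_rows():
    """Yield one (name, age, money, insertion_date, is_staff) tuple per row."""
    # Hypothetical in-memory records; a real generator might read a file or an API.
    records = [
        ("Ann", 31, 10.5, date(2022, 4, 1), True),
        ("Bob", 45, 3.0, date(2022, 4, 17), False),
    ]
    for record in records:
        yield record


# Consuming the generator directly, just to show what it produces.
rows = list(people_rows())
```

As in the from_iterator call above, the function itself would be passed, for example People(from_iterator=people_rows).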

You can also save the dataframe to the database with the save() method (if the dtypes of the columns have changed, this will raise a ValidationError):

people.save()

You can also upsert to the database. Existing rows are matched automatically on the unique fields declared in the class:

people.save(if_row_exists='update')
or
people.save(if_row_exists='ignore')
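
For readers unfamiliar with upserts, the difference between the two modes can be sketched with the standard library sqlite3 module. This is not how pandas_oop implements save(); it only illustrates the 'update' and 'ignore' behaviours on a unique column, using a made-up in-memory table.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE people (name TEXT UNIQUE, age INTEGER)")
conn.execute("INSERT INTO people (name, age) VALUES ('Ann', 31)")

# 'update' style: a row with the same unique key overwrites the stored values.
conn.execute(
    "INSERT INTO people (name, age) VALUES (?, ?) "
    "ON CONFLICT(name) DO UPDATE SET age = excluded.age",
    ("Ann", 32),
)
age_after_update = conn.execute(
    "SELECT age FROM people WHERE name = 'Ann'"
).fetchone()[0]

# 'ignore' style: a row with the same unique key is skipped, the stored row is kept.
conn.execute("INSERT OR IGNORE INTO people (name, age) VALUES (?, ?)", ("Ann", 99))
age_after_ignore = conn.execute(
    "SELECT age FROM people WHERE name = 'Ann'"
).fetchone()[0]

row_count = conn.execute("SELECT COUNT(*) FROM people").fetchone()[0]
conn.close()
```

In both cases the table still holds a single row for Ann; only the 'update' style changes her age.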

If you want to revalidate your dataframe (convert the column dtypes back to the types declared in the class), you can call the validate() method:

people.validate()
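
As a rough picture of what converting the columns to their declared types means, here is a sketch in plain pandas for the People class above. The sample data is made up, and these are only the conversions one would expect from the field declarations; it is an assumption that validate() does something comparable, not a description of the package internals.

```python
import pandas as pd

# Hypothetical raw data where every column is still a string.
raw = pd.DataFrame({
    "name": ["Ann", "Bob"],
    "age": ["31", "45"],
    "coins": ["10.5", "3"],
    "insertion_date": ["01-04-2022", "17-04-2022"],
    "is_staff": ["yes", "no"],
})

people_df = pd.DataFrame({
    "name": raw["name"].astype(str),                       # StringColumn
    "age": raw["age"].astype(int),                         # IntegerColumn
    "money": raw["coins"].astype(float),                   # FloatColumn(target_name="coins")
    "insertion_date": pd.to_datetime(
        raw["insertion_date"], format="%d-%m-%Y"           # DateColumn(format='%d-%m-%Y')
    ),
    "is_staff": raw["is_staff"].map({"yes": True, "no": False}),  # BoolColumn(true='yes', false='no')
})
```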

You can also validate against another class, for example after merging two dataframes:

people = People(from_csv=DATA_FILE)
jobs = Jobs(from_sql_query='select * from jobs')
people_with_jobs = people.merge(jobs, on='name').validate(from_class=PeopleWithJobs)

This is the list of overridden methods that return a pandas_oop custom dataframe:

'isnull'
'head'
'abs'
'merge'
'loc' and dataframe slicing
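
For background on why the return type matters: a plain DataFrame subclass loses its type after most operations unless it tells pandas which class to build. The sketch below uses the _constructor property that pandas documents for subclassing. CustomFrame is a hypothetical class, and whether pandas_oop relies on this mechanism or overrides each method by hand is an assumption.

```python
import pandas as pd


class CustomFrame(pd.DataFrame):
    """Hypothetical DataFrame subclass that keeps its type across operations."""

    @property
    def _constructor(self):
        # pandas calls this to build the result of head(), abs(), slicing, etc.
        return CustomFrame


frame = CustomFrame({"name": ["Ann", "Bob"], "age": [31, -45]})

first_row = frame.head(1)
missing = frame.isnull()
absolute = frame[["age"]].abs()
adults = frame.loc[frame["age"] > 0]
```

Each of the four results is again a CustomFrame rather than a plain pandas DataFrame.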

More methods will be added to this list over time.

New features

Alembic Database migration support added:

In your main application package, import Base (this is a declarative_base from sqlalchemy):

from pandas_oop import Base

Add this configuration to the env.py file of your alembic config:

from your_app import Base
target_metadata = Base.metadata

Finally, update the database URL (the sqlalchemy.url setting) in your alembic.ini file.

Project details


Release history

0.9.6 (May 1, 2022), this version
0.9.5 (Apr 17, 2022)
0.9.4 (Apr 17, 2022)
0.9.3 (Apr 17, 2022)
0.9.2 (Apr 9, 2022)
0.9.1 (Apr 9, 2022)
0.9.0 (Apr 9, 2022)
0.0.4 (Apr 3, 2022)
0.0.3 (Mar 31, 2022)
0.0.2 (Mar 31, 2022)
0.0.1 (Mar 31, 2022)

Download files

Source Distribution

pandas-oop-0.9.5.tar.gz (8.8 kB), uploaded Apr 17, 2022, Source

Built Distribution

pandas_oop-0.9.5-py3-none-any.whl (8.4 kB), uploaded Apr 17, 2022, Python 3

Hashes for pandas-oop-0.9.5.tar.gz

Algorithm	Hash digest
SHA256	`aa9fc2b06aa688a4453fac406bf49fcc31ffaf380cf382482331668dd321048f`
MD5	`e6cefedda961b9404c5ac736156429e9`
BLAKE2b-256	`24ce4bfbc8a0549bd3b955ef06329dac4dae2a1083f5bc03702d473589e77d67`

Hashes for pandas_oop-0.9.5-py3-none-any.whl

Algorithm	Hash digest
SHA256	`c8f32bf51d5c846d0ebaa26940a7b4e085ae444f06817017be716e46a0e10ba0`
MD5	`569412bf4daa239cc47c98358e0308c3`
BLAKE2b-256	`39ac0ba4f6a65a30cacfb085e2c8ad20c377dac9df2233a7481702e9168f836d`