# SQLdf
An API to run SQL (SQLite) queries on pandas.DataFrame objects.
## How it works
- It creates a virtual in-memory SQLite3 database at runtime
- It converts the pd.DataFrame input(s) to SQL table(s)
- It runs the SQL query on the table(s)
- It converts the SQL table(s) back to updated pd.DataFrame(s)
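
The steps above can be sketched with nothing more than the standard `sqlite3` module and pandas. This is an illustration of the idea, not SQLdf's actual implementation: the helper name `run_sql_on_frames` and its explicit `frames` dictionary are hypothetical, and the small DataFrame is made-up data.

```python
import sqlite3

import pandas as pd


def run_sql_on_frames(query, frames):
    """Run a SELECT query against DataFrames loaded into in-memory SQLite.

    `frames` maps table names to DataFrames (a hypothetical interface
    chosen for this sketch only).
    """
    # Step 1: create a virtual in-memory SQLite3 database
    conn = sqlite3.connect(":memory:")
    try:
        # Step 2: convert each DataFrame to a SQL table
        for name, frame in frames.items():
            frame.to_sql(name, conn, index=False)
        # Steps 3 and 4: run the query and convert the result to a DataFrame
        return pd.read_sql_query(query, conn)
    finally:
        conn.close()


# Made-up sample data
tips = pd.DataFrame({"total_bill": [10.0, 20.0, 30.0], "tip": [1.0, 3.0, 5.0]})
result = run_sql_on_frames("SELECT * FROM tips WHERE tip > 2", {"tips": tips})
print(result)
```

Passing the tables explicitly keeps the sketch self-contained; how SQLdf itself locates the DataFrames named in a query is not covered here.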
## Installation
With pip:

    pip install sqldf -U
## Examples of use
    # Import libraries
    import pandas as pd
    from sqldf import run

    # Create a dummy pd.DataFrame
    url = 'https://raw.github.com/pandas-dev/pandas/master/pandas/tests/data/tips.csv'
    tips = pd.read_csv(url)

    # Define a SQL (SQLite3) query
    query = """
    UPDATE tips
    SET tip = tip*2
    WHERE tip < 2;
    """

    # Run the query
    run(query)
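
For comparison, the same kind of UPDATE can be reproduced with the standard `sqlite3` module and pandas alone. This is only a sketch of the round trip described under "How it works", not SQLdf's actual code; the three-row `tips` table is made-up data so that the snippet runs offline.

```python
import sqlite3

import pandas as pd

# Made-up stand-in for the tips dataset
tips = pd.DataFrame({"total_bill": [16.0, 10.0, 21.0], "tip": [1.0, 1.5, 3.5]})

conn = sqlite3.connect(":memory:")
tips.to_sql("tips", conn, index=False)

# Apply the UPDATE inside SQLite
conn.execute("UPDATE tips SET tip = tip * 2 WHERE tip < 2")

# Read the modified table back into a DataFrame
updated = pd.read_sql_query("SELECT * FROM tips", conn)
conn.close()

print(updated)
```

Only the rows with a tip below 2 are doubled; the other rows come back unchanged.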

    # Import libraries
    import pandas as pd
    import numpy as np
    from sqldf import run

    # Create a dummy pd.DataFrame
    df = pd.DataFrame({'col1': ['A', 'B', np.nan, 'C', 'D'], 'col2': ['F', np.nan, 'G', 'H', 'I']})

    # Define a SQL (SQLite3) query
    query = """
    SELECT *
    FROM df
    WHERE col1 IS NOT NULL;
    """

    # Run the query
    df_view = run(query)
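
The DataFrame in this example contains missing values. When pandas writes a DataFrame to SQLite, NaN entries are stored as SQL NULL, so they can be filtered with `IS NOT NULL`. The sketch below shows this with plain `sqlite3` and pandas; it is an assumption-level illustration and does not go through SQLdf.

```python
import sqlite3

import numpy as np
import pandas as pd

df = pd.DataFrame({
    "col1": ["A", "B", np.nan, "C", "D"],
    "col2": ["F", np.nan, "G", "H", "I"],
})

conn = sqlite3.connect(":memory:")
# NaN values are written to the table as NULL
df.to_sql("df", conn, index=False)

# Keep only the rows where col1 is present
df_view = pd.read_sql_query("SELECT * FROM df WHERE col1 IS NOT NULL", conn)
conn.close()

print(df_view)
```

The row whose `col1` is missing is dropped, while the row whose `col2` is missing is kept because the filter only looks at `col1`.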
## Requirements
- pandas>=1.0