Skip to main content

An improved way to upload pandas dataframes to Microsoft SQL Server.

Project description

fast_to_sql

Test and lint pypi Python version PyPI - Downloads PyPI - License

Introduction

fast_to_sql is an improved way to upload pandas dataframes to Microsoft SQL Server.

fast_to_sql takes advantage of pyodbc rather than SQLAlchemy. This allows for a much lighter weight import for writing pandas dataframes to sql server. It uses pyodbc's executemany method with fast_executemany set to True, resulting in far superior run times when inserting data.

Installation

pip install fast_to_sql

Requirements

  • Written for Python 3.8+
  • Requires pandas, pyodbc

Example

from datetime import datetime

import pandas as pd

import pyodbc
from fast_to_sql import fast_to_sql

# Test Dataframe for insertion
df = pd.DataFrame({
    "Col1": [1, 2, 3],
    "Col2": ["A", "B", "C"],
    "Col3": [True, False, True],
    "Col4": [datetime(2020,1,1),datetime(2020,1,2),datetime(2020,1,3)]
})

# Create a pyodbc connection
conn = pyodbc.connect(
    """
    Driver={ODBC Driver 17 for SQL Server};
    Server=localhost;
    Database=my_database;
    UID=my_user;
    PWD=my_pass;
    """
)

# If a table is created, the generated sql is returned
create_statement = fast_to_sql(
  df, "my_great_table", conn, if_exists="replace", custom={"Col1":"INT PRIMARY KEY"}
)

# Commit upload actions and close connection
conn.commit()
conn.close()

Usage

Main function

fast_to_sql(
  df, 
  name, 
  conn, 
  if_exists="append", 
  custom=None, 
  temp=False, 
  copy=False,
  clean_cols=True
)
  • df: pandas DataFrame to upload
  • name: String of desired name for the table in SQL server
  • conn: A valid pyodbc connection object
  • if_exists: Option for what to do if the specified table name already exists in the database. If the table does not exist a new one will be created. By default this option is set to 'append'
    • 'append': Appends the dataframe to the table if it already exists in SQL server.
    • 'fail': Purposely raises a FailError if the table already exists in SQL server.
    • 'replace': Drops the old table with the specified name, and creates a new one. Be careful with this option, it will completely delete a table with the specified name in SQL server.
  • custom: A dictionary object with one or more of the column names being uploaded as the key, and a valid SQL column definition as the value. The value must contain a type (INT, FLOAT, VARCHAR(500), etc.), and can optionally also include constraints (NOT NULL, PRIMARY KEY, etc.)
    • Examples: {'ColumnName':'varchar(1000)'} {'ColumnName2':'int primary key'}
  • temp: Either True if creating a local sql server temporary table for the connection, or False (default) if not.
  • copy: Defaults to False. If set to True, a copy of the dataframe will be made so column names of the original dataframe are not altered. Use this if you plan to continue to use the dataframe in your script after running fast_to_sql.
  • clean_cols: Defaults to True. If set to False, column names will not be cleaned when creating the table to insert the DataFrame into. If this is set to False, it is up to the caller of the function to make sure the names of the columns in the DataFrame are compatible with SQL Server.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

fast-to-sql-2.3.0.tar.gz (8.6 kB view details)

Uploaded Source

Built Distribution

fast_to_sql-2.3.0-py3-none-any.whl (7.2 kB view details)

Uploaded Python 3

File details

Details for the file fast-to-sql-2.3.0.tar.gz.

File metadata

  • Download URL: fast-to-sql-2.3.0.tar.gz
  • Upload date:
  • Size: 8.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/4.0.2 CPython/3.11.7

File hashes

Hashes for fast-to-sql-2.3.0.tar.gz
Algorithm Hash digest
SHA256 854cebe03fc20e1b51e700118e9c88189435cd3f088a81bf370bc4ae89f4a210
MD5 12106b24a05aae9c886840a2e8ffd404
BLAKE2b-256 e675ec246bce62becef75b69e421ee58fa6b9c9c75028bb6bfe3d7b9ccc6ac33

See more details on using hashes here.

File details

Details for the file fast_to_sql-2.3.0-py3-none-any.whl.

File metadata

  • Download URL: fast_to_sql-2.3.0-py3-none-any.whl
  • Upload date:
  • Size: 7.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/4.0.2 CPython/3.11.7

File hashes

Hashes for fast_to_sql-2.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 93d346ff2379960e0dcd300e2ae6f938e7153b976e3befb227bc172dfe790138
MD5 37dc3956056e90d6e25b07cc6f54c118
BLAKE2b-256 371a0e9a74f848fb3fbf2c840f69eeb22c281d97a398cd4a10af3adfadc9a6d4

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page