A Python package to benchmark query performance and comparison on PostgreSQL Database
Project description
Python package to benchmark query performance on a PostgreSQL database. It allows you to measure the execution time of queries over multiple runs, providing detailed metrics about each run's performance.
Installation
pip install pgbenchmark
Example
import psycopg2
from pgbenchmark import Benchmark
conn = psycopg2.connect(
"<< YOUR CONNECTION >>"
)
benchmark = Benchmark(db_connection=conn, number_of_runs=1000)
benchmark.set_sql("./test.sql")
for result in benchmark:
# {'run': X, 'sent_at': <DATETIME WITH MS>, 'duration': '0.000064'}
pass
""" View Summary """
print(benchmark.get_execution_results())
# {'runs': 1000, 'min_time': '0.00005', 'max_time': '0.000287', 'avg_time': '0.000072'}
You can also pass raw SQL as a String, instead of file
benchmark.set_sql("SELECT 1;")
It also supports SQLAlchemy connection engine
engine = create_engine("postgresql+psycopg2://.......")
conn = engine.connect()
# Set up benchmark class
benchmark = Benchmark(db_connection=conn, number_of_runs=5)
Example with Parallel or Threaded execution
⚠️ Please be careful. If you are running on Linux, pgbenchmark will load your cores on 100% !!!⚠️
from pgbenchmark import ParallelBenchmark # <<-------- NEW IMPORT
conn_params = {
"dbname": "postgres",
"user": "postgres",
"password": "",
"host": "localhost",
"port": "5432"
}
n_procs = 20 # Number of Processes (Cores basically)
n_runs_per_proc = 1_000
parallel_bench_pg = ParallelBenchmark(
num_processes=n_procs,
number_of_runs=n_runs_per_proc,
db_connection_info=conn_params
)
parallel_bench_pg.set_sql("SELECT * from information_schema.tables;") # Same as before
""" Unfortunately, as of now, you can't get execution results on the fly. """
parallel_bench_pg.run() # RUN THE BENCHMARK
results_pg = parallel_bench_pg.get_execution_results()
print(results_pg)
Example with Template Engine
From version 0.1.0 pgbenchmark supports simple Template Engine for queries.
import random
import string
from pgbenchmark import ParallelBenchmark
conn_params = {
"dbname": "postgres",
"user": "postgres",
"password": "asdASD123",
"host": "localhost",
"port": "5432"
}
n_procs = 20
n_runs_per_proc = 10
# Generator Function for Random Product Price
def generate_random_price():
return round(random.randint(10, 1000), 2)
# Generator Function for Random Product Name (String)
def generate_random_string(length=10):
characters = string.ascii_letters + string.digits
return ''.join(random.choice(characters) for _ in range(length))
parallel_bench_pg = ParallelBenchmark(
num_processes=n_procs,
number_of_runs=n_runs_per_proc,
db_connection_info=conn_params
)
# Define the SQL Query Template
query = """
INSERT INTO products (name, price, stock_quantity) VALUES ('{{product_name}}', {{price_value}}, 10);
"""
# ===============================
# Note that similar to Jinja2, you have to define template variables within Query
# {{product_name}}
# {{price_value}}
# ===============================
parallel_bench_pg.set_sql(query)
# Set formatters
parallel_bench_pg.set_sql_formatter(for_placeholder="price_value", generator=generate_random_price)
parallel_bench_pg.set_sql_formatter(for_placeholder="product_name", generator=generate_random_string)
# Run Benchmark
if __name__ == '__main__':
# Run the Parallel Benchmark
parallel_bench_pg.run()
results_pg = parallel_bench_pg.get_execution_results()
throughput = results_pg["throughput_runs_per_sec"]
avg_time = results_pg["avg_time"]
print("\n=============================================================================")
print(" Benchmark Results ")
print("=============================================================================")
print(f"Throughput (runs/sec): {throughput}")
print(f"Average Execution Time (sec): {avg_time}")
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
pgbenchmark-0.1.0.tar.gz
(19.4 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file pgbenchmark-0.1.0.tar.gz.
File metadata
- Download URL: pgbenchmark-0.1.0.tar.gz
- Upload date:
- Size: 19.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
43afe35c31fc94a906dbe8eadb858117a0b35d31ef1a8c752079d80bb7dd9fa4
|
|
| MD5 |
1c55755585ae942ea95aad51690e0073
|
|
| BLAKE2b-256 |
5aaf8ac88e31738abe76ee58a89725a7c7a2ae77d185d60817d8befcfe2458d2
|
File details
Details for the file pgbenchmark-0.1.0-py3-none-any.whl.
File metadata
- Download URL: pgbenchmark-0.1.0-py3-none-any.whl
- Upload date:
- Size: 21.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c28a7e64c7cd073ace6b4300d128968dbab7f655b0694e51e3617bb62bbfb1a4
|
|
| MD5 |
6b35821bab90374804aba67764ff131c
|
|
| BLAKE2b-256 |
781d25a76dccb2292cdbe59b44123bc87cc4897bdb57232551065f33439e0501
|