A Python package for loading petroleum datasets

These details have not been verified by PyPI

Project links

Project description

per-datasets

A Python package for loading reservoir datasets from API endpoints.

Installation

pip install per-datasets

Quick Start

Option 1: Using Global API Key (Recommended)

First, set your API key globally:

# Set API key globally (works across all projects)
per-datasets set-key "your_api_key_here"

# Or use interactive setup
per-datasets interactive

Then use in your Python code:

import per_datasets as pds

# Initialize without API key (uses global key)
pds.initialize()

# Load a random reservoir dataset
df_random = pds.reservoir.load_random()
print(f"Loaded dataset with shape: {df_random.shape}")

Option 2: Using API Key in Code

import per_datasets as pds

# Initialize with your API key
pds.initialize('your_api_key_here')

# Load a random reservoir dataset
df_random = pds.reservoir.load_random()
print(f"Loaded dataset with shape: {df_random.shape}")

Workflows

The package includes Dockerized workflows for common operations:

Available Workflows

Add Workflow - Adds two numbers together
Subtract Workflow - Subtracts one number from another
PINN Workflow - Trains a Physics-Informed Neural Network (Transformer-based)

Running Workflows in Python

You can run workflows directly in Python:

from per_datasets.workflows import add, subtract, pinn

# Run simple workflows
print(add(5, 3))       # 8
print(subtract(10, 4)) # 6

# Run PINN training workflow
results = pinn(epochs=50)
print(f"Final Loss: {results['final_loss']}")

# Visualize the loss history dynamically
from per_datasets import visual # Or use pds.visual if imported as pds
visual.line_plot(results, y='loss_history', title="PINN Training Loss")

Building Workflow Containers

# Build all workflow Docker images
./build_workflows.sh

# Or build individually
docker build -t perd-add-workflow -f per_datasets/workflows/add/Dockerfile .
docker build -t perd-subtract-workflow -f per_datasets/workflows/substract/Dockerfile .

Running Workflows

# Run add workflow
docker run --rm perd-add-workflow 5.2 3.8

# Run subtract workflow
docker run --rm perd-subtract-workflow 10.5 4.3

See per_datasets/workflows/README.md for more details.

Command Line Interface

The package includes a CLI for managing API keys globally:

# Set API key globally
per-datasets set-key "your_api_key_here"

# Check configuration status
per-datasets status

# Get stored API key (masked)
per-datasets get-key

# Remove API key
per-datasets remove-key

# Interactive setup
per-datasets interactive

# Clear all configuration
per-datasets clear

# Show help
per-datasets --help

Complete Usage Examples

import per_datasets as pds

# Initialize (uses global key if available)
pds.initialize()

# Load a random reservoir dataset
df_random = pds.reservoir.load_random()
print(f"Loaded dataset with shape: {df_random.shape}")

# Load a specific dataset by ID
df_specific = pds.reservoir.load('your_dataset_id')

# Get information about available datasets
info = pds.get_dataset_info()

API Reference

`initialize(api_key=None)`

Initialize the per_datasets module with API credentials.

Parameters:

api_key (str, optional): The API key for authentication. If not provided, uses globally stored key.

Note: If no API key is provided and none is stored globally, raises a ValueError with instructions to set a global key.

`load_random()`

Loads a random reservoir model from the API endpoint and returns as pandas DataFrame.

Returns:

pandas.DataFrame: A DataFrame containing the dataset

Configuration Management

The package stores configuration in ~/.per_datasets/config.json by default:

{
  "api_key": "your_api_key_here"
}

Benefits of Global Configuration:

✅ No API key in code: Keep sensitive keys out of your source code
✅ Cross-project: Use the same API key across multiple projects
✅ Secure: API keys are stored in user's home directory
✅ Override: Can still provide API key in code to override global setting
✅ Easy management: Use CLI commands to manage keys

Security Notes:

API keys are stored in plain text in your home directory
Only you can access the configuration file
Consider using environment variables for production deployments

Dependencies

requests>=2.25.1
pandas>=1.3.0

License

MIT

Contributing

Fork the repository
Create a feature branch
Make your changes
Add tests if applicable
Submit a pull request

Development

To set up the development environment:

git clone https://github.com/P-E-R-D/library-py.git
cd per-datasets
pip install -e .

Building and Publishing

Automatic Deployment (Recommended)

This package uses GitHub Actions for automatic deployment to PyPI:

Make your changes to the code
Update version numbers in per_datasets/__init__.py and pyproject.toml
Create a git tag with the new version:
```
git tag v0.2.0
git push origin v0.2.0
```
GitHub Actions automatically builds and uploads to PyPI!

See DEPLOYMENT.md for detailed setup instructions.

Manual Publishing

python -m build
twine upload dist/*

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.0.7a2 pre-release

Mar 7, 2026

0.0.7a1 pre-release

Mar 7, 2026

0.0.6

Mar 6, 2026

0.0.6a4 pre-release

Mar 6, 2026

0.0.6a3 pre-release

Mar 4, 2026

0.0.6a2 pre-release

Mar 4, 2026

0.0.6a1 pre-release

Mar 4, 2026

0.0.5

Mar 3, 2026

0.0.5a1 pre-release

Feb 13, 2026

0.0.4a2 pre-release

Feb 13, 2026

0.0.4a1 pre-release

Feb 13, 2026

This version

0.0.4a0 pre-release

Feb 13, 2026

0.0.3b1 pre-release

Feb 12, 2026

0.0.3b0 pre-release

Feb 12, 2026

0.0.3a0 pre-release

Jan 15, 2026

0.0.2a0 pre-release

Nov 12, 2025

0.0.1a0 pre-release

Nov 6, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

per_datasets-0.0.4a0.tar.gz (21.5 kB view details)

Uploaded Feb 13, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

per_datasets-0.0.4a0-py3-none-any.whl (24.1 kB view details)

Uploaded Feb 13, 2026 Python 3

File details

Details for the file per_datasets-0.0.4a0.tar.gz.

File metadata

Download URL: per_datasets-0.0.4a0.tar.gz
Upload date: Feb 13, 2026
Size: 21.5 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.14

File hashes

Hashes for per_datasets-0.0.4a0.tar.gz
Algorithm	Hash digest
SHA256	`8a07aa4181cd93585995b623c9af5aa1cc6b0f4ef74e193d0105b98a2fe525ef`
MD5	`9295adf4c7e04294ffa72f2b5a07f291`
BLAKE2b-256	`3f6c82aea54d03e7cb819fc82a25fa5e6d668b1886080562701150e3a6a84421`

See more details on using hashes here.

File details

Details for the file per_datasets-0.0.4a0-py3-none-any.whl.

File metadata

Download URL: per_datasets-0.0.4a0-py3-none-any.whl
Upload date: Feb 13, 2026
Size: 24.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.14

File hashes

Hashes for per_datasets-0.0.4a0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`3009c0223d4f70859bc92a9ba7994c350bc649889dfb5340ca863b9978626259`
MD5	`7aa2bbee98c1a3e449a608c5b101c8ba`
BLAKE2b-256	`2170ca5ec4b6e22d25d42ff9111700aca1c0fe43674392a66b97c98116c4915b`

See more details on using hashes here.

per-datasets 0.0.4a0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

per-datasets

Installation

Quick Start

Option 1: Using Global API Key (Recommended)

Option 2: Using API Key in Code

Workflows

Available Workflows

Running Workflows in Python

Building Workflow Containers

Running Workflows

Command Line Interface

Complete Usage Examples

API Reference

initialize(api_key=None)

load_random()

Configuration Management

Benefits of Global Configuration:

Security Notes:

Dependencies

License

Contributing

Development

Building and Publishing

Automatic Deployment (Recommended)

Manual Publishing

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`initialize(api_key=None)`

`load_random()`