A modular and open-source AI toolkit for time-series processing, aimed at fault detection and classification in oil well operation
Project description
3W Toolkit
- About
- Architecture Overview
- 3W Dataset
- Installation & Setup
- Jupyter Notebooks & Examples
- Contributing
📘 About
The evolution of machine learning has been catalyzed by the rapid advancement in data acquisition systems, scalable storage, high-performance processing, and increasingly efficient model training through matrix-centric hardware (e.g., GPUs). These advances have enabled the deployment of highly parameterized AI models in real-world applications such as health care, finance, and industrial operations.
In the oil & gas sector, the widespread availability of low-cost sensors has driven a paradigm shift from reactive maintenance to condition-based monitoring (CBM), where faults are detected and classified during ongoing operation. This approach minimizes downtime and improves operational safety. The synergy between AI and big data analysis has thus enabled the development of generalizable classifiers that require minimal domain knowledge and can be effectively adapted to a wide range of operational scenarios.
In this context, we present 3WToolkit+, a modular and open-source AI toolkit for time-series processing, aimed at fault detection and classification in oil well operation. Building upon the experience with the original 3WToolkit system and leveraging the Petrobras 3W Dataset, 3WToolkit introduces enhanced functionalities, such as advanced data imputation, deep feature extraction, synthetic data augmentation, and high-performance computing capabilities for model training.
The development of the 3WToolkit+ is the result of a collaborative partnership between Petrobras, with a focus on the CENPES research center, and the COPPE/Universidade Federal do Rio de Janeiro (UFRJ). This joint effort brings together complementary strengths: COPPE/UFRJ contributes decades of proven expertise in signal processing and machine learning model development, while CENPES offers access to highly specialized technical knowledge and real-world operational challenges in the oil and gas sector. This synergy ensures that 3WToolkit+ is both scientifically rigorous and practically relevant, addressing complex scenarios with robust and scalable AI-based solutions for time-series analysis and fault detection in oil well operations.
🧩 Architecture Overview
The following image illustrates the high-level architecture of the 3W Toolkit, designed to support the full pipeline of machine learning applications using the 3W dataset. From raw data ingestion to model evaluation and delivery to end users.
The schema below illustrates the main classes of the toolkit.
To better understand how the system is organized, we can divide all classes into two main abstraction layers: Application and Core.
-
The
Corelayer contains all configuration (Config) classes. These classes define the parameters required by each functional module. In other words, they describe how each component should behave, specifying its inputs and parameters. -
The
Applicationlayer contains the operational classes, which implement the actual logic for interacting with datasets, preprocessing tools, feature extraction, models, evaluation, visualization, and pipelines. These are the classes that you will uses in workflows.
📊 3W Dataset
The 3W dataset serves as a reference dataset for this project and is hosted on Figshare. However, the toolkit is not limited to the 3W dataset and can be adapted for other datasets as well.
Further details on the 3W dataset’s structure, preprocessing, and usage are available in the 3W_DATASET_STRUCTURE.md file.
⚙️ Installation & Setup
It is possible to perform the installation in different ways.
1. Fork or clone the repository
Option A: Fork
- Go to https://github.com/petrobras/3W
- Click “Fork” in the top-right corner to create a copy under your GitHub account.
- Clone your forked repository:
git clone git@github.com:<your-username>/3W.git
Option B: Clone
git clone git@github.com:petrobras/3W.git
2. Install the package
Option A: Using pip
pip install -e .
Option B: Using uv
uv venv .venv
source venv/bin/activate
uv pip install -e .
Option C: Using conda
conda env create -f environment.yml
conda activate 3W
🪐 Jupyter Notebooks & Examples
A curated set of ready-to-use jupyter notebooks that demonstrate how to use the toolkit to common fault detection tasks using the 3W dataset. These examples accelerate onboarding and reproducibility.
The set of notebooks can be found in the notebooks folder
🤝 Contributing
We welcome contributions to help us improve and expand the functionality of the 3W toolkit. To ensure a smooth collaboration process, please follow our contrubuting guidelines here.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file threewtoolkit-1.0.0.tar.gz.
File metadata
- Download URL: threewtoolkit-1.0.0.tar.gz
- Upload date:
- Size: 133.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.9.9 {"installer":{"name":"uv","version":"0.9.9"},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a55ff4ac0d480d1ec7d19ca2e70948e4d4d1444a7841ef0ada2f3aeda7705b75
|
|
| MD5 |
3b70749d088add2ccc79aa7fe575b694
|
|
| BLAKE2b-256 |
5dadf8af007276d45d93d1833aed07dfe0e15788ac74f8d317fa037e1b70ae76
|
File details
Details for the file threewtoolkit-1.0.0-py3-none-any.whl.
File metadata
- Download URL: threewtoolkit-1.0.0-py3-none-any.whl
- Upload date:
- Size: 111.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.9.9 {"installer":{"name":"uv","version":"0.9.9"},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3530b81185db51c5ece2eb59ec511434ace86016aacb622ec64a394d593c33c6
|
|
| MD5 |
9e31a4d15615f8556b78bcf9409e2c65
|
|
| BLAKE2b-256 |
7579330830a5d4edc0e89337e6ff407ca3a1a9ed7ce08c9e9a2ac2a9456641a1
|