Skip to main content

A Data science library for data science / data analysis teams

Project description

Dataramp

Code style: black Pylint Flake8 Scikit-learn

Welcome to the Dataramp documentation! Here you will find information about Dataramp, including some examples to get you started.

Dataramp

Dataramp is a Python library designed to streamline data science and data analysis workflows. It offers a collection of utility functions and tools tailored to assist data science teams in various aspects of their projects.

By providing a range of functionalities, Dataramp aims to enhance productivity and efficiency in data science projects, empowering teams to focus on deriving meaningful insights from their data.

Getting Started

Read the quick start guide here.

If you want to see some examples, you can look at the examples in the examples directory.

You can install Dataramp and learn more from PyPi.

Example

# Create and register a model pipeline
preprocessor = Pipeline([
    ('scaler', StandardScaler()),
    ('imputer', SimpleImputer())
])

pipeline = Pipeline([
    ('preprocess', preprocessor),
    ('classifier', LogisticRegression())
])

model_save(pipeline, "classifier", method="joblib", metadata={"dataset": "2023_sales"})
register_model(
    pipeline,
    name="sales_classifier",
    version="v1.0",
    metadata={
        "metrics": {"accuracy": 0.89},
        "serialization_method": "joblib"
    }
)

# Create versioned dataset
df = pd.read_csv("data.csv")
data_save(df, "processed_data", versioning=True, description="Initial cleaned version")

Potential Use Cases

  • Data Science Projects : Initialize projects with a standardized structure and manage datasets and models effectively.
  • Team Collaboration : Facilitate collaboration by providing clear project organization and versioning.
  • Reproducibility : Ensure reproducibility by tracking dataset versions, model metadata, and dependencies.
  • Automation : Integrate into CI/CD pipelines for automated testing, deployment, and dependency updates.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dataramp-0.3.4.tar.gz (34.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

dataramp-0.3.4-py2.py3-none-any.whl (35.3 kB view details)

Uploaded Python 2Python 3

File details

Details for the file dataramp-0.3.4.tar.gz.

File metadata

  • Download URL: dataramp-0.3.4.tar.gz
  • Upload date:
  • Size: 34.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.16

File hashes

Hashes for dataramp-0.3.4.tar.gz
Algorithm Hash digest
SHA256 f8086b0cec6fdfd12575c48f6613b3e1b9d0efd0599640e6c26a95e62a6b7e32
MD5 641b3d09df404e668123eb8fb4ba5d1f
BLAKE2b-256 1e006baf01bc2352d97ca9fa497222d66c6268a2212bf3789eac4f1161464f34

See more details on using hashes here.

File details

Details for the file dataramp-0.3.4-py2.py3-none-any.whl.

File metadata

  • Download URL: dataramp-0.3.4-py2.py3-none-any.whl
  • Upload date:
  • Size: 35.3 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.16

File hashes

Hashes for dataramp-0.3.4-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 53bdbb8c4a85f6f6fcc7912ff18b357624a5a7b46998e17ee5b44d2ebf66f868
MD5 c8dc7fd107dd388dd4fcfa2af5c8f6f8
BLAKE2b-256 0877852ff99f013e40df51a9a74f457e5874fc08b6298db22fcac8b9019b9d99

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page