Skip to main content

Solution for DS Team

Project description

utilsds

Utilsds is a library that includes classes and functions used in data science projects such as:

  • ds_statistics:

    • test_kruskal_wallis: Perform the Kruskal-Wallis statistical test.
  • transform_data:

    • DataTransformer: Transform data using various methods.
  • data_processing:

    • encode_one_hot: Encode categorical features using one-hot encoding.
    • convert_numerical_to_categorized: Convert numerical features to categorized intervals.
    • scale_train_test: Scale training and testing datasets.
    • resample_X_y: Resample training data and target columns.
  • data_split:

    • train_test_validation_split: Split data into training, testing, and validation sets.
  • visualization:

    • MetricsPlot: Compare metrics for different parameter values.
    • Radar: Create radar plots for visualizing data.
    • cluster_characteristics: Analyze cluster characteristics.
    • comparison_density: Compare density distributions.
    • feature_distribution_box: Visualize feature distributions per cluster.
    • elbow_visualisation: Visualize the elbow method for clustering.
    • describe_clusters_metrics: Describe metrics for clusters.
    • category_null_variables: Visualize null variables in categorical data.
    • normal_distr_plots: Visualize normal distribution plots.
    • distplot_limitations: Visualize limitations of distplot.
    • boxplot_limitations: Visualize limitations of boxplot.
    • violinplot_limitations: Visualize limitations of violinplot.
    • countplot_limitations: Visualize limitations of countplot.
    • categorical_variable_perc: Visualize percentage of categorical variables.
    • spearman_correlation: Visualize spearman correlation.
    • CalculateCrammersV: Calculate Crammer's V.
  • data_ops:

    • BigQuery operations:
      • load_bq_data: Load data from tables, views, and SQL files
      • save_bq_view, save_bq_table: Save views and tables
      • load_bq_procedure: Execute stored procedures
      • load_bq_details: Get table/view details and schema
      • load_bq_describe_data: Get data description using ML.DESCRIBE_DATA
      • delete_bq_data: Delete data with safety confirmations
      • dry_run: Perform dry runs to estimate query costs
    • Cloud Storage operations:
      • save_gcs_bucket: Create buckets
      • save_gcs_file, load_gcs_file: Save and load files (.pkl, .json, .csv, .html, .sql)
    • Local file operations:
      • save_local_file, load_local_file: Save and load files (.pkl, .json, .csv, .html, .sql)
  • confusion_matrix:

    • ConfusionMatrix: Generate and plot confusion matrices.
  • modeling:

    • Modeling: Manage modeling, metrics, and logging with Vertex AI.
  • hyperopt:

    • Hyperopt: Optimize hyperparameters using Hyperopt.
  • classifier:

    • Classifier: Fit, train, and manage classification models.
  • experiments:

    • VertexExperiments: Manage experiments with Vertex AI.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

utilsds-1.1.0.tar.gz (35.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

utilsds-1.1.0-py3-none-any.whl (40.3 kB view details)

Uploaded Python 3

File details

Details for the file utilsds-1.1.0.tar.gz.

File metadata

  • Download URL: utilsds-1.1.0.tar.gz
  • Upload date:
  • Size: 35.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.12.3

File hashes

Hashes for utilsds-1.1.0.tar.gz
Algorithm Hash digest
SHA256 b31cd453a9a0b5187b1ed6e563163d04dd31cd12c7650cd19f0600f8f4490f82
MD5 c39388075e377d87045b89ab8b0c0d9e
BLAKE2b-256 96e54a6e47e6af8ec1b211a9b1fa4b9c1c40c48c4a2b56f1553bf12f107b6efb

See more details on using hashes here.

File details

Details for the file utilsds-1.1.0-py3-none-any.whl.

File metadata

  • Download URL: utilsds-1.1.0-py3-none-any.whl
  • Upload date:
  • Size: 40.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.12.3

File hashes

Hashes for utilsds-1.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 ff5198baccf6fea4824c304b51a4966079b8b13b435b89a2b921c65d7317b886
MD5 04562749d790058e19d7775082a45daf
BLAKE2b-256 40ddcebee5360cc2754565128308f9b269752627fad528977f13dda287b4647a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page