Skip to main content

Solution for DS Team

Project description

utilsds

Utilsds is a library that includes classes and functions used in data science projects such as:

  • ds_statistics:

    • test_kruskal_wallis: Perform the Kruskal-Wallis statistical test.
  • transform_data:

    • DataTransformer: Transform data using various methods.
  • data_processing:

    • encode_one_hot: Encode categorical features using one-hot encoding.
    • convert_numerical_to_categorized: Convert numerical features to categorized intervals.
    • scale_train_test: Scale training and testing datasets.
    • resample_X_y: Resample training data and target columns.
  • data_split:

    • train_test_validation_split: Split data into training, testing, and validation sets.
  • visualization:

    • MetricsPlot: Compare metrics for different parameter values.
    • Radar: Create radar plots for visualizing data.
    • cluster_characteristics: Analyze cluster characteristics.
    • comparison_density: Compare density distributions.
    • feature_distribution_box: Visualize feature distributions per cluster.
    • elbow_visualisation: Visualize the elbow method for clustering.
    • describe_clusters_metrics: Describe metrics for clusters.
    • category_null_variables: Visualize null variables in categorical data.
    • normal_distr_plots: Visualize normal distribution plots.
    • distplot_limitations: Visualize limitations of distplot.
    • boxplot_limitations: Visualize limitations of boxplot.
    • violinplot_limitations: Visualize limitations of violinplot.
    • countplot_limitations: Visualize limitations of countplot.
    • categorical_variable_perc: Visualize percentage of categorical variables.
    • spearman_correlation: Visualize spearman correlation.
    • CalculateCrammersV: Calculate Crammer's V.
  • data_ops:

    • BigQuery operations:
      • load_bq_data: Load data from tables, views, and SQL files
      • save_bq_view, save_bq_table: Save views and tables
      • load_bq_procedure: Execute stored procedures
      • load_bq_details: Get table/view details and schema
      • load_bq_describe_data: Get data description using ML.DESCRIBE_DATA
      • delete_bq_data: Delete data with safety confirmations
      • dry_run: Perform dry runs to estimate query costs
    • Cloud Storage operations:
      • save_gcs_bucket: Create buckets
      • save_gcs_file, load_gcs_file: Save and load files (.pkl, .json, .csv, .html, .sql)
    • Local file operations:
      • save_local_file, load_local_file: Save and load files (.pkl, .json, .csv, .html, .sql)
  • confusion_matrix:

    • ConfusionMatrix: Generate and plot confusion matrices.
  • modeling:

    • Modeling: Manage modeling, metrics, and logging with Vertex AI.
  • hyperopt:

    • Hyperopt: Optimize hyperparameters using Hyperopt.
  • classifier:

    • Classifier: Fit, train, and manage classification models.
  • experiments:

    • VertexExperiments: Manage experiments with Vertex AI.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

utilsds-1.0.14.tar.gz (32.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

utilsds-1.0.14-py3-none-any.whl (35.4 kB view details)

Uploaded Python 3

File details

Details for the file utilsds-1.0.14.tar.gz.

File metadata

  • Download URL: utilsds-1.0.14.tar.gz
  • Upload date:
  • Size: 32.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.1

File hashes

Hashes for utilsds-1.0.14.tar.gz
Algorithm Hash digest
SHA256 1f72e196105076fd0da8da18c92f72cd1b1c335d1489034871cb8cc94e06b59e
MD5 edd928b426bb3fc153d90e485cb5b5de
BLAKE2b-256 4476cf95fb2eaefd11b6ef1e87774d6198b9e95d9c9b1955371ffca46c5ea3a5

See more details on using hashes here.

File details

Details for the file utilsds-1.0.14-py3-none-any.whl.

File metadata

  • Download URL: utilsds-1.0.14-py3-none-any.whl
  • Upload date:
  • Size: 35.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.1

File hashes

Hashes for utilsds-1.0.14-py3-none-any.whl
Algorithm Hash digest
SHA256 4098ba2cf2b9f61cefd32a96f86043edc201458416c4811fd1bfb3bcc7d1717f
MD5 4130049eb0434272940f01c210fae855
BLAKE2b-256 b362deefe0db7ac184882ffa245fdcf2b4d50f9db6548526a79f45e9d6f47e20

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page