Skip to main content

A sample library of Data Transformation functions.

Project description

Data Transformation Library

About

This project is an exercise primarily focused on building Python libraries, publishing packages to PyPI and dependency management using Poetry.

The task is to implement three data transformation functions:

  • Transpose - flipping of the input matrix diagonally, meaning that the row and column indices of a matrix are switched and a new matrix is produced.
  • Time Series Windowing - producing lists or numpy arrays containing a subset of elements of the input list or numpy array. The size of the window is defined by passing size, stride and shift parameters. Time series windowing is used for making informed decisions in economics, climate, epidemiology and many other fields.
  • Cross-Correlation (Convolution) - used in convolutional neural networks in machine learning libraries. Cross-correlation and convolution are both operations applied to images. Cross-correlation means sliding a kernel (filter) across an image. Convolution means sliding a flipped kernel across an image. In practice using one over the other does not change the result, just the resulting values are learned in a flipped orientation. 2D Convolutions are essential in the advancement of convolutional neural networks and various image processing filters such as blurring, sharpening, edge detection, and others.

Functions were built using Python and Numpy (as per requirements of this task). Having the knowledge of how to build such functions by yourself provides the flexibility in ones specific uses cases, as some times the already built functions in various libraries (PyTorch, cv2, etc.) have constraints and changing them can be difficult or even impossible and impractical.

For package management we were required to use Poetry.

Libraries Used

For this project only Python's standard library as well as Numpy (were it was required) were used.

For package management Poetry was used.

How to Use

As this project is an exercise on building Python libraries, the built library can be installed from PyPI, were it is being published.

To do so in the terminal window type pip install dtl_functions_rd. After the library is installed, open the main.py file that is located in the main project directory to see a few examples on how to use the functions of this data transformation library.

Possible Future Improvements

  • Version handling

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dtl_functions_rd-0.1.1.tar.gz (3.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

dtl_functions_rd-0.1.1-py3-none-any.whl (3.9 kB view details)

Uploaded Python 3

File details

Details for the file dtl_functions_rd-0.1.1.tar.gz.

File metadata

  • Download URL: dtl_functions_rd-0.1.1.tar.gz
  • Upload date:
  • Size: 3.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.4

File hashes

Hashes for dtl_functions_rd-0.1.1.tar.gz
Algorithm Hash digest
SHA256 532773a02adf1819d2fc446cf9884aac877bea07b3e76c7ba3e36f676457d775
MD5 cf5546495d4385057bd3becdfdd9b137
BLAKE2b-256 1582d9df7db9cbc28d393f8c4f4357c75fd76bc3ee0026a35b68d23c89738aec

See more details on using hashes here.

File details

Details for the file dtl_functions_rd-0.1.1-py3-none-any.whl.

File metadata

File hashes

Hashes for dtl_functions_rd-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 5e33b91b932be3aae028c898819b893a09dd4bd94c96e0c052ad0233719e7f96
MD5 3b1e2a29fdc42a180e7d5e61f6fe59e4
BLAKE2b-256 125de2e3b6820e8f311e5650d02feaef67529705e32efe5ab72f28efdeae8375

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page