
Module that contains different packages to perform data-related operations through Sinapsis templates.



Sinapsis Data Tools

Mono repo with packages to read, write, and process data, including images, audio, video, and bytes objects. The packages can be easily extended to handle other types of data.

🐍 Installation • 📦 Packages • 📚 Usage example • 📙 Documentation • 🔍 License

🐍 Installation

This mono repo consists of different packages to handle data:

  • sinapsis-data-analysis
  • sinapsis-data-readers
  • sinapsis-data-visualization
  • sinapsis-data-writers
  • sinapsis-generic-data-tools

Install using your package manager of choice. We encourage the use of uv.

Example with uv:

  uv pip install sinapsis-data-readers --extra-index-url https://pypi.sinapsis.tech

or with raw pip:

  pip install sinapsis-data-readers --extra-index-url https://pypi.sinapsis.tech

Replace the package name with the one you want to install.

[!IMPORTANT] Templates in each package may require extra dependencies. For development, we recommend installing the package with all the optional dependencies:

with uv:

  uv pip install sinapsis-data-readers[all] --extra-index-url https://pypi.sinapsis.tech

or with raw pip:

  pip install sinapsis-data-readers[all] --extra-index-url https://pypi.sinapsis.tech

Change the name of the package accordingly.

[!TIP] You can also install all the packages within this project:

  uv pip install sinapsis-data-tools[all] --extra-index-url https://pypi.sinapsis.tech

[!NOTE] Some templates also need system dependencies (e.g., ffmpeg). The installation depends on your OS. For Linux:

  apt-get install -y ffmpeg
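
As a quick way to confirm what the steps above installed, the following is a minimal sketch that reports which of the listed packages are present and whether ffmpeg is on the PATH. It relies only on the Python standard library; the helper names `installed_version` and `check_environment` are hypothetical and not part of Sinapsis.

```python
import shutil
from importlib import metadata

# Package names as listed in the installation section.
PACKAGES = [
    "sinapsis-data-analysis",
    "sinapsis-data-readers",
    "sinapsis-data-visualization",
    "sinapsis-data-writers",
    "sinapsis-generic-data-tools",
]


def installed_version(name):
    """Return the installed version of a distribution, or None if absent."""
    try:
        return metadata.version(name)
    except metadata.PackageNotFoundError:
        return None


def check_environment(packages=PACKAGES, tools=("ffmpeg",)):
    """Collect installed package versions and the paths of system tools."""
    return {
        "packages": {name: installed_version(name) for name in packages},
        "tools": {tool: shutil.which(tool) for tool in tools},
    }


if __name__ == "__main__":
    report = check_environment()
    for name, version in report["packages"].items():
        print(f"{name}: {version or 'not installed'}")
    for tool, path in report["tools"].items():
        print(f"{tool}: {path or 'not found on PATH'}")
```

Run it inside the same environment you installed into; a package reported as "not installed" may simply live in a different virtual environment.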

📦 Packages

Packages summary
  • Sinapsis Data Readers

    • Audio Readers
      Read audio files from several formats using Pydub, Soundfile, among others.
    • Dataset Readers
      Read and manipulate tabular datasets from the scikit libraries, among others.
    • Image Readers
      Read and manipulate images from COCO, paths in CSVs, whole folders, etc.
    • Text Readers
      Read text data from a simple string and other sources.
    • Video Readers
      Read video frames using CV2, Dali, FFMPEG, Torch, among others.
  • Sinapsis Data Visualization
    Visualize data distributions and manifolds, as well as draw all kinds of annotations on images, such as bounding boxes, keypoints, labels, oriented bounding boxes, segmentation masks, etc.

  • Sinapsis Data Writers
    Write data to many kinds of files.

    • Annotation Writers
      Save text annotations to JSON, geometries to polygons, etc.
    • Audio Writers
      Save to audio files using Soundfile, among others.
    • Image Writers
      Save to image files using CV2, among others.
    • Video Writers
      Save to video files using CV2 or FFMPEG, among others.
  • Sinapsis Generic Data Tools
    Wide range of miscellaneous tools to manipulate your data.

[!TIP] Use the CLI command sinapsis info --all-template-names to list all the Template names installed with Sinapsis Data Tools.

[!TIP] Use the CLI command sinapsis info --example-template-config TEMPLATE_NAME to produce an example Agent config for the Template specified in TEMPLATE_NAME.

[!TIP] Run the Docker image:

  docker run -it --gpus all sinapsis-data-tools:base bash

You need to activate the environment inside the image:

  source .venv/bin/activate

For example, for ImageSaver use sinapsis info --example-template-config ImageSaver to produce the following example config:

agent:
  name: my_test_agent
  description: agent to save image locally
templates:
- template_name: InputTemplate
  class_name: InputTemplate
  attributes: {}
- template_name: ImageSaver
  class_name: ImageSaver
  template_input: InputTemplate
  attributes:
    save_dir: /path/to/save/dir
    extension: jpg
    root_dir: '/path/to/sinapsis/cache'
    save_full_image: true
    save_bbox_crops: false
    save_mask_crops: false
    min_bbox_dim: 5
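
In the config above, the ImageSaver entry points at InputTemplate through its template_input field. The following is a minimal sketch of a sanity check for that wiring, assuming template_input is meant to name another template_name defined in the same file (an assumption drawn from the example, not from the Sinapsis documentation). The config is written as a plain Python dict so the sketch needs no YAML parser, and `find_unknown_inputs` is a hypothetical helper.

```python
def find_unknown_inputs(config):
    """Return (template_name, template_input) pairs whose input is undefined."""
    templates = config.get("templates", [])
    known = {t["template_name"] for t in templates}
    problems = []
    for t in templates:
        source = t.get("template_input")  # optional: some templates have none
        if source is not None and source not in known:
            problems.append((t["template_name"], source))
    return problems


# Dict equivalent of the YAML config above (attributes trimmed for brevity).
config = {
    "agent": {"name": "my_test_agent", "description": "agent to save image locally"},
    "templates": [
        {"template_name": "InputTemplate", "class_name": "InputTemplate", "attributes": {}},
        {
            "template_name": "ImageSaver",
            "class_name": "ImageSaver",
            "template_input": "InputTemplate",
            "attributes": {"save_dir": "/path/to/save/dir", "extension": "jpg"},
        },
    ],
}

if __name__ == "__main__":
    print(find_unknown_inputs(config))
```

An empty list means every template_input refers to a template defined in the config; a misspelled name would show up as a (template, input) pair.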

📚 Usage example

Example agent config

You can copy and paste the following config and run it using the sinapsis CLI, changing the data_dir attribute in the FolderImageDatasetCV2 template and the root_dir attribute in the ImageSaver template.

agent:
  name: my_test_agent
  description: agent to save image locally
templates:
- template_name: InputTemplate
  class_name: InputTemplate
  attributes: {}
- template_name: FolderImageDatasetCV2
  class_name: FolderImageDatasetCV2
  attributes:
    data_dir: /path/to/image
    pattern: '**/*'
    batch_size: 1
    load_on_init: true
    label_path_index: 0
    is_ground_truth: false

- template_name: ImageSaver
  class_name: ImageSaver
  template_input: FolderImageDatasetCV2
  attributes:
    save_dir: /path/to/save/dir
    extension: jpg
    root_dir: '/path/to/sinapsis/cache'
    save_full_image: true
    save_bbox_crops: false
    save_mask_crops: false
    min_bbox_dim: 5

To run, simply use:

sinapsis run name_of_the_config.yml
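
If you would rather script these steps, the following is a minimal sketch that fills in data_dir and root_dir, writes the config to disk, and builds the sinapsis run command shown above. The helpers `write_config` and `run_command` are hypothetical, and the substitution is plain text, so it assumes the paths contain no characters that need YAML escaping.

```python
import shutil
import subprocess
from pathlib import Path
from string import Template

# The usage-example config, with the two paths turned into placeholders.
CONFIG_TEMPLATE = Template("""\
agent:
  name: my_test_agent
  description: agent to save image locally
templates:
- template_name: InputTemplate
  class_name: InputTemplate
  attributes: {}
- template_name: FolderImageDatasetCV2
  class_name: FolderImageDatasetCV2
  attributes:
    data_dir: $data_dir
    pattern: '**/*'
    batch_size: 1
    load_on_init: true
    label_path_index: 0
    is_ground_truth: false
- template_name: ImageSaver
  class_name: ImageSaver
  template_input: FolderImageDatasetCV2
  attributes:
    save_dir: /path/to/save/dir
    extension: jpg
    root_dir: '$root_dir'
    save_full_image: true
    save_bbox_crops: false
    save_mask_crops: false
    min_bbox_dim: 5
""")


def write_config(path, data_dir, root_dir):
    """Fill in the two paths and write the config file; return its Path."""
    path = Path(path)
    path.write_text(CONFIG_TEMPLATE.substitute(data_dir=data_dir, root_dir=root_dir))
    return path


def run_command(config_path):
    """Build the argv for `sinapsis run <config>`."""
    return ["sinapsis", "run", str(config_path)]


if __name__ == "__main__":
    cfg = write_config("name_of_the_config.yml", "/path/to/image", "/path/to/sinapsis/cache")
    if shutil.which("sinapsis"):
        # Only invoke the CLI when it is actually installed.
        subprocess.run(run_command(cfg), check=True)
    else:
        print("sinapsis CLI not found on PATH")
```

The save_dir value is left as the placeholder from the original config; adjust it the same way if needed.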

📙 Documentation

Documentation for this and other Sinapsis packages is available on the Sinapsis website.

Tutorials for different projects within Sinapsis are available on the Sinapsis tutorials page.

🔍 License

This project is licensed under the AGPLv3 license, which encourages open collaboration and sharing. For more details, please refer to the LICENSE file.

For commercial use, please refer to our official Sinapsis website for information on obtaining a commercial license.
