GeDa

GeDa is a Python package that helps you Get the Data for your project easily.

Installation

pip install geda

Usage

Using specific data provider class

from geda.data_providers.voc import VOCSemanticSegmentationDataProvider

root = "<directory>/<to>/<store>/<data>" # e.g. "data/VOC"
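# Create the provider for the chosen dataset and task
dataprovider = VOCSemanticSegmentationDataProvider(root)
# Download, extract and arrange the data under root
dataprovider.get_data()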

Using get_data shortcut

from geda import get_data

root = "<directory>/<to>/<store>/<data>" # e.g. "data/VOC"
dataprovider = get_data(name="VOC_SemanticSegmentation", root=root)
dataprovider.get_data()

The get_data function currently supports the following names: MNIST, DUTS, NYUDv2, VOC_InstanceSegmentation, VOC_SemanticSegmentation, VOC_PersonPartSegmentation, VOC_Main, VOC_Action, VOC_Layout, MPII, COCO_Keypoints
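
Since the dataset is selected by a plain string, a typo only surfaces at call time. The block below is a minimal sketch of a pre-check, assuming the list above matches your installed version. SUPPORTED_NAMES and check_name are hypothetical helpers written for illustration; they are not part of geda.

```python
import difflib

# Names copied from the list above; hypothetical constant, not exported by geda.
SUPPORTED_NAMES = [
    "MNIST", "DUTS", "NYUDv2",
    "VOC_InstanceSegmentation", "VOC_SemanticSegmentation",
    "VOC_PersonPartSegmentation", "VOC_Main", "VOC_Action", "VOC_Layout",
    "MPII", "COCO_Keypoints",
]


def check_name(name: str) -> str:
    """Return name unchanged if it is listed, else raise with a suggestion."""
    if name in SUPPORTED_NAMES:
        return name
    # Offer the closest listed name, if any is similar enough.
    close = difflib.get_close_matches(name, SUPPORTED_NAMES, n=1)
    hint = f" Did you mean {close[0]!r}?" if close else ""
    raise ValueError(f"Unsupported dataset name {name!r}.{hint}")
```

The checked name can then be passed on, for example get_data(name=check_name("VOC_Main"), root=root).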

What it does

Calling dataprovider.get_data() runs the data through the following pipeline:

  1. Download the data from source (specified by the _URLS variable in each module)
  2. Extract the archives if needed (when tar, zip or gz files were downloaded)
  3. Move the files to <root>/raw directory
  4. Find the split ids (file basenames or indices, depending on the dataset)
  5. Arrange files, i.e. move (or copy) files from <root>/raw directory to task-specific directories
  6. [Optional] Create labels in a specific format (e.g. YOLO)
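
To make steps 2 to 6 concrete, here is a rough sketch built only on the Python standard library. It is not geda's actual implementation: the function names (extract_to_raw, find_split_ids, arrange_files, to_yolo_line) are hypothetical, the split file is assumed to hold one id per line, and the download step is left out so the sketch runs offline.

```python
import shutil
from pathlib import Path


def extract_to_raw(archive: Path, root: Path) -> Path:
    """Steps 2-3: unpack an archive (zip, tar, tar.gz) into <root>/raw."""
    raw = root / "raw"
    raw.mkdir(parents=True, exist_ok=True)
    shutil.unpack_archive(str(archive), str(raw))
    return raw


def find_split_ids(split_file: Path) -> list[str]:
    """Step 4: read ids (assumed: first token of each non-empty line)."""
    lines = split_file.read_text().splitlines()
    return [line.split()[0] for line in lines if line.strip()]


def arrange_files(src_dir: Path, dst_dir: Path, ids: list[str], ext: str) -> int:
    """Step 5: copy the files named by ids into a task-specific directory."""
    dst_dir.mkdir(parents=True, exist_ok=True)
    copied = 0
    for sample_id in ids:
        src = src_dir / f"{sample_id}{ext}"
        if src.exists():  # ids without a matching file are skipped
            shutil.copy2(src, dst_dir / src.name)
            copied += 1
    return copied


def to_yolo_line(class_id: int, box: tuple[float, float, float, float],
                 img_w: int, img_h: int) -> str:
    """Step 6: convert a pixel box (xmin, ymin, xmax, ymax) to a YOLO label line.

    YOLO labels store the box centre and size, normalised by the image size.
    """
    xmin, ymin, xmax, ymax = box
    cx = (xmin + xmax) / 2 / img_w
    cy = (ymin + ymax) / 2 / img_h
    w = (xmax - xmin) / img_w
    h = (ymax - ymin) / img_h
    return f"{class_id} {cx:.6f} {cy:.6f} {w:.6f} {h:.6f}"
```

Copying rather than moving in arrange_files keeps <root>/raw intact, so the arrangement can be rerun without downloading again; the list above notes that either is possible.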

Example

Directory structure resulting from get_data(name="VOC_SemanticSegmentation", root="data/VOC"):

.
└── data
    └── VOC
        ├── raw
        │   ├── Annotations
        │   ├── ImageSets
        │   ├── JPEGImages
        │   ├── SegmentationClass
        │   └── SegmentationObject
        ├── SegmentationClass
        │   ├── annots
        │   ├── images
        │   ├── labels
        │   └── masks
        └── trainval_2012.tar
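
A quick way to confirm that a run produced this layout is to check for the directories shown in the tree. This is a small sketch under the assumption that the tree above is what your geda version produces; EXPECTED_DIRS and missing_dirs are hypothetical names, not part of geda.

```python
from pathlib import Path

# Directories taken from the tree above, relative to the dataset root.
EXPECTED_DIRS = [
    "raw/Annotations",
    "raw/ImageSets",
    "raw/JPEGImages",
    "raw/SegmentationClass",
    "raw/SegmentationObject",
    "SegmentationClass/annots",
    "SegmentationClass/images",
    "SegmentationClass/labels",
    "SegmentationClass/masks",
]


def missing_dirs(root: str) -> list[str]:
    """Return the expected directories that do not exist under root."""
    root_path = Path(root)
    return [d for d in EXPECTED_DIRS if not (root_path / d).is_dir()]
```

For the example above, missing_dirs("data/VOC") should return an empty list once the pipeline has finished.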

Currently supported datasets

Image classification

Image Segmentation

Keypoints detection

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

License

MIT

