Skip to main content

Deep Learning, Annotations, Training Data.

Project description

Diffgram.com

91% of teams take > 3 weeks to create their first dataset. And months to get to Beta.

Here's why: The process of creating datasets gets blocked by default. As shown above, each clock represents a stage in which a user often must wait for other users, or other processes to finish before continuing. Further, Teams create many sets. So after weeks of work the process repeats:

This is because the needed abstractions (such as various Templates) only become known through an iterative process as shown above. Often many iterations are needed before shipping, and ongoing usage of the system requires further iterations to be effective.

To manage all this, your team is likely doing a lot of "Extract Transform Load" operations. And managing sets with: "set_with_labels_good_one" and "good_one_really_this_time___v2" Or writing a ton of one-off scripts.

And that's just for creating MVPs and beta products. When you get to production, most current setups look something like this. Meaning even the best teams end up only shipping handfuls of models. Or investing heavily in recreating the wheel with infrastructure.

Introducing Diffgram

Where Diffgram Integrates Into Your System

Diffgram as your Training Data Database

Diffgram is all about Training Data: Data that's ready to be used by AI systems.

It's created by combining raw data with human centered meaning. For example, combining an image with a box identifying an object. The encoded meaning can be relatively simple, for example a single bounding box, or complex, such as a time series video with a graph of attributes.

Tell me moar

Motivation

  • An increase in complexity in annotations and frequency of data change.
  • Organization between data, people and teams on larger scale projects.

I need motivation

Use Cases

  • Create, Update, And Maintain Datasets
  • Create processes for working with Deep Learning systems
  • Compliance and threat actors
  • Launch faster

Use Cases Detail

Install

Full Documentation

Quickstart

pip install diffgram

On linux pip3 install diffgram

Credentials Guide Get Credentials from Diffgram.com (or your Private Deploy Link)

Get a client project:

from diffgram import Project

project = Project(project_string_id = "replace_with_project_string",
		  client_id = "replace_with_client_id",
		  client_secret = "replace_with_client_secret"	)

Tasks Introduction

Import Introduction

Updating Existing Instances

Pre-Label Example Video

Compatibility

Interfaces

Diffgram (Image and Video)

Scale AI

Datasaur

Labelbox

Contact us to request an interface!

Beta

Note the API/SDK is in beta and is undergoing rapid improvement. There may be breaking changes. Please see the API docs for the latest canonical reference and be sure to upgrade to latest ie: pip install diffgram --upgrade. We will attempt to keep the SDK up to date with the API.

Help articles for Diffgram.com See below for some examples.

Requires Python >=3.5

The default install through pip will install dependencies for local prediction (tensorflow opencv) as listed in requirements.txt. The only requirement needed for majority of functions is requests. If you are looking for a minimal size install and already have requests use the --no-dependencies flag ie pip install diffgram --no-dependencies

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

diffgram-0.2.7.tar.gz (27.0 kB view hashes)

Uploaded Source

Built Distribution

diffgram-0.2.7-py3-none-any.whl (34.8 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page