Skip to main content

The official package for the easy-VQA dataset.

Project description


Build Status PyPI

The official repository for the easy-VQA dataset. Contains:

  • the official Python package for the dataset
  • the source code for generating the dataset

About the Dataset

easy-VQA contains

  • 4000 train images and 38575 train questions.
  • 1000 test images and 9673 test questions.
  • 13 total possible answers.
  • 28407 training questions that are yes/no.
  • 7136 testing questions that are yes/no.

All images are 64x64 color images. See a live demo of a model trained on the dataset.

Example Images

Example Questions

  • What color is the rectangle?
  • Does the image contain a triangle?
  • Is no blue shape present?
  • What shape does the image contain?

Installing the Package

pip install easy-vqa

Using the Package


Each question has 3 parts:

  • the question text
  • the answer
  • the image ID

The question getters return corresponding arrays for each of the 3 parts:

from easy_vqa import get_train_questions, get_test_questions

train_questions, train_answers, train_image_ids = get_train_questions()
test_questions, test_answers, test_image_ids = get_test_questions()

# Question 0 is at index 0 for all 3 arrays:
print(train_questions[0]) # what shape does the image contain?
print(train_answers[0])   # circle
print(train_image_ids[0]) # 0


The image path getters return dicts that map image ID to absolute paths that can be used to load the image.

from easy_vqa import get_train_image_paths, get_test_image_paths

train_image_paths = get_train_image_paths()
test_image_paths = get_test_image_paths()

print(train_image_paths[0]) # ends in easy_vqa/data/train/images/0.png


The answers getter returns an array of all possible answers.

from easy_vqa import get_answers

answers = get_answers()

print(answers) # ['teal', 'brown', 'black', 'gray', 'yes', 'blue', 'rectangle', 'yellow', 'triangle', 'red', 'circle', 'no', 'green']

Generating the Dataset

The easy-VQA dataset was generated by running

python gen_data/

which writes to the easy_vqa/data/ directory. Be sure to install the dependencies for dataset generation running

pip install -r gen_data/requirements.txt

If you want to generate a larger easy-VQA dataset, simply modify the NUM_TRAIN and NUM_TEST constants in Otherwise, if you want to modify the dataset itself, the files and code in the gen_data/ directory should be pretty self-explanatory.

Project details

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for easy-vqa, version 1.0b1
Filename, size File type Python version Upload date Hashes
Filename, size easy_vqa-1.0b1-py3-none-any.whl (3.2 MB) File type Wheel Python version py3 Upload date Hashes View hashes
Filename, size easy-vqa-1.0b1.tar.gz (2.0 MB) File type Source Python version None Upload date Hashes View hashes

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN DigiCert DigiCert EV certificate StatusPage StatusPage Status page