The Room environment

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

The Room environment - v2

For the documentation of RoomEnv-v0 and RoomEnv-v1, click the corresponding buttons.

This document, RoomEnv-v2, is the most up-to-date one.

We have released a challenging OpenAI Gym compatible environment. The best strategy for this environment is to have both episodic and semantic memory systems. See the paper for more information.

This env is added to the PyPI server:

pip install room-env

Data collection

Data is collected from querying ConceptNet APIs. For simplicity, we only collect triples whose format is (head, AtLocation, tail). Here head is one of the 80 MS COCO dataset categories. This was kept in mind so that later on we can use images as well.

If you want to collect the data manually, then run below:

python collect_data.py

The RoomDes

You can run the RoomDes by

from room_env.des import RoomDes

des = RoomDes()
des.run(debug=True)

with debug=True it'll print events (i.e., state changes) to the console.

{'resource_changes': {'desk': -1, 'lap': 1},
 'state_changes': {'Vincent': {'current_time': 1,
                               'object_location': {'current': 'desk',
                                                   'previous': 'lap'}}}}
{'resource_changes': {}, 'state_changes': {}}
{'resource_changes': {}, 'state_changes': {}}
{'resource_changes': {},
 'state_changes': {'Michael': {'current_time': 4,
                               'object_location': {'current': 'lap',
                                                   'previous': 'desk'}},
                   'Tae': {'current_time': 4,
                           'object_location': {'current': 'desk',
                                               'previous': 'lap'}}}}

RoomEnv-v2

import gym
import room_env

env = gym.make("RoomEnv-v2")
observation, info = env.reset()
while True:
    observation, reward, done, info = env.step(0)
    if done:
        break

Every time when an agent takes an action, the environment will give you three memory systems (i.e., episodic, semantic, and short-term), as an observation. The goal of the agent is to learn a memory management policy. The actions are:

0: Put the short-term memory into the epiosdic memory system.
1: Put it into the semantic.
2: Just forget it.

The memory systems will be managed according to your actions, and they will eventually used to answer questions. You don't have to worry about the question answering. It's done by the environment. The better you manage your memory systems, the higher chances that your agent can answer more questions correctly!

Take a look at this repo for an actual interaction with this environment to learn a policy.

Contributing

Contributions are what make the open source community such an amazing place to be learn, inspire, and create. Any contributions you make are greatly appreciated.

Fork the Project
Create your Feature Branch (git checkout -b feature/AmazingFeature)
Run make test && make style && make quality in the root repo directory, to ensure code quality.
Commit your Changes (git commit -m 'Add some AmazingFeature')
Push to the Branch (git push origin feature/AmazingFeature)
Open a Pull Request

Cite our paper

new paper bibtex coming soon

Cite our code

Authors

License

MIT

Project details

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

2.0.1

Apr 28, 2024

2.0.0

Apr 20, 2024

1.0.3

Apr 5, 2024

1.0.2

Mar 15, 2023

1.0.1

Dec 5, 2022

0.2.2

Oct 13, 2022

This version

0.2.1

Sep 28, 2022

0.2

Sep 27, 2022

0.1.5

Apr 4, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

room_env-0.2.1.tar.gz (56.2 kB view hashes)

Uploaded Sep 28, 2022 Source

Built Distribution

room_env-0.2.1-py3-none-any.whl (63.0 kB view hashes)

Uploaded Sep 28, 2022 Python 3

Hashes for room_env-0.2.1.tar.gz

Hashes for room_env-0.2.1.tar.gz
Algorithm	Hash digest
SHA256	`420c1a4e72feeba5a1845d32eb8885d072fe8bb07b474437ed4046b66dbb44ce`
MD5	`9d8f79b176b9dc8a8f82fa5f349e05b0`
BLAKE2b-256	`ac3c7d30a0b0b6d979be8bbb454620084ae8ae8d1332650d8a94d623cd9c72db`

Hashes for room_env-0.2.1-py3-none-any.whl

Hashes for room_env-0.2.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`456c00f5f25b529199e577ddc0f0ecb9e59945ccb17e7ec8728a449443c35491`
MD5	`ce0a7bdd82a648004eb4539e00e50858`
BLAKE2b-256	`97ed54c19dbcdba6e9e443c234871ac3ec0afa071cdf438254073b2887715359`