Skip to main content

Easy-to-use tools for Curriculum Learning

Project description

academia

This package’s purpose is to provide easy-to-use tools for Curriculum Learning. It is a part of an engineering thesis at Warsaw University of Technology that touches on the topic of curriculum learning.

Documentation

https://academia.readthedocs.io/

Sources

An unordered list of interesting books and papers

Books

  1. Reinforcement Learning: An Introduction (Barto, Sutton)

Papers

Paper Link Short Description Related Papers
Curriculum Learning for Reinforcement Learning Domains: A Framework and Survey Survey of curriculum learning papers. Formalises curriuclum learning based on a variety of attributes and gives a good introduction into the topic.
Curriculum Design for Machine Learners in Sequential Decision Tasks Talks about curricula desgined by non-experts (i.e. people who do not know much/anything about a given domain). Uses "dog training" game as the basis of their experiments. Users can design a curriculum by sequencing any number of tasks in more or less complex environments. A target (more difficult) task is also provided to the user but they are not allowed to include it in the curriculum. A trainer model is used to go through the curriculum and provide feedback to the agent. They measure how good a curriculum is by the number of feedbacks that the trainer has to give to the agent i.e. if a curriculum is well designed the agent will require a relatively smaller number of feedbacks from the trainer to move on to the next task. They use three different trainer behaviours and show that the type of the trainer oes not influence the impact of curriuclum design i.e. if a curriculum is well designed under one trainer it is also well designed under another trainer. Another condition for a curriculum to be considered good is that the number of feedbacks over the entire curriculum (with the target task) should be smaller than the number of feedbacks when training on the target task alone. Results show that non-experts can design a better-than-random curriculum when it comes to reducing number of feedbacks on the target task alone but are not better-than-random in desigining a curriculum that decreases the overall number of feedbacks. Language and Policy Learning from Human-delivered Feedback,

Learning behaviors via human-delivered discrete feedback
Proximal Policy Optimization Algorithms Not directly related to Curriculum Learning but related to Reinforcement Learning.
A Deep Hierarchical Approach to Lifelong Learning in Minecraft Haven't read it yet, not directly connected to CL but should still be helpful

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

academia_rl-0.1.0.tar.gz (50.8 kB view details)

Uploaded Source

Built Distribution

academia_rl-0.1.0-py3-none-any.whl (74.1 kB view details)

Uploaded Python 3

File details

Details for the file academia_rl-0.1.0.tar.gz.

File metadata

  • Download URL: academia_rl-0.1.0.tar.gz
  • Upload date:
  • Size: 50.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.12

File hashes

Hashes for academia_rl-0.1.0.tar.gz
Algorithm Hash digest
SHA256 95b199c5ed3eba8fa02193ac3e774204ca86e5efc23c0a0a8252b767c67a17b0
MD5 fcfe754650ced2696004a61d06d6efc4
BLAKE2b-256 d5b569860a40b1aafb54f42f2bdecaa891c5f05fcb8945e7de9454817ed069f2

See more details on using hashes here.

File details

Details for the file academia_rl-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: academia_rl-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 74.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.12

File hashes

Hashes for academia_rl-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 0f412d5e1c04fd8fee52f9239a99ec6bd1a8f020f031eb9fd6417af8aacb38c9
MD5 981a1e8e3ac31b552b078d2c20f3e7d6
BLAKE2b-256 70c9b226032ae5afe8ba1ebb08210da952e0609f6655541627b8b5810b797582

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page