Learning tabula rasa can be unnecessarily slow







  • Humans can reuse past experience

    • e.g., soccer with different numbers of players
    • even across tasks with different state variables and actions
  • Agents should likewise leverage learned knowledge in novel or modified tasks



Model-Free vs. Model-Based

  • Model-Free

    • Q-Learning, Sarsa, etc.
    • Learn the values of actions
    • In the example: ~256 actions
  • Model-Based

    • Dyna-Q, R-Max, etc.
    • Learn the effects of actions (“what is the next state?” → planning)
    • In the example: ~36 actions
  • The two update styles are contrasted in the sketch below
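A minimal tabular sketch of the contrast, not the paper's code: all names here (ACTIONS, Q, model) are illustrative assumptions, and the model-based update is simplified to a deterministic world.

```python
from collections import defaultdict

ACTIONS = ["Left", "Neutral", "Right"]   # e.g., 2D Mountain Car actions
Q = defaultdict(float)                   # model-free: one value per (state, action)
model = {}                               # model-based: observed effect per (state, action)

def q_learning_update(s, a, r, s_next, alpha=0.1, gamma=0.99):
    """Model-free: adjust the value of the action just taken."""
    best_next = max(Q[(s_next, b)] for b in ACTIONS)
    Q[(s, a)] += alpha * (r + gamma * best_next - Q[(s, a)])

def model_update(s, a, r, s_next):
    """Model-based: record what the action did; values are obtained later by
    planning over `model` (e.g., value iteration or Dyna-Q style backups)."""
    model[(s, a)] = (r, s_next)          # deterministic-world simplification
```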


Transferring Instances for Model-Based REinforcement Learning (TIMBREL)

  • Transfer between

    • Model-learning RL algorithms
    • Tasks with different state variables and actions
    • Continuous state spaces
  • In this paper, we use: Fitted R-MAX (the instance-transfer step itself is sketched below)
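Since the mappings χ_X and χ_A (defined on the next slide) run from target to source, translating saved source instances into the target task uses their inverses. In this rough sketch, `chi_X_inv` and `chi_A_inv` are hypothetical helpers standing in for those inverses, and `chi_X_inv` is simplified to map a whole state at once rather than one variable at a time.

```python
def transfer_instances(source_instances, chi_X_inv, chi_A_inv):
    """Translate recorded source transitions (s, a, r, s') into synthetic
    target-task instances that can seed the target model before any
    target-task data has been gathered."""
    target_instances = []
    for s, a, r, s_next in source_instances:
        # chi_A is many-to-one, so one source action can correspond to
        # several target actions; emit one synthetic instance for each.
        for a_target in chi_A_inv(a):
            target_instances.append(
                (chi_X_inv(s), a_target, r, chi_X_inv(s_next)))
    return target_instances
```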





Inter-Task Mappings

  • χ_X: s_target → s_source

    • Given a state variable in the target task (some x from s = x_1, x_2, …, x_n)
    • Returns the corresponding state variable in the source task
  • χ_A: a_target → a_source

    • Similar, but for actions
  • Intuitive mappings exist in some domains (provided by an oracle)

  • Mappings can also be learned (e.g., Taylor, Kuhlmann, and Stone, 2008)



Mountain Car

  • 2D Mountain Car (source)

    • State variables: x, ẋ
    • Actions: Left, Neutral, Right
  • 3D Mountain Car (target)

    • State variables: x, y, ẋ, ẏ
    • Actions: Neutral, West, East, South, North
  • χ_X

    • x, y → x
    • ẋ, ẏ → ẋ
  • χ_A

    • Neutral → Neutral
    • West, South → Left
    • East, North → Right
  • These mappings are written out in the sketch below
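Written out as code, the mappings above are plain lookup tables; the variable spellings (`x_dot` for ẋ) and the sample numbers are illustrative only.

```python
chi_X = {"x": "x", "y": "x", "x_dot": "x_dot", "y_dot": "x_dot"}
chi_A = {"Neutral": "Neutral",
         "West": "Left", "South": "Left",
         "East": "Right", "North": "Right"}

# chi_X is many-to-one, so a single 3D (target) state projects onto a
# 2D (source) state once per axis:
s3d = {"x": -0.5, "y": -0.3, "x_dot": 0.01, "y_dot": -0.02}
s2d_via_x = {"x": s3d["x"], "x_dot": s3d["x_dot"]}   # x-axis view
s2d_via_y = {"x": s3d["y"], "x_dot": s3d["y_dot"]}   # y-axis view
a2d = chi_A["West"]                                  # -> "Left"
```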










Fitted R-MAX balances:

  • Sample complexity
  • Computational complexity
  • Asymptotic performance
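A rough sketch of the idea at the core of Fitted R-MAX, assuming an instance-based nearest-neighbour model; the thresholds and names below are illustrative, not the algorithm's exact formulation. State-actions with too few nearby instances are treated as "unknown" and given the optimistic value R_MAX, which is what drives systematic exploration.

```python
import math

R_MAX = 1.0      # assumed upper bound on one-step reward
K = 5            # instances required before a state-action is "known"
BANDWIDTH = 0.1  # radius within which stored instances count as neighbours

def predict(instances, s, a):
    """Average the outcomes of stored instances near (s, a); where data is
    too sparse, fall back to optimism so planning seeks out unknown regions."""
    near = [(r, s_next) for (s_i, a_i, r, s_next) in instances
            if a_i == a and math.dist(s_i, s) < BANDWIDTH]
    if len(near) < K:                     # "unknown" region: be optimistic
        return R_MAX, []
    mean_r = sum(r for r, _ in near) / len(near)
    return mean_r, [s2 for _, s2 in near]
```

Transferred source instances (previous sketch) simply enter `instances` alongside target-task data, shrinking the "unknown" region from the start.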










Related Work

  • Instance Transfer in Fitted Q Iteration

    • Lazaric et al., 2008
  • Transferring a Regression Model of the Transition Function

    • Atkeson and Santamaria, 1997
  • Ordering Prioritized Sweeping via Transfer

    • Sunmola and Wyatt, 2006
  • Bayesian Model Transfer

    • Tanaka and Yamamura, 2003
    • Wilson et al., 2007


Future Work

  • Implement with other model-learning methods

    • Dyna-Q
    • R-MAX
    • Fitted Q Iteration
  • Guard against the U-shaped curve in Fitted R-MAX?

  • Examine more complex tasks

    • Can TIMBREL improve performance on real-world problems?


TIMBREL significantly increases the speed of learning

  • Results suggest less data is needed to learn with transfer than without

  • Transfer performance depends on:

    • Similarity of the source and target tasks
    • Amount of source-task data collected


Model-Free:

  • Value Function [Taylor, Liu, & Stone, JMLR-07]
  • Policy [Taylor, Whiteson, & Stone, AAMAS-07]
  • Rules [Taylor & Stone, ICML-07]

Full Model?


