Next: Horizon length
Up: Simulation Results
Previous: Dynamic Model Based Directly
Even though most of the information on the speed and the angular
velocity can still be inferred taking into account past
observations, in practice the problem of learning the dynamics of the
system becomes harder and relying on past observations increases the
reaction time. The model with hidden state could still perform the
swing-up with some success, but a model based directly on observations
could not handle the swing-up at all. This result was to be expected,
as the dynamic mapping (8) alone cannot adequately
describe the modified system.
Tapani Raiko
2005-05-23