Direct Optimal Control using TD(?) Mixtures of Experts

Chatwin, Chris; Paraskevopoulos, V; Heywood, M I

File(s) not publicly available

Direct Optimal Control using TD(?) Mixtures of Experts

journal contribution

posted on 2023-06-08, 00:37 authored by Chris ChatwinChris Chatwin, V Paraskevopoulos, M I Heywood

Real-time control of continuous valued plants using TD(lamda) reinforcement learning is detailed. This problem is significantly more dif icult then the case of a discrete control space as in bang-bang or Q-learning. The methodology employs a combination of Stochastic Real-Valued units, Mixtures of Experts and RBF partitioning To do so the significance of both Maximum-Likelihood and Square Error Cost functions are emphasised, as is provision for RBF co-variances during training. The resulting architecture is demonstrated on benchmark problems.

History

Publication status

Published

Journal

International Journal of Knowledge-Based Intelligent Engineering Systems

ISSN

1327-2314

Publisher

IOS Press

Publisher URL

http://web.cs.dal.ca/~mheywood/Pubs/IJKIES-2k1.pdf

Issue

2

Volume

5

Page range

83-91

Department affiliated with

Engineering and Design Publications

Full text available

No

Peer reviewed?

Yes

Legacy Posted Date

2012-02-06

Usage metrics

Keywords

Uncategorised value

Licence

Copyright not evaluated

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

File(s) not publicly available

Direct Optimal Control using TD(?) Mixtures of Experts

History

Publication status

Journal

ISSN

Publisher

Publisher URL

Issue

Volume

Page range

Department affiliated with

Full text available

Peer reviewed?

Legacy Posted Date

Usage metrics

Categories

Keywords

Licence

Exports