In this paper, the problem of exploring stochastic graphs is addressed. The definition of the entropy related to the a-priori unknown parameters (the lengths of the a-priori unknown links) leads to the formulation of the problem as a stochastic optimal control one. The application of exact Dynamic Programming suffers the so-called curse of dimensionality. To overcome this drawback, an approximate technique is proposed making use of Neuro-Dynamic Programming. Exploiting the concept of frontier, any approximate solution of the problem is shown to generate a "proper" policy.
Neuro-dynamic programming for the exploration of unknown graphs / M. Baglietto; G. Battistelli; L. Scardovi; R. Zoppoli. - STAMPA. - (2005), pp. 401-406. (Intervento presentato al convegno 16th IFAC World Congress, 2005 tenutosi a Prague, Czech Republic) [10.3182/20050703-6-CZ-1902.00928].
Neuro-dynamic programming for the exploration of unknown graphs
BATTISTELLI, GIORGIO;
2005
Abstract
In this paper, the problem of exploring stochastic graphs is addressed. The definition of the entropy related to the a-priori unknown parameters (the lengths of the a-priori unknown links) leads to the formulation of the problem as a stochastic optimal control one. The application of exact Dynamic Programming suffers the so-called curse of dimensionality. To overcome this drawback, an approximate technique is proposed making use of Neuro-Dynamic Programming. Exploiting the concept of frontier, any approximate solution of the problem is shown to generate a "proper" policy.I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.