Addressing Limitations of State-Aware Imitation Learning for Autonomous Driving / Cultrera, Luca; Becattini, Federico; Seidenari, Lorenzo; Pala, Pietro; Bimbo, Alberto Del. - In: IEEE Transactions on Intelligent Vehicles. - ISSN 2379-8904. - Electronic. - (2023), pp. 1-10. [10.1109/TIV.2023.3336063]

Addressing Limitations of State-Aware Imitation Learning for Autonomous Driving

Cultrera, Luca;Becattini, Federico;Seidenari, Lorenzo;Pala, Pietro;Bimbo, Alberto Del
2023

Abstract

Conditional imitation learning is a common and effective approach to training autonomous driving agents. However, two issues limit the full potential of this approach: (i) the inertia problem, a special case of causal confusion in which the agent mistakenly correlates low speed with no acceleration, and (ii) low correlation between offline and online performance, caused by the accumulation of small errors that brings the agent into a previously unseen state. Both issues are critical for state-aware models, yet informing the driving agent of its internal state as well as the state of the environment is of crucial importance. In this paper, we propose a multi-task learning agent based on a multi-stage vision transformer with state token propagation. We feed the state of the vehicle, along with the representation of the environment, as a special token of the transformer and propagate it throughout the network. This allows us to tackle the aforementioned issues from different angles: guiding the driving policy with learned stop/go information, performing data augmentation directly on the state of the vehicle, and visually explaining the model's decisions. We report a drastic decrease in inertia and a high correlation between offline and online metrics.
Files in this record:

File: Addressing_Limitations_of_State-Aware_Imitation_Learning_for_Autonomous_Driving.pdf
Access: open access
Type: Publisher's PDF (Version of record)
License: Open Access
Size: 4.41 MB
Format: Adobe PDF

Documents in FLORE are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this resource: https://hdl.handle.net/2158/1345451