With the rapid advancement of tools based on Artificial Intelligence, it is interesting to assess their usefulness in requirements engineering. In early experiments, we have seen that ChatGPT can detect inconsistency defects in natural language (NL) requirements, that traditional NLP tools cannot identify or can identify with difficulties even after domain-focused training. This study is devoted to specifically measuring the performance of ChatGPT in finding inconsistency in requirements. Positive results in this respect could lead to the use of ChatGPT to complement existing requirements analysis tools to automatically detect this important quality criterion. For this purpose, we consider GPT-3.5, the Generative Pretrained Transformer language model developed by OpenAI. We evaluate its ability to detect inconsistency by comparing its predictions with those obtained from expert judgments by students with a proven knowledge of RE issues on a few example requirements documents.

Inconsistency Detection in Natural Language Requirements using ChatGPT: a Preliminary Evaluation / Fantechi, Alessandro; Gnesi, Stefania; Passaro, Lucia; Semini, Laura. - STAMPA. - (2023), pp. 335-340. (Intervento presentato al convegno 31st {IEEE} International Requirements Engineering Conference tenutosi a Hannover, Germany nel September 4-8, 2023) [10.1109/RE57278.2023.00045].

Inconsistency Detection in Natural Language Requirements using ChatGPT: a Preliminary Evaluation

Fantechi, Alessandro;Gnesi, Stefania;Semini, Laura
2023

Abstract

With the rapid advancement of tools based on Artificial Intelligence, it is interesting to assess their usefulness in requirements engineering. In early experiments, we have seen that ChatGPT can detect inconsistency defects in natural language (NL) requirements, that traditional NLP tools cannot identify or can identify with difficulties even after domain-focused training. This study is devoted to specifically measuring the performance of ChatGPT in finding inconsistency in requirements. Positive results in this respect could lead to the use of ChatGPT to complement existing requirements analysis tools to automatically detect this important quality criterion. For this purpose, we consider GPT-3.5, the Generative Pretrained Transformer language model developed by OpenAI. We evaluate its ability to detect inconsistency by comparing its predictions with those obtained from expert judgments by students with a proven knowledge of RE issues on a few example requirements documents.
2023
31st {IEEE} International Requirements Engineering Conference
31st {IEEE} International Requirements Engineering Conference
Hannover, Germany
September 4-8, 2023
Fantechi, Alessandro; Gnesi, Stefania; Passaro, Lucia; Semini, Laura
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificatore per citare o creare un link a questa risorsa: https://hdl.handle.net/2158/1346371
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact