AI Unreliable Answers: A Case Study on ChatGPT

Amaro, I.; Della Greca, A.; Francese, R.; Tortora, G.; Tucci, C.

doi:10.1007/978-3-031-35894-4_2

ChatGPT is a general domain chatbot which is object of great attention stimulating all the world discussions on the power and the consequences of the Artificial Intelligence diffusion in all the field, ranging from education, research, music to software development, health care, cultural heritage, and entertainment. In this paper, we try to investigate whether and when the answers provided by ChatGPT are unreliable and how this is perceived by expert users, such as Computer Science students. To this aim, we first analyze the reliability of the answers provided by ChatGPT by experimenting its narrative, problem solving, searching, and logic capabilities and report examples of answers. Then, we conducted a user study in which 15 participants that already knew the chatbot proposed a set of predetermined queries generating both correct and incorrect answers and then we collected their satisfaction. Results revealed that even if the present version of ChatGPT sometimes is unreliable, people still plan to use it. Thus, it is recommended to use the present version of ChatGPT always with the support of human verification and interpretation.

AI Unreliable Answers: A Case Study on ChatGPT

Amaro I.;Della Greca A.;Francese R.;Tortora G.;Tucci C.

2023

Abstract

ChatGPT is a general domain chatbot which is object of great attention stimulating all the world discussions on the power and the consequences of the Artificial Intelligence diffusion in all the field, ranging from education, research, music to software development, health care, cultural heritage, and entertainment. In this paper, we try to investigate whether and when the answers provided by ChatGPT are unreliable and how this is perceived by expert users, such as Computer Science students. To this aim, we first analyze the reliability of the answers provided by ChatGPT by experimenting its narrative, problem solving, searching, and logic capabilities and report examples of answers. Then, we conducted a user study in which 15 participants that already knew the chatbot proposed a set of predetermined queries generating both correct and incorrect answers and then we collected their satisfaction. Results revealed that even if the present version of ChatGPT sometimes is unreliable, people still plan to use it. Thus, it is recommended to use the present version of ChatGPT always with the support of human verification and interpretation.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2023
			
	ISBN
	
				978-3-031-35893-7
978-3-031-35894-4
			
	Appare nelle tipologie:
	
				4.1.2 Proceedings con ISBN

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11386/4854105

Citazioni

ND

44

27

UniSa - IRIS Institutional Research Information System

AI Unreliable Answers: A Case Study on ChatGPT

Amaro I.;Della Greca A.;Francese R.;Tortora G.;Tucci C.

2023

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

Citazioni

social impact

UniSa - IRIS Institutional Research Information System

AI Unreliable Answers: A Case Study on ChatGPT

Amaro I.;Della Greca A.;Francese R.;Tortora G.;Tucci C.

2023

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Informazioni

Citazioni

social impact

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)