Voice of customer extraction from product reviews: a benchmark of RAG variants

IRIS

This study evaluates the effectiveness of Generative Artificial Intelligence (G-AI) models enhanced with Retrieval-Augmented Generation (RAG) for automating Voice of Customer (VOC) creation. Four generative AI architectures were compared using product reviews as the dataset: (1) a baseline large language model without retrieval, (2) a RAG model with feature labeling, (3) Self-RAG with feature labeling and (4) Sentiment-Aware Self-RAG with feature labeling. The models were evaluated across six dimensions: requirement-to-Critical to Quality (CTQ) coherence, CTQ measurability, description representativeness, topic coverage, desiderata-to-requirement consistency and overall performance. The Sentiment-Aware Self-RAG model and the Self-RAG model with structured feature labeling performed best at generating consistent and comprehensive VOC insights. Sentiment-Aware Self-RAG is a novel retrieval-augmented strategy that incorporates both semantic similarity and sentiment signals, enabling more context-sensitive generation of VOC insights. The results highlight the potential of sentiment-aware and feature-driven retrieval strategies to improve both the consistency and the depth of VOC generation, providing a more robust foundation for product innovation and customer-centric decision-making. By bridging methods from informatics and marketing, the paper contributes to the development of Artificial Intelligence (AI)-driven approaches that improve the translation of customer voices into actionable product requirements.
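The abstract does not say how semantic similarity and sentiment signals are combined during retrieval. As an illustration only, the following minimal Python sketch ranks review snippets by a weighted sum of cosine similarity and sentiment agreement. The function names, the `alpha` weight, the toy two-dimensional embeddings and the sentiment scores in [-1, 1] are all assumptions made for this example and are not taken from the paper.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def sentiment_aware_rank(query_vec, target_sentiment, reviews, alpha=0.7):
    """Rank reviews by alpha * similarity + (1 - alpha) * sentiment agreement.

    Hypothetical scoring rule, not the paper's method. Each review is a dict
    with 'text', 'vec' (embedding) and 'sentiment' (score in [-1, 1]).
    """
    scored = []
    for review in reviews:
        similarity = cosine(query_vec, review["vec"])
        # Agreement is 1.0 when sentiments match and 0.0 at opposite extremes.
        agreement = 1.0 - abs(target_sentiment - review["sentiment"]) / 2.0
        score = alpha * similarity + (1.0 - alpha) * agreement
        scored.append((score, review["text"]))
    # Highest combined score first.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored


# Toy data: hand-written embeddings and sentiment scores (assumed values).
reviews = [
    {"text": "Battery dies within two hours", "vec": [0.9, 0.1], "sentiment": -0.8},
    {"text": "Battery lasts all day", "vec": [0.9, 0.1], "sentiment": 0.9},
    {"text": "Screen is too dim outdoors", "vec": [0.1, 0.9], "sentiment": -0.6},
]

# Query about the battery, looking for negative feedback (complaints).
ranked = sentiment_aware_rank([1.0, 0.0], -1.0, reviews)
for score, text in ranked:
    print(f"{score:.3f}  {text}")
```

In this toy setup the two battery reviews are equally similar to the query, so the sentiment term alone decides which of them is retrieved first when the query targets complaints.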

Fiocco, E., Proietti, S., Cesarotti, V. (2026). Voice of customer extraction from product reviews: a benchmark of RAG variants. Paper presented at ICMarkTech, Valencia.

Emanuele Fiocco;Serena Proietti;Vittorio Cesarotti

2026-01-01

Conference name: ICMarkTech
Conference location: Valencia
Conference year: 2025
Conference relevance: International
Publication date: 2026
Disciplinary sector of the contribution (valid from 09/05/2024): Sector IIND-05/A - Impianti industriali meccanici (industrial mechanical plants); Sector IEGE-01/A - Ingegneria economico-gestionale (business and management engineering)
Content language: English
Type: Conference contribution
Citation: Fiocco, E., Proietti, S., Cesarotti, V. (2026). Voice of customer extraction from product reviews: a benchmark of RAG variants. Paper presented at ICMarkTech, Valencia.
All authors: Fiocco, E; Proietti, S; Cesarotti, V
Appears in types: 02 - Conference contribution

Files in this record:

There are no files associated with this record.

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2108/451687
