Merging Brain-Computer Interface P300 speller datasets: Perspectives and pitfalls


Background
Over the last decades, the P300 Speller paradigm has been replicated in many experiments, and the collected data have been released into the public domain so that research groups, particularly those working in machine learning, can test and improve their algorithms and raise the performance of brain-computer interface (BCI) systems. Training data are needed to learn to identify brain activity, and the more training data are available, the better the algorithms will perform. Larger datasets are therefore highly desirable and could be obtained by merging datasets from different repositories. The main obstacle to such merging is that public datasets are released in a variety of file formats, because no standard way of sharing these data has been established. In addition, every dataset requires reading documents or scientific papers to retrieve relevant information, which prevents automated processing. In this study, we therefore adopted a single file format to demonstrate the importance of having a standard and to propose which information should be stored and why.

Methods
We describe the process used to convert a dozen P300 Speller datasets into the same file format and report the main problems encountered during conversion. All the datasets share the same 6 x 6 matrix of alphanumeric symbols (characters and numbers or symbols) and the same subset of acquired signals (8 EEG sensors at the same recording sites).

Results and discussion
Nearly a million stimuli were converted, corresponding to about 7000 spelled characters from 127 subjects. The converted stimuli represent the most extensive platform available for training and testing new algorithms on this specific paradigm, the P300 Speller. The platform could potentially allow exploring transfer learning procedures to reduce or eliminate the time needed to train a classifier, improving the performance and accuracy of such BCI systems.
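To make the idea of a single shared format concrete, the following is a minimal sketch of what a common stimulus record and a merge step might look like. It is not the file format adopted in the study: the record fields, the function names, and the 1..12 row/column stimulus coding are all assumptions made for illustration. Only the 6 x 6 matrix and the 8 shared EEG channels come from the abstract.

```python
from dataclasses import dataclass
from typing import Iterable, List, Sequence

N_CHANNELS = 8    # from the abstract: 8 EEG sensors shared by all datasets
MATRIX_SIZE = 6   # from the abstract: 6 x 6 symbol matrix


@dataclass(frozen=True)
class StimulusRecord:
    """One stimulus in a hypothetical common format (all field names are assumptions)."""
    dataset: str         # label of the source repository
    subject: str         # subject identifier, unique only within its dataset
    stimulus_code: int   # assumed coding: 1..6 = rows, 7..12 = columns
    is_target: bool      # whether the stimulus contained the attended symbol
    epoch: Sequence[Sequence[float]]  # one sequence of samples per channel


def validate(record: StimulusRecord) -> None:
    """Reject records that do not match the shared layout."""
    if len(record.epoch) != N_CHANNELS:
        raise ValueError(f"expected {N_CHANNELS} channels, got {len(record.epoch)}")
    if not 1 <= record.stimulus_code <= 2 * MATRIX_SIZE:
        raise ValueError(f"stimulus code {record.stimulus_code} out of range")


def merge_datasets(datasets: Iterable[Iterable[StimulusRecord]]) -> List[StimulusRecord]:
    """Concatenate already-converted datasets after validating every record."""
    merged: List[StimulusRecord] = []
    for records in datasets:
        for record in records:
            validate(record)
            merged.append(record)
    return merged


def count_subjects(records: Iterable[StimulusRecord]) -> int:
    """Count subjects, keyed by (dataset, subject) so identical IDs do not collide."""
    return len({(r.dataset, r.subject) for r in records})
```

Keying subjects by the pair (dataset, subject) reflects one practical pitfall of merging: different repositories may reuse the same subject labels, so an identifier is only meaningful together with its source.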

Bianchi, L., Ferrante, R., Hu, Y., Sahonero-Alvarez, G., Zenia, N. (2022). Merging Brain-Computer Interface P300 speller datasets: Perspectives and pitfalls. FRONTIERS IN NEUROERGONOMICS, 3 [10.3389/fnrgo.2022.1045653].


Bianchi, L (Writing – Review & Editing); Ferrante, R (Resources); Hu, YP; Sahonero-Alvarez, G; Zenia, NZ

2022-01-01



Publication date: 2022
Publication status: Published
Article DOI: https://dx.doi.org/10.3389/fnrgo.2022.1045653
Relevance: International
Type: Article
Referee: Anonymous experts
Disciplinary sector of the article (valid until 24/06/2024): INF/01; ING-INF/06; ING-INF/05
Content language: English
Keywords: FAIR principles; BCI; P300; speller; dataset; database
Alternative URL: https://www.frontiersin.org/articles/10.3389/fnrgo.2022.1045653/full
Citation: Bianchi, L., Ferrante, R., Hu, Y., Sahonero-Alvarez, G., Zenia, N. (2022). Merging Brain-Computer Interface P300 speller datasets: Perspectives and pitfalls. FRONTIERS IN NEUROERGONOMICS, 3 [10.3389/fnrgo.2022.1045653].
All authors: Bianchi, L; Ferrante, R; Hu, Y; Sahonero-Alvarez, G; Zenia, N
Typology: Journal article
Appears in types: 01 - Journal article

Files in this item:

File	Size	Format
fnrgo-03-1045653 (4).pdf (open access; type: publisher's version (PDF); license: Creative Commons)	1.16 MB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2108/345968
