Bianchi, L., Ferrante, R., Hu, Y., Sahonero-Alvarez, G., Zenia, N. (2022). Merging Brain-Computer Interface P300 speller datasets: Perspectives and pitfalls. FRONTIERS IN NEUROERGONOMICS, 3 [10.3389/fnrgo.2022.1045653].

Merging Brain-Computer Interface P300 speller datasets: Perspectives and pitfalls

Bianchi, L: Writing – Review & Editing
2022-01-01

Abstract

Background: In recent decades, the P300 Speller paradigm has been replicated in many experiments, and the collected data have been released to the public domain so that research groups, particularly those in machine learning, can test and improve their algorithms and raise the performance of brain-computer interface (BCI) systems. Training data are needed to learn to identify brain activity, and the more training data are available, the better the algorithms generally perform. Larger datasets are therefore highly desirable and could be obtained by merging datasets from different repositories. The main obstacle to such merging is that public datasets are released in a variety of file formats, because no standard way of sharing these data has been established. In addition, every dataset requires reading documents or scientific papers to retrieve relevant information, which prevents automated processing. In this study, we therefore adopted a single file format to demonstrate the importance of having a standard and to propose which information should be stored and why.

Methods: We describe our process for converting a dozen P300 Speller datasets into the same file format and report the main problems encountered during conversion. All the datasets share the same 6 x 6 matrix of alphanumeric symbols (characters and numbers or symbols) and the same subset of acquired signals (8 EEG sensors at the same recording sites).

Results and discussion: Nearly a million stimuli were converted, corresponding to about 7000 spelled characters from 127 subjects. The converted stimuli represent the most extensive platform available for training and testing new algorithms on this specific paradigm, the P300 Speller. The platform could potentially allow exploring transfer learning procedures to reduce or eliminate the time needed to train a classifier and to improve the performance and accuracy of such BCI systems.
2022
Published
International relevance
Article
Anonymous reviewers
Sector INF/01
Sector ING-INF/06
Sector ING-INF/05
English
fair principles
BCI
P300
speller
dataset
database
https://www.frontiersin.org/articles/10.3389/fnrgo.2022.1045653/full
Bianchi, L; Ferrante, R; Hu, Y; Sahonero-Alvarez, G; Zenia, N
Journal article
Files in this item:
File: fnrgo-03-1045653 (4).pdf
Access: open access
Type: Publisher's version (PDF)
License: Creative Commons
Size: 1.16 MB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2108/345968