End-to-end Dependency Parsing via Auto-regressive Large Language Models

IRIS

This paper presents a straightforward application of Large Language Models (LLMs) for Dependency Parsing. The parsing process is approached as a sequence-to-sequence task, where a language model takes a sentence as input and generates a bracketed form, allowing for the deterministic derivation of the dependency graph. The experimental evaluation explores the feasibility of utilizing LLMs for this purpose, while also assessing the process’s sustainability with modest parameter sizes (training on a single GPU with limited resources) and investigating the impact of incorporating multilingual data during training. The results demonstrate that an end-to-end dependency parsing process can indeed be formulated using a task-agnostic architecture.

Hromei, C.d., Croce, D., Basili, R. (2023). End-to-end Dependency Parsing via Auto-regressive Large Language Models. In CLiC-it 2023: Italian Conference on Computational Linguistics: proceedings of the 9th Italian Conference on Computational Linguistics. CEUR-WS.

End-to-end Dependency Parsing via Auto-regressive Large Language Models

Hromei C. D.;Croce D.;Basili R.

2023-01-01

Abstract

This paper presents a straightforward application of Large Language Models (LLMs) for Dependency Parsing. The parsing process is approached as a sequence-to-sequence task, where a language model takes a sentence as input and generates a bracketed form, allowing for the deterministic derivation of the dependency graph. The experimental evaluation explores the feasibility of utilizing LLMs for this purpose, while also assessing the process’s sustainability with modest parameter sizes (training on a single GPU with limited resources) and investigating the impact of incorporating multilingual data during training. The results demonstrate that an end-to-end dependency parsing process can indeed be formulated using a task-agnostic architecture.

Scheda breve

Scheda completa

Scheda completa (DC)

	Nome del convegno
	
				9th Italian Conference on Computational Linguistics, CLiC-it 2023
			
	Luogo del convegno
	
				Ca' Foscari University, Venezia, Italia
			
	Anno del convegno
	
				2023
			
	Numero del convegno
	
				9
			
	Rilevanza del convegno
	
				Rilevanza internazionale
			
	Data di pubblicazione
	
				2023
			
	Settore disciplinare dell'intervento (valido fino a 24/06/2024)
	
				Settore INF/01
Settore ING-INF/05
			
	Lingua del contenuto
	
				English
			
	Tipologia
	
				Intervento a convegno
			
	Citazione
	
				Hromei, C.d., Croce, D., Basili, R. (2023). End-to-end Dependency Parsing via Auto-regressive Large Language Models. In CLiC-it 2023: Italian Conference on Computational Linguistics: proceedings of the 9th Italian Conference on Computational Linguistics. CEUR-WS.
			
	Tutti gli autori
	
						Hromei, Cd; Croce, D; Basili, R
					
	Appare nelle tipologie:
	
				02 - Intervento a convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2108/359276

Citazioni

ND

2

ND

social impact