Traditional approaches to Named Entity Recognition (NER) frame the task into a BIO sequence labeling problem. Although these systems often excel in the downstream task at hand, they require extensive annotated data and struggle to generalize to out-of-distribution input domains and unseen entity types. On the contrary, Large Language Models (LLMs) have demonstrated strong zero-shot capabilities. While several works address Zero-Shot NER in English, little has been done in other languages. In this paper, we define an evaluation framework for Zero-Shot NER, applying it to the Italian language. Furthermore, we introduce SLIMER-IT, the Italian version of SLIMER, an instruction-tuning approach for zero-shot NER leveraging prompts enriched with definition and guidelines. Comparisons with other state-of-the-art models, demonstrate the superiority of SLIMER-IT on never-seen-before entity tags.

Zamai, A., Rigutini, L., Maggini, M., Zugarini, A. (2024). SLIMER-IT: Zero-Shot NER on Italian Language. In Proceedings of the Tenth Italian Conference on Computational Linguistics (CLiC-it 2024) [10.48550/arXiv.2409.15933].

SLIMER-IT: Zero-Shot NER on Italian Language

Andrew Zamai
Software
;
Leonardo Rigutini
Supervision
;
Marco Maggini
Project Administration
;
Andrea Zugarini
Supervision
2024-01-01

Abstract

Traditional approaches to Named Entity Recognition (NER) frame the task into a BIO sequence labeling problem. Although these systems often excel in the downstream task at hand, they require extensive annotated data and struggle to generalize to out-of-distribution input domains and unseen entity types. On the contrary, Large Language Models (LLMs) have demonstrated strong zero-shot capabilities. While several works address Zero-Shot NER in English, little has been done in other languages. In this paper, we define an evaluation framework for Zero-Shot NER, applying it to the Italian language. Furthermore, we introduce SLIMER-IT, the Italian version of SLIMER, an instruction-tuning approach for zero-shot NER leveraging prompts enriched with definition and guidelines. Comparisons with other state-of-the-art models, demonstrate the superiority of SLIMER-IT on never-seen-before entity tags.
2024
Zamai, A., Rigutini, L., Maggini, M., Zugarini, A. (2024). SLIMER-IT: Zero-Shot NER on Italian Language. In Proceedings of the Tenth Italian Conference on Computational Linguistics (CLiC-it 2024) [10.48550/arXiv.2409.15933].
File in questo prodotto:
File Dimensione Formato  
109_main_long.pdf

accesso aperto

Tipologia: PDF editoriale
Licenza: Creative commons
Dimensione 1.35 MB
Formato Adobe PDF
1.35 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11365/1282335