A Comparative Assessment of State-Of-The-Art Methods for Multilingual Unsupervised Keyphrase Extraction - Artificial Intelligence Applications and Innovations Access content directly
Conference Papers Year : 2021

A Comparative Assessment of State-Of-The-Art Methods for Multilingual Unsupervised Keyphrase Extraction

Nikolaos Giarelis
  • Function : Author
  • PersonId : 1105446
Nikos Kanakaris
  • Function : Author
  • PersonId : 1105447
Nikos Karacapilidis
  • Function : Author
  • PersonId : 1033582

Abstract

Keyphrase extraction is a fundamental task in information management, which is often used as a preliminary step in various information retrieval and natural language processing tasks. The main contribution of this paper lies in providing a comparative assessment of prominent multilingual unsupervised keyphrase extraction methods that build on statistical (RAKE, YAKE), graph-based (TextRank, SingleRank) and deep learning (KeyBERT) methods. For the experimentations reported in this paper, we employ well-known datasets designed for keyphrase extraction from five different natural languages (English, French, Spanish, Portuguese and Polish). We use the F1 score and a partial match evaluation framework, aiming to investigate whether the number of terms of the documents and the language of each dataset affect the accuracy of the selected methods. Our experimental results reveal a set of insights about the suitability of the selected methods in texts of different sizes, as well as the performance of these methods in datasets of different languages.
Embargoed file
Embargoed file
0 7 0
Year Month Jours
Avant la publication

Dates and versions

hal-03287681 , version 1 (15-07-2021)

Licence

Attribution - CC BY 4.0

Identifiers

Cite

Nikolaos Giarelis, Nikos Kanakaris, Nikos Karacapilidis. A Comparative Assessment of State-Of-The-Art Methods for Multilingual Unsupervised Keyphrase Extraction. 17th IFIP International Conference on Artificial Intelligence Applications and Innovations (AIAI), Jun 2021, Hersonissos, Crete, Greece. pp.635-645, ⟨10.1007/978-3-030-79150-6_50⟩. ⟨hal-03287681⟩
164 View
25 Download

Altmetric

Share

Gmail Facebook Twitter LinkedIn More