Similarity Evaluation with Wikipedia Features

Shahbaz Wasti; Jawad Hussain; Guangjiang Huang; Yuncheng Jiang

doi:10.1007/978-3-030-46931-3_10

Conference Papers Year : 2020

Similarity Evaluation with Wikipedia Features

(1) , (1) , (1) , (1)

Shahbaz Wasti

Function : Author
PersonId : 1118807

South China Normal University [Guangdong, China] = Université normale de Chine du Sud [Canton, Chine] = 華南師范大學

Jawad Hussain

Function : Author

South China Normal University [Guangdong, China] = Université normale de Chine du Sud [Canton, Chine] = 華南師范大學

Guangjiang Huang

Function : Author

South China Normal University [Guangdong, China] = Université normale de Chine du Sud [Canton, Chine] = 華南師范大學

Yuncheng Jiang

Function : Author
PersonId : 1118808

South China Normal University [Guangdong, China] = Université normale de Chine du Sud [Canton, Chine] = 華南師范大學

Abstract

Wikipedia provides rich semantic features e.g., text, link, and category structure. These features can be used to compute semantic similarity (SS) between words or concepts. However, some existing Wikipedia-based SS methods either rely on a single feature or do not incorporate the underlying statistics of different features. We propose novel vector representations of Wikipedia concepts by integrating their multiple semantic features. We utilize the available statistics of these features in Wikipedia to compute their weights. These weights signify the contribution of each feature in similarity evaluation according to its level of importance. The experimental evaluation shows that our new methods obtain better results on SS datasets in comparison with state-of-the-art SS methods.

Keywords

Semantic similarity IC tfidf Vector representation

Domains

Computer Science [cs]

Fichier principal

498234_1_En_10_Chapter.pdf (348.44 Ko)

Origin	Files produced by the author(s)

Hal Ifip : Connect in order to contact the contributor

https://inria.hal.science/hal-03456962

Submitted on : Tuesday, November 30, 2021-12:33:30 PM

Last modification on : Wednesday, October 2, 2024-11:38:04 PM

Long-term archiving on : Tuesday, March 1, 2022-6:55:59 PM

Dates and versions

hal-03456962 , version 1 (30-11-2021)

Licence

Attribution

Identifiers

HAL Id : hal-03456962 , version 1
DOI : 10.1007/978-3-030-46931-3_10

Cite

Shahbaz Wasti, Jawad Hussain, Guangjiang Huang, Yuncheng Jiang. Similarity Evaluation with Wikipedia Features. 11th International Conference on Intelligent Information Processing (IIP), Jul 2020, Hangzhou, China. pp.99-104, ⟨10.1007/978-3-030-46931-3_10⟩. ⟨hal-03456962⟩

Export

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

IFIP IFIP-AICT IFIP-TC IFIP-TC12 IFIP-AICT-581

49 View

38 Download

Similarity Evaluation with Wikipedia Features

Abstract

Keywords

Domains

Dates and versions

Licence

Identifiers

Cite

Export

Collections

Altmetric

Share