Extracting Part-Whole Relations from Online Encyclopedia - Intelligent Information Processing VII (IIP 2014) Access content directly
Conference Papers Year : 2014

Extracting Part-Whole Relations from Online Encyclopedia

Abstract

Automatic discovery of part-whole relations is a fundamental problem in the area of information extraction. In this paper, we present an unsupervised approach to learning lexical patterns from online encyclopedia for extracting part-whole relations. The only input is a few part-whole instances. To tackle the term recognition problem, terms from the domain of the seeds are extracted taking use of the semantic information contained in the online encyclopedia. Instead of collecting sentences that contain relation instances from the seeds, we introduce a novel process to select sentences that may indicate part-whole relations. Patterns are produced from these sentences with terms replaced by Part and Whole tags. A similarity measurement based on a new edit distance is used and an algorithm is described to cluster similar patterns. We rank the pattern clusters according to their frequencies, and patterns from the top-k clusters are chosen to be applied to identify the new part-whole relations. Experimental results show that our method can extract abundant part-whole relations and achieve a preferable precision compared to the other state-of-the-art approaches.
Fichier principal
Vignette du fichier
978-3-662-44980-6_7_Chapter.pdf (469.67 Ko) Télécharger le fichier
Origin : Files produced by the author(s)
Loading...

Dates and versions

hal-01383317 , version 1 (18-10-2016)

Licence

Attribution

Identifiers

Cite

Fei Xia, Cungen Cao. Extracting Part-Whole Relations from Online Encyclopedia. 8th International Conference on Intelligent Information Processing (IIP), Oct 2014, Hangzhou, China. pp.57-66, ⟨10.1007/978-3-662-44980-6_7⟩. ⟨hal-01383317⟩
53 View
277 Download

Altmetric

Share

Gmail Facebook X LinkedIn More