Facilitating human intervention in coreference resolution with comparative entity summaries

Last update: 2015-06-25
URI: http://ws.nju.edu.cn/publications/xcq14
Authors: Danyun Xu, Gong Cheng, Yuzhong Qu
Type: Inproceedings
Published: 2014
Publisher: Springer
Pages: 535-549
inPublication: In Proc. of the 11th Extended Semantic Web Conference (ESWC)

A primary challenge in Web data integration is coreference resolution, namely identifying entity descriptions from different data sources that refer to the same real-world entity. Increasingly, solutions to coreference resolution have humans in the loop. For instance, many active learning, crowdsourcing, and pay-as-you-go approaches solicit user feedback to verify candidate coreferent entities computed by automatic methods. Whereas reducing the number of verification tasks is a major consideration for these approaches, very little attention has been paid to the efficiency of performing each individual verification task. To address this issue, instead of showing the entire, possibly lengthy descriptions of two entities for verification, we propose to extract and present a compact summary of them. We expect such length-limited comparative entity summaries to help human users verify more efficiently without significantly hurting the accuracy of their verification. Our approach exploits the common and different features of two entities that best help indicate (non-)coreference, and also considers diverse information on their identities. Experimental results show that verification is 2.7–2.9 times faster when using our comparative entity summaries, and that its accuracy is not notably affected.
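
The idea of a length-limited comparative summary can be illustrated with a minimal sketch. This is not the paper's method: the data model (a description as a mapping from property to a set of values), the function names, the caller-supplied `weights`, and the one-feature-per-property rule used as a stand-in for diversity are all assumptions made for illustration.

```python
from typing import Dict, List, Optional, Set, Tuple

Description = Dict[str, Set[str]]      # assumed model: property -> set of values
Feature = Tuple[str, str, str, str]    # (kind, property, value_a, value_b)


def candidate_features(a: Description, b: Description) -> List[Feature]:
    """List common and different features over properties both entities have."""
    features: List[Feature] = []
    for prop in sorted(a.keys() & b.keys()):
        shared = a[prop] & b[prop]
        if shared:
            # Same value on both sides: evidence for coreference.
            features.extend(("common", prop, v, v) for v in sorted(shared))
        elif a[prop] and b[prop]:
            # Disjoint, non-empty values: evidence against coreference.
            features.append(("different", prop, min(a[prop]), min(b[prop])))
    return features


def comparative_summary(
    a: Description,
    b: Description,
    budget: int,
    weights: Optional[Dict[str, float]] = None,
) -> List[Feature]:
    """Greedily pick at most `budget` features, one per property.

    `weights` is a hypothetical per-property importance score; properties
    without an entry default to 1.0.
    """
    weights = weights or {}
    # Stable sort: ties keep the alphabetical property order from above.
    ranked = sorted(candidate_features(a, b),
                    key=lambda f: -weights.get(f[1], 1.0))
    summary: List[Feature] = []
    seen_props: Set[str] = set()
    for feature in ranked:
        if len(summary) >= budget:
            break
        if feature[1] in seen_props:
            continue  # crude diversity: do not repeat a property
        summary.append(feature)
        seen_props.add(feature[1])
    return summary
```

Under these assumptions, a verifier would be shown only the returned features side by side, rather than the two full descriptions.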

