Summarizing vocabularies in the global semantic Web

Last update: 2010-04-19
Authors: Xiang Zhang , Gong Cheng , Weiyi Ge , Yuzhong Qu
Type: Article
Published: 2009
Publisher: Institute of Computing Technology, the Chinese Academy of Sciences
Volume: 24
Issue: 1
Pages: 165-174
inPublication: Journal of Computer Science and Technology

In the Semantic Web, vocabularies are defined and shared among knowledge workers to describe linked data for scientific, industrial or daily life usage. With the rapid growth of online vocabularies, there is an emergent need for approaches helping users understand vocabularies quickly. In this paper, we study the summarization of vocabularies to help users understand vocabularies. Vocabulary summarization is based on the structural analysis and pragmatics statistics in the global Semantic Web. Local Bipartite Model and Expanded Bipartite Model of a vocabulary are proposed to characterize the structure in a vocabulary and links between vocabularies. A structural importance for each RDF sentence in the vocabulary is assessed using link analysis. Meanwhile, pragmatics importance of each RDF sentence is assessed using the statistics of instantiation of its terms in the Semantic Web. Summaries are produced by extracting important RDF sentences in vocabularies under a re-ranking strategy. Preliminary experiments show that it is feasible to help users understand a vocabulary through its summary.

Download: file