ABSTRACT:
Search result diversification aims at
returning diversified document lists to cover different user intents of a
query. Existing diversity measures assume that the intents of a query are disjoint, and do not consider their relationships. In this
paper, we introduce intent hierarchies to model the relationships between
intents, and present four weighing schemes. Based on intent hierarchies, we
propose several hierarchical measures that take into account the relationships
between intents. We demonstrate the feasibility of hierarchical measures by
using a new test collection based on TREC Web Track 2009-2013 diversity test
collections and by using NTCIR-11 IMine test
collection. Our main experimental findings are: (1) Hierarchical measures are
more discriminative and intuitive than existing measures. In terms of
intuitiveness, it is preferable for hierarchical measures to use the whole
intent hierarchies than to use only the leaf nodes; (2) The
types of intent hierarchies used affect the discriminative power and intuitiveness
of hierarchical measures. We suggest the best type of intent hierarchies to be
used according to whether the nonuniform weights are
available; (3) To measure the benefits of the
diversification algorithms which use automatically mined hierarchical intents,
it is important to use hierarchical measures instead of existing measures.
SYSTEM REQUIREMENTS:
HARDWARE REQUIREMENTS:
·
System : Pentium Dual Core.
·
Hard Disk : 120 GB.
·
Monitor : 15’’ LED
·
Input Devices : Keyboard, Mouse
·
Ram : 1 GB
SOFTWARE REQUIREMENTS:
·
Operating system : Windows 7.
·
Coding Language : JAVA/J2EE
·
Tool : Netbeans 7.2.1
·
Database : MYSQL
REFERENCE:
Xiaojie Wang, Ji-Rong Wen, Zhicheng Dou, Tetsuya Sakai, and Rui
Zhang, “Search Result Diversity Evaluation based on Intent Hierarchies”, IEEE
Transactions on Knowledge and Data Engineering, 2018.