XHITS: ESTENDENDO O ALGORITMO HITS PARA EXTRAÇÃO DE TÓPICOS NA WWW / XHITS: EXTENDING THE HITS ALGORITHM FOR DISTILLATION OF BROAD SEARCH TOPIC ON WWW

AUTOR(ES)
DATA DE PUBLICAÇÃO

2005

RESUMO

The network structure of a hyperlinked environment can be a rich source of information about the content of this environment. Jon Kleinberg developed a set of algorithms, called HITS (Hyperlink Induced Topic Search), for extracting information from the hyperlink structures of those environments. The aim of these algorithms is the distillation of broad search topics, through the discovery of related authoritative information sources. The notion of authority is based on the hyperlink structure relationship between a set of relevant authoritative pages and the set of hubs. Thus, hubs and authorities exhibit what could be called a mutually reinforcing relationship: a good hub is a page that points to many good authorities; a good authority is a page that is pointed by many good hubs. In this work, we present the XHITS (Extended Hyperlink Induced Topic Search) algorithm, an extension of the HITS algorithm by introducing new concepts on the mutually reinforcing relationship. In XHITS, a good authority is a page that is pointed by many good hubs, some good portals and points to good novels; a good hub is a page that points to many good authorities, some good novels and is pointed by some good portals; and a good novel is a page that is pointed by good authorities, some good hubs and some good portals; a good portal is a page that points to some good authorities, some good hubs and some good novels. In addition, we show that XHITS converges and, through some experiments, the improved quality of the hyper documents retrieved.

ASSUNTO(S)

authority link analysis hits hubs xhits autoridade analise de hyperlinks hits hubs xhits

Documentos Relacionados