Document Clustering
Mostrando 1-8 de 8 artigos, teses e dissertações.
-
1. Representação de coleções de documentos textuais por meio de regras de associação / Representation of textual document collections through association rules
O número de documentos textuais disponíveis em formato digital tem aumentado incessantemente. Técnicas de Mineração de Textos são cada vez mais utilizadas para organizar e extrair conhecimento de grandes coleções de documentos textuais. Para o uso dessas técnicas é necessário que os documentos textuais estejam representados em um formato apropriad
IBICT - Instituto Brasileiro de Informação em Ciência e Tecnologia. Publicado em: 16/08/2011
-
2. Aprendizado não supervisionado de hierarquias de tópicos a partir de coleções textuais dinâmicas / Unsupervised learning of topic hierarchies from dynamic text collections
The need to extract new and useful knowledge from large textual collections has motivated researchs on Text Mining methods. Among the existing methods, initiatives for the knowledge organization by topic hierarchies are very popular. In the topic hierarchies, the knowledge is represented by topics and subtopics, and each topic contains documents of similar c
IBICT - Instituto Brasileiro de Informação em Ciência e Tecnologia. Publicado em: 19/05/2011
-
3. Sistemas baseados em mapas auto-organizÃveis para organizaÃÃo automÃtica de documentos texto
This work proposes and evaluates hybrid systems for automatic text document organization based on Self-Organizing Maps (SOM). The aim is to design a system that combines SOM with other clustering algorithms, in order to generate document maps for large text document collections of good quality at a low computational cost. The posprocessing of a neural networ
Publicado em: 2008
-
4. Analysis of the Clustering Algorithms for the Databases / Análise de Algoritmos de Agrupamento para Base de Dados Textuais
The increasing amount of digitally stored texts makes necessary the development of computational tools to allow the access of information and knowledge in an efficient and efficacious manner. This problem is extremely relevant in biomedicine research, since most of the generated knowledge is translated into scientific articles and it is necessary to have the
Publicado em: 2008
-
5. MINERAÇÃO DE TEXTOS NA COLETA INTELIGENTE DE DADOS NA WEB / TEXT MINING AT THE INTELLIGENT WEB CRAWLING PROCESS
This dissertation presents a study about the application of Text Mining as part of the intelligent Web crawling process. The most usual way of gathering data in Web consists of the utilization of web crawlers. Web crawlers are softwares that, once provided with an initial set of URLs (seeds), start the methodical proceeding of visiting a site, store it in di
Publicado em: 2008
-
6. Uso de sintagmas nominais na classificação automática de documentos eletrônicos
This research work presents a proposal for the classification of electronic documents using techniques and algorithms based on natural language processing and noun phrases indexing along with plain keywords. Two tools, OGMA and Weka, were used for the experiments proposed. OGMA was developed by the author to automate the extraction of noun phrases and to per
Publicado em: 2008
-
7. Interactions of platelet integrin αΙΙb and β3 transmembrane domains in mammalian cell membranes and their role in integrin activation
Clustering and occupancy of platelet integrin αIIbβ3 (GPIIb-IIIa) generate biologically important signals: conversely, intracellular signals increase the integrins' affinity, leading to integrin activation; both forms of integrin signaling play important roles in hemostasis and thrombosis. Indirect evidence implicates interactions between integrin α and �
American Society of Hematology.
-
8. Organization of the Micronuclear Genome of Oxytricha Nova
In the hypotrichous ciliated protozoan Oxytricha nova, approximately 95% of the micronuclear genome, including all of the repetitive DNA and most of the unique sequence DNA, is eliminated during the formation of the macronuclear genome. We have examined the interspersion patterns of repetitive and unique and eliminated and retained sequences in the micronucl