The TaxGen Framework: Automating the Generation of a Taxonomy for a Large Document Collection
The TaxGen Framework: Automating the Generation of a Taxonomy for a Large Document Collection

The TaxGen Framework: Automating the Generation of a Taxonomy for a Large Document Collection

Beitrag, Englisch, 10 Seiten, IEEE Computer Society

Autor: Prof. Adrian Müller, PMP

Erscheinungsdatum: 1999

ISBN: 0769500013

Quelle: Proceedings of the 32nd Hawaii International Conference on System Sciences

Seitenangabe: 2034-2044


Aufrufe gesamt: 657, letzte 30 Tage: 1

Kontakt

Verlag

IEEE Computer Society

Telefon: +1-202-371-0101

Telefax: +1-202-728-9614

Preis: k. A.

Kaufen
Text Mining is an active area of research and development, which combines and expands techniques found in related areas like information retrieval, computational linguistics, and data mining to perform an analysis of large corpora of digital documents. This paper describes the TaxGen Text Mining project carried out at the IBM Software Development Lab. at Boeblingen, Germany. The goal of TaxGen was the automatic generation of a taxonomy for a collection of previously unstructured documents, namely a set of 73.000 news wire documents spanning one year.

Prof. Adrian Müller, PMP

DE, Zweibrücken

Professor

Fachhochschule Kaiserslautern Standort Zweibrücken LG Information Retrieval und Agentensysteme

Publikationen: 6

Veranstaltungen: 1

Aufrufe seit 11/2004: 5944
Aufrufe letzte 30 Tage: 6