The Center for Education and Research in Information Assurance and Security (CERIAS)

The Center for Education and Research in
Information Assurance and Security (CERIAS)

Data mining on text

Download

Download PDF Document
PDF

Author

Christopher Clifton

Tech report number

CERIAS TR 2001-90

Entry type

conference

Abstract

Data mining technology is giving us the ability to extract meaningful patterns from large quantities of structured data. Information retrieval systems have made large quantities of textual data available. Extracting meaningful patterns from this data is difficult. Current tools for mining structured data are inappropriate for free text. We outline problems involved in Knowledge Discovery in Text, and present an architecture for extracting patterns that hold across multiple documents. The capabilities that such a system could provide are illustrated.

Download

PDF

Date

1998 – 08

Key alpha

Clifton

Note

The Twenty-Second Annual International Computer Software and Applications Conference August 19-21, 1998 in Vienna, Austria Also in IEEE Digital Library

Publication Date

2001-08-01

BibTex-formatted data

To refer to this entry, you may select and copy the text below and paste it into your BibTex document. Note that the text may not contain all macros that BibTex supports.