Abstract
Data mining technology is giving us the ability to extract meaningful patterns from large quantities of structured data. Information retrieval systems have made large quantities of textual data available. Extracting meaningful patterns from this data is difficult. Current tools for mining structured data are inappropriate for free text. We outline problems involved in Knowledge Discovery in Text, and present an architecture for extracting patterns that hold across multiple documents. The capabilities that such a system could provide are illustrated.
Note
The Twenty-Second Annual International Computer Software and Applications Conference
August 19-21, 1998 in Vienna, Austria
Also in IEEE Digital Library