STATISTICA









(add-on product)

See also Common Features of STATISTICA Enterprise Systems
See also Overview of STATISTICA Enterprise Systems



How can I Use STATISTICA Text Miner ?
  • Analyze the contents of Web pages. For example, users can automatically process and summarize all Web pages of particular companies, message boards, etc.

  • Include unstructured notes in predictive data mining projects. For example, users may include responses to open-ended interview questions, patients' own descriptions of medical symptoms, etc. in data mining projects involving the clustering of patients and symptoms.

  • Analyze large document repositories. For example, users may analyze repositories of documents such as narratives of insurance claims, etc., to include such information in fraud detection projects.

STATISTICA Text Miner is an optional extension of STATISTICA Data Miner, ideal for translating unstructured text data into meaningful, valuable clusters of decision-making "gold." As most users familiar with data mining already know, real-world data comes in a variety of forms, not always organized or easily ready to analyze. STATISTICA Text Miner digs for the underlying information not readily apparent in traditional structured data.

STATISTICA Text Miner is seamlessly integrated into STATISTICA or STATISTICA Data Miner and like other StatSoft products, features the most comprehensive and powerful tools on the market, implemented with uncompromising attention to efficiency and scalability, and employing multi-threaded computing technology to extract optimum performance from advanced multiple-processor server hardware.

As all components of STATISTICA Data Miner, STATISTICA Text Miner was specifically designed as a general and open-architecture tool for mining unstructured information. The feature extraction/selection and other analytic tools available in STATISTICA Text Miner are not only applicable to text documents or Web pages, but can also be used to index, classify, cluster, or otherwise include in your analyses unstructured information such as (pre-processed) bitmaps, sound files, etc.


Core functionality of STATISTICA Text Miner:

Accessing Documents
Processing Documents
Analyzing Documents



Integration with STATISTICA, STATISTICA Data Miner, and WebSTATISTICA

The text miner software is fully integrated into the STATISTICA line of software; it is not a stand-alone product manufactured by another vendor and somehow "connected" to STATISTICA! This makes this text mining solution unique in the market: By being fully integrated (and automated), the text mining functionality becomes "just-another-module", that can be integrated into the STATISTICA Data Miner workspace environment, WebSTATISTICA, or custom STATISTICA applications (via SVB; for example, users may automatically and routinely access files stored in a data warehouse, using IDP technology, to update certain analyses and numeric summaries of the textual information available in the warehouse; this could be done via WebSTATISTICA, so that the results of those analyses can be accessible to authorized users via the Web worldwide).

Back to Top
Request Quote
StatSoft Home Page



[StatSoft]
2300 East 14th Street, Tulsa, OK 74104
Phone: (918) 749-1119; Fax: (918) 749-2217

[StatSoft]e-mail: info@statsoft.com

©Copyright StatSoft, Inc., 1984-2004.
StatSoft, StatSoft logo, STATISTICA, SEWSS, SEDAS, Data Miner, SEPATH and GTrees are trademarks of StatSoft, Inc.