System Analysis and Design for Document Classification

DOI: 10.4018/978-1-7998-3772-5.ch008

Abstract

Text mining starts with a keyword search over text collections. Current text-processing technology supports searches that go beyond simple Boolean queries by accepting natural-language queries. Search engines can recognize thousands of keywords and phrases, but not the concepts behind the text, so researchers need to build an automatic keyword extractor that generates a “Keyword List” for each document. This list can later serve as the knowledge base for assigning unorganized documents to meaningful classes. If the keywords for a concept are not identified, the values or data for that concept will be missing.
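
The pipeline described in the abstract can be illustrated with a minimal sketch. It is not the chapter's implementation: the frequency-based extractor, the stop-word list, the `CLASS_KEYWORDS` mapping, and the function names `extract_keywords` and `classify` are all assumptions made for illustration.

```python
import re
from collections import Counter

# Hypothetical stop-word list; a real system would use a fuller one.
STOP_WORDS = {"the", "a", "an", "of", "and", "to", "in", "is", "for", "on", "with"}

def extract_keywords(text, top_n=5):
    """Return the top_n most frequent non-stop-word terms (the "Keyword List")."""
    tokens = re.findall(r"[a-z]+", text.lower())
    counts = Counter(t for t in tokens if t not in STOP_WORDS and len(t) > 2)
    # Sort by descending frequency, then alphabetically, so ties are stable.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:top_n]]

# Hypothetical knowledge base: class label -> keywords that signal the concept.
CLASS_KEYWORDS = {
    "finance": {"bank", "loan", "interest", "credit"},
    "sports": {"match", "team", "score", "league"},
}

def classify(keyword_list):
    """Return the class with the largest keyword overlap, or None if none match."""
    best_class, best_overlap = None, 0
    for label, concept_terms in CLASS_KEYWORDS.items():
        overlap = len(concept_terms & set(keyword_list))
        if overlap > best_overlap:
            best_class, best_overlap = label, overlap
    return best_class

doc = "The bank raised the interest rate on every loan. The loan terms changed."
keywords = extract_keywords(doc)
label = classify(keywords)
print(keywords, label)
```

When no extracted keyword matches a class, `classify` returns `None`, which corresponds to the missing-value case the abstract warns about.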

Data Flow Diagram For Document Classifier

Figure 1. Context level (Level 0) of data flow diagram

Figure 2. Level 1 of data flow diagram

Entity Relationship Diagram Of Document Classifier

Figure 3. Entity, relationship and cardinality of document classifier

Database Design Of Document Classifier

Table 1. Document

Sr. No.     Attribute Name     Data Type     Size
1           doc_id (p)         Text          10
2           doc_content        Memo
3           doc_path           Text          100
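
As a rough illustration of Table 1, the sketch below creates an equivalent table. Several points are assumptions: SQLite stands in for the chapter's unnamed database engine, "(p)" is read as the primary key, Text and Memo are both mapped to SQLite `TEXT`, and the sizes are enforced with `CHECK` constraints because SQLite does not enforce declared lengths. The helper `add_document` and the sample values are hypothetical.

```python
import sqlite3

# Assumption: an in-memory SQLite database stands in for the real engine.
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE document (
        doc_id      TEXT PRIMARY KEY CHECK (length(doc_id) <= 10),  -- Text, size 10
        doc_content TEXT,                                           -- Memo (long text)
        doc_path    TEXT CHECK (length(doc_path) <= 100)            -- Text, size 100
    )
    """
)

def add_document(doc_id, content, path):
    """Insert one document row; the transaction commits on success."""
    with conn:
        conn.execute(
            "INSERT INTO document (doc_id, doc_content, doc_path) VALUES (?, ?, ?)",
            (doc_id, content, path),
        )

# Hypothetical sample row.
add_document("D0001", "Sample document text", "docs/d0001.txt")
row = conn.execute("SELECT doc_id, doc_path FROM document").fetchone()
print(row)
```

An identifier longer than 10 characters violates the `CHECK` constraint and raises `sqlite3.IntegrityError`, which approximates the size limits listed in the table.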
