Automatically Extracting and Tagging Business Information for E-Business Systems Using Linguistic Analysis

Automatically Extracting and Tagging Business Information for E-Business Systems Using Linguistic Analysis

Sumali Conlon, Susan Lukose, Jason G. Hale, Anil Vinjamur
DOI: 10.4018/978-1-59904-192-6.ch004
OnDemand:
(Individual Chapters)
Available
$33.75
List Price: $37.50
10% Discount:-$3.75
TOTAL SAVINGS: $3.75

Abstract

The Semantic Web will require semantic representations of information that computers can understand when they process business applications. Most Web content is currently represented in formats such as text, that facilitate human understanding, rather than in the more structured formats, that allow automated processing and computer understanding. This chapter explores how natural language processing (NLP) principles, using linguistic analysis, can be employed to extract information from unstructured Web documents and translate it into extensible markup language (XML)—the enabling currency of today’s e-business applications, and the foundation for the emerging Semantic Web languages of tomorrow. Our prototype system is built and tested with online financial documents.

Complete Chapter List

Search this Book:
Reset