Corporate Disclosure Measurement

Corporate Disclosure Measurement

Copyright: © 2018 |Pages: 11
DOI: 10.4018/978-1-5225-2255-3.ch165
(Individual Chapters)
No Current Special Offers


This chapter conducts the study on the use of text mining for corporate disclosure measurement. During the last few decades, research on corporate disclosure has received significant concentration from the researchers. Corporate disclosure measurement has been used by researchers to reveal the real world scenario of corporate disclosure practice. Very often, the researchers have used content analysis for making comments on the corporate disclosure practice. This chapter investigates the usability of text mining approaches for measuring the corporate disclosure. To do this, stages and general techniques of text mining have been discussed. The usability and advantages of text mining approaches for corporate disclosure measurement have been investigated. And prospective stages of text mining have been proposed to measure the corporate disclosure. The investigation and study indicates that text mining can contribute significantly to measure the corporate disclosure.
Chapter Preview


Researchers have mentioned that about 90% of the entire world’s data has been created in the past 2 years (e.g., Kolb & Kolb, 2013). Most of these data are made in electronic format. Corporate disclosures (CDs) are available now in electronic formats through different documents also. As the organizations are disclosing the disclosure annually or in the interim period, the numbers of volumes are increasing gradually. Organizations disclose disclosure through various media, e.g., annual reports, websites, supplementary of financial statements and others.

A number of researchers have investigated the corporate disclosure (CD) scenario in the last decades as it is assumed to have an impact on the capital market (Healy & Palepu, 2001). CD can be categorized into two broad groups – Mandatory disclosures and Voluntary disclosures. Mandatory disclosures are obligatory by the concern organization of a country (e.g., Securities and Exchange Commission). In contrast, voluntary disclosures are made by the firms voluntarily considering the interest of stakeholders. A number of topics on CD have been investigated e.g., financial disclosure (e.g., Malone et al., 1993), human resource capital (e.g., Bontis, 2003), environmental disclosure (e.g., Gamble et al., 1996), strategic disclosure (e.g., Wagenhofer, 1990) and others.

Very often, researchers have used content analysis to explore the scenario of CD practice (e.g., Bontis, 2003; Khan et al., 2011). In content analysis they normally count the frequency of words to explore the scenario. Researchers normally select a list of related terms or keywords to the concern topic and search the terms in the annual reports or other related supplementary where disclosure has been presented by the organizations (e.g., Bontis, 2003).

Researchers have urged for more reliable measurement of CD practice (e.g., Rahman & Post, 2011). Consistent with that, this chapter proposes the use of text mining (TM) approach for measuring the CD. Currently, different areas of researches are applying the approaches and techniques of TM. Researcher of medical science, politics, business and others have used the techniques of TM to extract the information (Moohebat et al., 2015). The purpose of TM is to process unstructured textual information. In large text collection, TM works as a tool for knowledge discovery (Gomez et al., 2002). TM uses several techniques to extract the information, e.g., text classification, text clustering, text summarization, sentiment analysis and others. This chapter investigates the usability and advantages of TM approaches in the measurement of CD. Therefore, the aim of this paper is fourfold. First, it reviews the background of CDs and their measurement and background of TM. Second, it discusses the TM approaches, including general model of TM and methodology. Third, the usability of TM in measuring the CD has been investigated. And finally, it discusses the prospects of research in the area of using the TM in CD measurement.

Key Terms in this Chapter

Corporate Disclosure: Financial and non-financial information of organizations those are disclose to the public.

Nested Terms: “Nested terms” is a concept to present the joint generalization of words.

Mapping: Describing the characteristics of the studies that have been explored by the researchers.

Stemming: The term stemming refers to the reduction of words to their roots so that different grammatical forms or declinations of verbs are identified and indexed as the same word.

Bag-of-Words Model: In this approach a text is considered as the bag of its words. The frequency or occurrence of each word is used as a feature in Bag-of-Words model to classify the documents.

Complete Chapter List

Search this Book: