Article Preview
TopIntroduction
The features of spoken language are widely discussed (e.g. Aijmer (1989, 2002, 2009, 2011), Brazil (1995), Biber et al. (1999), Carter (2006), Cutting (2006) and Adolph (2008)). Most of the studies explore native speakers’ (NSs) speech. There are relatively few studies of non-native speakers (NNSs), in particular Chinese NNSs. A few exceptions are Yang and Wei’s (2005) and Xu and Xu’s (2007) studies, in which the frequent use of I think in the Chinese NNSs’ speech has been interpreted as an overused fragment.
This paper begins with a review of the literature, including the grammar aspects of I think with emphasis on its grammatical ambiguity, which causes difficulty in the distinction between its non-discourse use (Type A) and discourse use (Type B). It then introduces the three major corpora under investigation and research methods. The analysis presents the frequency information first, showing an overall sense of the use of I think in the six sub-corpora under investigation. The major analysis is the examination of co-occurring linguistic environment of Type B I think. Based on the identified co-occurrence, its functions are suggested. This paper concludes from the corpus analysis that both Chinese NNSs’ and NSs’ awareness of usages differences of I think can be raised.
Similar to the previous studies, the present study, based on a detailed analysis of three corpora: The Spoken English Corpus of Chinese Learners (SECCL) (Wen, Wang, & Liang, 2005), the Michigan Corpus of Academic Spoken English (MICASE) (Simpson, Briggs, Ovens, & Swales, 2002) and the International Corpus of English The British Component (ICE-GB) (2006) (see Table 1 for more details), finds that the Chinese NNSs in SECCL use I think more frequently than the NSs. However, it is questionable whether or not Chinese NNSs overuse I think. It seems reasonable that the sheer number of occurrences is interpreted as overuse; on the other hand, it seems that frequency information has been over-generalised without considering possible factors in using I think. The aim of this paper is to discuss some possible contributing factors of the over-representation1 of I think in the corpus of Chinese NNSs’ speech. Two aspects are to be considered: the context of I think, within such macro-factors as discussion topics and micro-factors, like mirroring linguistic usages.
Table 1. Corpora under investigation
Corpus | Number of Texts | Word Counts (Tokens)* | Average Words per Text (Tokens) |
SECCL: Monologues (Chinese NNSs) | 1,143 | 336,303 | 294 |
SECCL: Dialogues (Chinese NNSs) | 1,143 | 596,639 | 522 |
MICASE: Highly monologic discourse mode (American NSs) | 13 | 134,096 | 10,315 |
MICASE: Highly interactive discourse mode (American NSs) | 48 | 577,996 | 12,042 |
ICE-GB: Unscripted monologues (British NSs) | 70 | 153,646 | 2,195 |
ICE-GB: Private direct conversations (British NSs) | 90 | 185,000 | 2,056 |