Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Mathematical values in the processing of Chinese numeral classifiers and measure words

  • One-Soon Her ,

    Contributed equally to this work with: One-Soon Her, Ying-Chun Chen

    Roles Conceptualization, Funding acquisition, Methodology, Project administration, Resources, Supervision, Writing – review & editing

    Affiliations Graduate Institute of Linguistics, National Chengchi University, Taipei, Taiwan, Research Center for Mind, Brain, and Learning, National Chengchi University, Taipei, Taiwan

  • Ying-Chun Chen ,

    Contributed equally to this work with: One-Soon Her, Ying-Chun Chen

    Roles Formal analysis, Investigation, Methodology, Resources, Software, Visualization, Writing – original draft

    Affiliation Graduate Institute of Linguistics, National Chengchi University, Taipei, Taiwan

  • Nai-Shing Yen

    Roles Conceptualization, Data curation, Methodology, Project administration, Supervision, Validation, Visualization, Writing – review & editing

    nsy@nccu.edu.tw

    Affiliations Research Center for Mind, Brain, and Learning, National Chengchi University, Taipei, Taiwan, Department of Psychology, National Chengchi University, Taipei, Taiwan

Abstract

A numeral classifier is required between a numeral and a noun in Chinese, which comes in two varieties, sortal classifer (C) and measural classifier (M), also known as ‘classifier’ and ‘measure word’, respectively. Cs categorize objects based on semantic attributes and Cs and Ms both denote quantity in terms of mathematical values. The aim of this study was to conduct a psycholinguistic experiment to examine whether participants process C/Ms based on their mathematical values with a semantic distance comparison task, where participants judged which of the two C/M phrases was semantically closer to the target C/M. Results showed that participants performed more accurately and faster for C/Ms with fixed values than the ones with variable values. These results demonstrated that mathematical values do play an important role in the processing of C/Ms. This study may thus shed light on the influence of the linguistic system of C/Ms on magnitude cognition.

Introduction

Chinese is a numeral classifier language, where an element known as numeral classifier that denotes a unit is essential when a noun (N) is quantified by a numeral (Num). Numeral classifiers come in two varieties, sortal classifer (C) and measural classifier (M), also known as ‘classifier’ and ‘measure word’, respectively. In Table 1, ben and ke are classifiers (C); xiang (box) and da (dozen) are measure words (M). In this paper, the category of numeral classifiers is referred to as C/M.

thumbnail
Table 1. Examples of Chinese numeral classifiers and measure words.

https://doi.org/10.1371/journal.pone.0185047.t001

C and M converge in that they appear in the same grammatical position and are mutually exclusive [13]. However, C and M diverge in that Cs qualify the noun but Ms quantify the noun [46]. To be more specific, Cs categorise nouns by highlighting certain salient or inherent properties of the noun, while Ms denote the quantity of the entity of the noun [5,7]. For example, ben in Table 1 highlights the volume feature and can only be used for a bound copy of printed materials, such as a book or a magazine, whereas xiang means “a box of”, which carries new information indicating the quantity of the noun being quantified, since a box can contain different amounts of any object.

This convergence and divergence between C and M were reconciled by Her [8], where an innovative mathematical view was proposed to interpret the relation between Num and C/M as multiplication. The distinction between C and M is encoded precisely as follows: [Num X N] = [[Num × X] N], where X = C iff X = 1, otherwise X = M. In other words, X being the element required between Num and N, X is C if its inherent mathematical value is 1; otherwise, X is M. For example, in shi ke pingguo (ten C apple), shi (ten) and ke (C) form a multiplicative unit, i.e., (10×1); likewise, in shi da pingguo (ten M-dozen apple), shi (ten) and da (M-dozen) also form a multiplicative unit, i.e., (10×12). Under this view, C and M thus converge as the multiplicand of Num, the multiplier. C and M occupy exactly the same syntactic position and thus belong to a single syntactic category. Yet, C and M diverge in terms of their respective inherent values: C = 1, M ≠ 1, and thus constitute two distinct subcategories.

Note that a multiplicand 1 is unique in that it is the only identity element, or neutral element, in multiplication; 1 is thus redundant in the multiplicative equation. A multiplicand with any other value, numerical or non-numerical, is not redundant. This unique property of multiplicand 1 can explain why Cs may behave differently from Ms, in spite of C/M as a single syntactic category [8].

Her and Wu [9] further proposed a taxonomy of the magnitude values that C/Ms encode, along two dimensions: numerical vs. non-numerical and fixed vs. variable (See Table 2). While M1 and M2 both encode numerical values, the former has fixed values and the latter does not. Likewise, M3 and M4 both encode non-numerical values, but the former has fixed values and the latter does not. Thus, C, M1 and M3 encode fixed values, while M2 and M4 do not.

M2 and M4 are thus similar in that their values are vague and not fixed. The difference is that the vague value of a M2 is numerical, while that of a M4 is not. For example, the M2 cuo ‘small gang’ must take a count noun, e.g., yi cuo qiangdao ‘a small gang of bandits’, and must have a numerical value larger than one. Likewise, the M2 bang ‘gang’ in yi bang qiangdao ‘a gang of bandits’ must also have a numerical value larger than two. However, the typical number implied by bang ‘gang’ is larger than that implied by cuo ‘small gang’. In contrast to M2, the vague values denoted by M4 are not numerical and may be length, area, weight, volume, time, etc. For example, the M4 di ‘drop’, as in yi di shui ‘a drop of water’, refers to a vague volume of water in the shape of a teardrop. The M4 tan ‘puddle’, as in yi tan shui ‘a puddle of water’, though likewise referring to a vague volume of water, but in a random shape, has an implied value much bigger than that implied by di ‘drop’. Note, crucially, that the English counterparts of M2 and M4 are clearly nouns in terms of syntactic category. Yet, in Chinese, M2 and M4 are part of a distinctive syntactic category C/M, or numeral classifiers [8]. It is controversial whether the processing of Chinese numeral classifiers involves magnitude. The aim of our study was to address this issue.

However, as attractive as this theory may be, empirical evidence of the mathematical function of C/Ms was lacking. Thus, the aim of this study was to conduct a psycholinguistic experiment to examine whether participants process C/Ms based on their mathematical values as this multiplicative theory of C/M predicted they would. More specifically, the theory predicted that the difference between C/Ms with fixed values, i.e., C, M1, and M3, and those with non-fixed vague values, i.e., M2 and M4, would be more prominent than the difference between C/Ms with numerical values, i.e., C, M1, and M2, and those with non-numerical values, i.e., M3 and M4, for the simple reason that a M2/4 with a vague value cannot be coerced into having a rigid fixed value without an appropriate and robust discourse context, while a M3/4 with an inherent non-numerical value can be quite easily converted numerically to a smaller unit, e.g., one kilo into one thousand grams.

The most relevant previous study is Cui et al. [10], where a functional magnetic resonance imaging (fMRI) experiment compared the brain activities of processing classifiers with those of processing tool nouns, numbers, and dot arrays. Tool nouns are non-quantity words which refer to concrete objects used as tools, utensils, or instruments, e.g., liandao (sickle) and laba (trumpet). A semantic distance comparison task was used, where participants chose from two items the one that was semantically closer to the target item. For example, the target fuzi (axe) was presented on the top of the screen and participants had to judge whether liandao (sickle) or niezi (tweezers) which were displayed at the bottom of the screen was semantically closer to the target word fuzi (axe). Greater activation was found in the left middle frontal gyrus (MTG) and the left inferior frontal gyrus (IFG) instead of the right intraparietal sulcus (IPS) for processing classifiers and tool nouns than numbers and dot arrays. This result is rather unexpected under Her’s [8] theory, which predicts that brain activities of processing C/Ms should be more similar to those of processing numbers and dot arrays than to those of processing tool nouns. Given that some C/Ms (see Table 2), numbers, and dot arrays represent numerical magnitude, we expected that the processing of C/Ms, but not that of tool nouns, would elicit higher activations in the right IPS, which plays an important role in representation of numerical magnitude [1112].

One possible critical reason why Cui et al. [10] did not find the IPS more activated for processing C/Ms than processing tool nouns is that their experimental materials of the so-called “classifiers” mixed up Cs and Ms and thus no distinction was made between Cs and Ms. Yet, as reviewed above, linguistic studies suggested that Cs differ significantly from Ms [e.g., 56]. Furthermore, the taxonomy proposed by Her and Wu [9] also categorizes the mathematical values of C/Ms along the dimension of [fixed vs. variable]. Presumably, C/Ms with a fixed value may be related to exact representation of numbers, while C/Ms that encode a variable value may be associated with approximation. We hypothesized that participants would choose the C/M option that had the same or closer value as that of the target C/M, when the values in question were all fixed. However, it was unclear how participants would process C/Ms with variable values.

Note also that Cui et al. [10] did not use complete [Num X N] phrases, e.g. yi zhang haibao (1 C-flat poster, one poster), as stimuli in the semantic distance comparison task. Rather, they used [Num X] phrases, e.g. yi zhang (1 C- flat), in their study. Thus, the semantic context was not strictly confined in their study. Therefore, we replicated the paradigm by Cui et al. [10] but used a more appropriate set of stimuli, i.e. [Num X N] phrases, e.g. yi zhang haibao (1 C- flat poster, one poster), to examine whether C/Ms were processed based on mathematical values.

We hypothesized that, first, participants would compare the mathematical values the C/Ms encode and select the one with the same or closer value to the target C/M, and, second, the accuracy of C/Ms with fixed values to be higher than that of C/Ms with variable values.

Method

Participants

Twenty individuals (16 females, 4 males, ages 20–28, mean age = 22.6 ± 2.06) were recruited from National Chengchi University. Participants were right-handed, had normal or corrected-to-normal vision. Their first language is Mandarin. They gave written informed consent to the study approved by the Research Ethics Committee of National Taiwan University and received NT$100.

Stimuli and experimental design

We conducted a 2 × 2 within-subject design. The two independent variables were the numerical type (numerical: C, M1, and M2 vs. non-numerical:M3 and M4) and mathematical value type (fixed value: C, M1, and M3 vs. variable value: M2 and M4). There were thirteen C/Ms for each condition (see S1 Table). Each C/M repeated twice as the target C/M. For each trial, another two C/Ms from the same condition were selected to be paired with the target C/M. When three C/Ms phrases were paired together for a trial, two experimenters produced a reasonable noun for this set of C/Ms to confine the semantic contexts. One experimenter created these [Num X N] phrases and the other experimenter checked if all phrases were clear and understandable. For the phrases that were unclear, the two experimenter discussed and came up with another noun that better fit the set of C/Ms. The nouns were unrepeated throughout the experiment. Consequently, there were 104 sets of C/M phrases in total. Each condition included 26 trials. The target C/M phrases were composed of the number 1 and a C/M. The answer and distractor C/M phrases included the number 1, a C/M, and a noun. The answer and distractor C/M phrases differed in the C/M. The numeral enabled participants to process the C/M as a C/M, not a noun. By designing the answer/distractor phrase as a minimal pair, we strictly confined the semantic context for the C/M (Table 3). We recorded responses and reaction times (RT).

thumbnail
Table 3. Structure of the experimental stimuli with a sample set for each condition.

https://doi.org/10.1371/journal.pone.0185047.t003

Procedure

There were eight practice trials to ensure that participants fully understood the task. In each trial, participants saw three C/M phrases on the screen at the positions of the three end points of a triangle (Fig 1). They had to perform a semantic distance comparison task: which one of the two C/M phrases at the bottom was semantically closer to the target C/M phrase at the top. The positions of the answers and the distractors were randomized. Participants had up to 8 seconds (s) to respond. The inter-trial interval was 500 milliseconds (ms). There were 104 trials in total; each condition included 26 trials. The order of the trials was randomized. Right after the experiment, participants filled in a questionnaire to indicate their subjective mathematical values of the M2 and M4 used in the experiment. The questionnaire listed all C/Ms that appeared in the experiment in the form of [one C/M]. Take yi fu (one set of) for example, participants had to fill in the blank yue ““ge (around ““C) to indicate their subjective mathematical value. For non-numerical M3-4, participants had to fill in the blank to indicate around how much centimeter (length), square meter (area), gram (weight), and milliliter (volume) they think the M3-4 represented for different types of M3-4 respectively. For example, when participants saw yi bei (one cup of), they had to fill in the blank yue ““haosheng (around ““milliliter) to indicate their subjective mathematical value of yi bei (one cup of).

thumbnail
Fig 1. The experimental procedure.

In each trial, participants saw three C/M phrases on the screen. They had to choose between the two C/M phrases at the bottom the one that was semantically closer to the target C/M phrase on the top. The C/M options were composed of minimal pairs which included an identical numeral, a classifier or a measure word, and an identical noun.

https://doi.org/10.1371/journal.pone.0185047.g001

Data analysis

The responses and RT were analysed in a two-way (numerical/ non-numerical × fixed/ variable values) repeated measures ANOVA. IBM SPSS 20.0 was used for the statistical analysis with the α value set at .05. Post-hoc analyses of the simple main effects were made by means of t-tests applying Bonferroni’s correction for multiple comparisons. We calculated the accuracy of M2 and M4 based on the subjective mathematical values reported by the participants individually (see S1 Table for the descriptive statistic reports). For each participant, we determined the correct answer of each trial according to the subjective mathematical values that they reported. Take the sample set of M2 in Table 3 for example, if a participant reported that his/her subjective mathematical values of yi dui (one team of), yi qun (one group of), and yi bang (one gang of) were 10, 20, and 30, respectively, the answer of this trial for this participant would be yi qun shashou (one group of killers) instead of yi bang shashou (one gang of killers), as 20 is closer to 10 than 30 is.

Results

The mean (and standard deviation, SD) of accuracy and RT are shown in Table 4.

thumbnail
Table 4. The mean (and standard deviation) of accuracy and reaction times (RT) in experiment.

https://doi.org/10.1371/journal.pone.0185047.t004

Accuracy

The significant main effect of the numerical types was not significant, F (1,19) = .227, p = .639, such that the accuracy of the numerical C, M1, and M2 and that of the non-numerical M3 and M4 were not significantly different. However, there was a significant main effect of the mathematical value types, such that the accuracy of C/Ms with fixed values was significantly higher than those with variable values, F (1,19) = 68.298, p < .001. The accuracy of C and M1 was significantly higher than that of M2 (p < .001), and the accuracy of M3 was significantly higher than that of M4 (p < .001). There was no significant interaction effect between the numerical types and the mathematical value types, F (1,19) = .013, p = .91 (Table 4).

Reaction times

The significant main effect of the numerical types was not significant, F (1,19) = 2.098, p = .164, such that the RT of numerical C/M1-2 and that of non-numerical M3-4 were not significantly different. However, the mathematical value types displayed a significant main effect, F (1,19) = 37.726, p < .001, such that the participants responded faster while processing the C/M with fixed values compared with the C/M with variable values. The RT of C/M1 was significantly shorter than M2 (p = .001), and the RT of M3 was significantly shorter than M4 (p < .001). There was no interaction between the numerical types and the mathematical value types, F (1,19) = .068, p = .797 (Table 4). In general, the pattern of reaction times under the four conditions was consistent with that of accuracy. The higher the accuracy, the shorter the RT.

Discussion

The results showed that participants made semantic judgments based on the mathematical values of C/Ms, numerical or not, when the values were fixed rather than variable. They responded faster in processing C/Ms with a fixed value than a variable value. The mean accuracy of M2 (0.568) and M4 (0.583) with variable values was relatively low, even though the accuracy was calculated individually dependent on the subjective mathematical values reported by each participant. This was consistent with our prediction that C/Ms with fixed values are mathematically comparable, while C/Ms with variable values are too vague to be comparable.

It is still possible that participants represented a rough value that M2 or M4 encode and tried to compare. However, because of the variability of the values they encode, participants may not be able to represent these variable values exactly the same way every time. In other words, the subjective mathematical values may have fluctuated between the time of performing the semantic distance comparison task and filling in the post-experimental questionnaire. To modify this limitation in the current study, we suggest future studies ask participants to report their subjective mathematical values immediately after each trial.

It is worth noting that the mean subjective mathematical value of M2 ranged only from 5 to 18 and the variance was rather small (see S1 Table). This may make choosing between the two options of C/Ms difficult and result in fifty percent of chance to choose one of the two options of C/Ms. Even if the participants represented M2 as a mathematical value, the closeness of the two options of C/Ms may be too competitive to make a distinct difference. Furthermore, although the mean subjective mathematical value of M4 varied to a greater extent than M2 did, the variance was large. This indicates that there was a large individual difference of the subjective mathematical values of M4s. Future studies are suggested to use a complete sentence or story to confine the context to better control the semantic distance of C/Ms. Since behavioral responses could not answer whether participants processed C/Ms with variable values mathematically, future studies can further investigate the quantity processing of C/Ms that encode variable values using fMRI by examining the brain activations related to numerical representation such as the IPS [1112].

One may argue that the quantification of a gang of might be different for killers and for hooligans and the quantification of a litter of might be different for mice than for cats. This indeed may be true for M2 and M4, which have non-fixed variable values. Yet, this possible noun-contingency effect was not a factor in the experiment, as all minimal pairs of [one M2/4 N] have exactly the same N. One may also suspect that the discriminability of these pairs might vary over nouns, e.g., the difference between a team of salespeople and a gang of salespeople might be different in not only magnitude but sign from the difference between a team of killers and a gang of killers. Again, this possible effect was not a factor in the experiment as no such cross-pair comparison was elicited and only within-pair comparison was required.

Not surprisingly, there was no significant difference between numerical and non-numerical C/Ms. One of the reasons may be that we adopted the semantic distance comparison task in this experiment. It is likely that participants converted the non-numerical C/M into the same unit to make a comparison. For example, when participants had to choose between yi bang (a pound) and yi gongjin (one kilo), they represented them as 453 grams and 1000 grams, whether exactly or approximately, to make the judgment. In other words, it was possible that due to the nature of the semantic distance comparison task which may require accurate quantity comparison, participants preferred representing C/Ms as a numerical value to perform the task in the current study. This may explain why we did not observe significant difference between numerical and non-numerical C/Ms. We suggest future studies use other tasks to further investigate whether the cognitive processing of numerical and non-numerical C/Ms are similar in spite of experimental paradigms. Future study may also use neuroimaging techniques to examine whether numerical and non-numerical C/Ms engage in a similar neural network.

Partially consistent with our hypothesis that C/Ms encode mathematical values, we found that participants did represent and compared the mathematical values of C/Ms with fixed values. If participants did not process them based on their mathematical values, the accuracy of numerical C/Ms would not have been above the chance level. However, it remains unclear how participants processed the C/Ms with variable values. Moreover, it is unknown whether participants processed non-numerical C/Ms in a numerical form to perform the semantic distance comparison task. Therefore, future studies are needed to further examine the cognitive processing of non-numerical C/Ms using other tasks. In general, our findings, in part, corroborated Her's [8] mathematical theory of C/M that C/Ms encode mathematical values by providing behavioral evidence of C/Ms with fixed mathematical values.

To our knowledge, this study was the first study providing evidence that showed Chinese C/Ms encode mathematical values. Participants represented and compared fixed mathematical values of C/Ms to make a semantic judgment. This psychological finding laid an empirical foundation supporting Her's [8] mathematical theory, where C and M converge as the multiplicand of Num but diverge in terms of their respective value: C = 1, M ≠ 1. We verified the notion that Cs encode 1 and Ms encode certain other mathematical values by showing that participants chose the C/M that had the same or closer value to the target C/M when the mathematical values were fixed in the semantic distance comparison task. Future psycholinguistic and neurolinguistic studies should further investigate whether the mathematical relation between Num and C/M is multiplication. In sum, findings in the current study implied that the linguistic system of C/Ms might influence magnitude cognition.

Supporting information

S1 Dataset. Mean accuracy and RT in the current study.

https://doi.org/10.1371/journal.pone.0185047.s001

(DOCX)

S1 Table. Stimuli used in the experiment.

The word frequency was obtained from the Digital Resources Center for Global Chinese Language Teaching and Learning by Cheng et al. [13].

https://doi.org/10.1371/journal.pone.0185047.s002

(DOCX)

Acknowledgments

We thank Denise H. Wu for suggesting this study, Kuan-Ju Chou for helping to create experimental stimuli, and Jie-Li Tsai, Marc Tang, and Ya-Ting Chuang for helpful comments on a previous version of the paper. Previous versions of the paper were presented at the 2016 Annual Meeting of Taiwan Society of Cognitive Neuroscience, January 23, 2016, National Chengchi University, Taipei, Taiwan, and the 2016 Annual Meeting of the Cognitive Neuroscience Society, April 2–5, 2016, New York Hilton Midtown, New York. We thank the participants of the two conferences who commented on the work and gave constructive suggestions.

References

  1. 1. He J. Xiandai Hanyu Liangci Yanjiu [A Study of Measures in Modern Chinese]. Beijing: Beijing Language University Press; 2008.
  2. 2. Hsieh ML. The internal structure of noun phrases in Chinese. Crane Publishing Company; 2008.
  3. 3. Her OS. Structure of classifiers and measure words: A lexical functional Account. Language and Linguistics. 2012 Nov 1; 13(6): 1211.
  4. 4. Adams KL, Conklin NF. Toward a theory of natural classification. InAnnual Regional Meeting of the Chicago Linguistic Society 1973 Apr 13 (Vol. 9, pp. 1–10).
  5. 5. Her OS, Hsieh CT. On the semantic distinction between classifiers and measure words in Chinese. Language and linguistics. 2010 Mar 1; 11(3): 527–51.
  6. 6. Li X, Rothstein S. Measure readings of Mandarin classifier phrases and the particle de. Language and Linguistics. 2012 Jul 1; 13(4): 693–741.
  7. 7. Tai J, Wang L. A Semantic Study of the Classifier Tiao. Journal of the Chinese Language Teachers Association. 1990; 25(1): 35–56.
  8. 8. Her OS. Distinguishing classifiers and measure words: A mathematical perspective and implications. Lingua. 2012 Nov 30; 122(14): 1668–91.
  9. 9. Her OS, Wu JS. Taxonomy of numeral classifiers and measure words: A formal semantic proposal. Under review with Linguistics. 2017.
  10. 10. Cui J, Yu X, Yang H, Chen C, Liang P, Zhou X. Neural correlates of quantity processing of numeral classifiers. Neuropsychology. 2013 Sep; 27(5): 583–94. pmid:23937482
  11. 11. Dehaene S, Piazza M, Pinel P, Cohen L. Three parietal circuits for number processing. Cognitive neuropsychology. 2003 May 1; 20(3–6): 487–506.
  12. 12. Nieder A, Dehaene S. Representation of number in the brain. Annual review of neuroscience. 2009 Jul 21; 32: 185–208. pmid:19400715
  13. 13. Cheng CC, Huang CR, Lo FJ, Tsai MC, Huang YC, Chen XY, et al. Digital Resources Center for Global Chinese Language Teaching and Learning. 2005 [cited 22 May 2017]. Available from: http://elearning.ling.sinica.edu.tw/index.html.