This Illustration focuses on Kua’nsi (kʰwa33n̩21si33, ISO639: ykn), a Central Ngwi (or Yi) language of the Sino-Tibetan family (Bradley Reference Bradley1997; Fan et al., Reference Fan, Zhou, Yang and Guo2017). It is spoken by approximately 5000 people in Liuhe Township (六合乡, Liùhé Xiāng), Heqing County (鹤庆县, Hèqìng Xiàn), Dali Bai Autonomous Prefecture, Yunnan Province, China (see the map in Figure 1). Kua’nsi people refer to themselves as Kua’eshi (夸萼氏 Kuà’èshì) in Chinese and other ethnic groups, Bai (白 Bái) and Han (汉 Hàn) people living in the county, call them as Baiyi people (白依人 Báiyīrén) in Chinese because of their white traditional clothes.
In addition to Kua’nsi, Castro, Crook and Flaming (Reference Castro, Crook and Flaming2010) report that there are four other Ngwi groups in Liuhe Township: Kuamasi, Zibusi, Laizisi and Sonaga. All these Ngwi groups in Liuhe Township have been recognised as part of Yi ethnic minority nationality in China. These groups have not been noticed until recently, and their languages have not yet been studied in detail. As reported by Castro et al. (Reference Castro, Crook and Flaming2010), Kua’nsi is most closely related to Kuamasi, which has around 1000 speakers living in Songping village (松坪村 Sōngpíng Cūn), Liuhe Township. Kua’nsi and Kuamasi are partially mutually intelligible, as there is around 65% of the lexical similarity between these two languages based on a comparison of the Swadesh 100 wordlists (Castro et al. Reference Castro, Crook and Flaming2010, p. 28). The map in Figure 2 shows the location of all Ngwi groups found in Heqing County.
Multilingualism is common among Kua’nsi speakers. The major ethnic group in Heqing county is Bai and most people can speak both Bai and the local variety of Southwestern Mandarin at different levels. Many Kua’nsi elders are bilingual in Kua’nsi and Bai. As Mandarin has become the more socially dominant language, young Kua’nsi people are more proficient in Mandarin and some of them may also have limited knowledge of Bai. Kua’nsi children are still learning Kua’nsi as their first language and use it actively in their daily life except for schooling. Education is conducted primarily in Standard Mandarin (普通话 pŭtōnghuà ‘common speech’). In the early years of primary school, if the teachers are also Kua’nsi, education is sometimes supplemented in Kua’nsi to help children understand Standard Mandarin properly. Afterwards, the schooling is only in Standard Mandarin.
As not much work has been done on Kua’nsi except Duan (Reference Duan1998) which provides a sketch of Kua’nsi phonology and morphosyntax, this Illustration aims to provide a detailed description of the phonetic aspects of Kua’nsi. This Illustration is based on both lexical data, containing over 1700 words, and textual data collected during the author’s field trips in 2018 and 2019. The lexical data includes the Swadesh 200 wordlist and a long wordlist compiled based on Bradley (Reference Bradley1979) and works on other Ngwi languages, especially the most related Ngwi language Talu (Zhou, Reference Zhou2004). Each of the words was repeated three or more times when recorded. The lexical data were recorded mostly with two elder Kua’nsi speakers in Wuxing village (五星村, wŭxīng cūn): JIAO Jize (绞基泽, JJZ, male, born in 1962), JIAO Xiangxiu (绞香秀, JXX, female, born in 1964), and the textual data were recorded with them and ten other speakers. The appended text was recorded with JXX. All the participants were born and raised in Kua’nsi villages, and Kua’nsi is their first language. They can also speak Mandarin and Bai at different levels.
Consonants
As demonstrated in the consonant table, the consonant inventory of Kua’nsi comprises 37 phonemes. The examples below illustrate the phonemic contrast between each phoneme. The bilabial affricates [pɸ, pɸʰ, bβ] are in complementary distribution with bilabial plosives /p, pʰ, b/. The bilabial affricates only occur before the close central /ɨ/, while the bilabial plosives do not precede /ɨ/ in Kua’nsi. The labio-dental fricatives /f, v/ also occur only before /ɨ/, but they can appear before other vowels in Chinese loanwords where they are in contrast with other consonant phonemes; thus, they are not considered as allophones of other phonemes. The Kua’nsi consonants only occur in the syllable onset position.Footnote 1
The plosives and affricates, except the glottal stop, show a three-way contrast: voiceless unaspirated, voiceless aspirated and voiced. The plosives show four places of articulation: bilabial, alveolar, velar and glottal. The distribution of velar plosives /k, kʰ, ɡ/ is restricted, and they cannot be followed by the front vowels /i, ɛ/, or the palatal approximant /j/. The distribution of other plosives is not restricted.
The glottal stop /ʔ/ occurs intervocalically in Kua’nsi in addition to the word-initial position, and the realisation of intervocalic /ʔ/ is varied. It can be pronounced with full closure of the vocal folds, but often without full closure. This aligns with the crosslinguistic tendency of glottal stops to be realised with incomplete closure in intervocalic positions (Ladefoged & Maddieson, Reference Ladefoged and Maddieson1996:75; Garellek, Chai, Huang & Van Doren Reference Garellek, Chai, Huang and Van Doren2023). Figure 3 shows the waveforms of the word bo 21 ʔo 33 ‘ant’ produced by speaker JXX successively. As indicated in the waveform at the top of Figure 3, in the first production, /ʔ/ is pronounced with irregular voicing and the non-modal voicing seems to continue into the following vowel. In the second production of this word, /ʔ/ shows full occlusion surrounded by irregular voicing, as a long silent closure period can be observed for the intervocalic /ʔ/ in the waveform at the bottom of Figure 3.
Table 1 gives the mean voice onset times (VOT) for voiceless plosives in the word-initial position. These measurements are based on data from two older speakers, JJZ and JXX, across 71 and 155 tokens respectively. VOT ranges between about 10 ms and 30 ms for the unaspirated plosives. By contrast, for the voiceless aspirated plosives, VOT ranges between about 80 ms to 120 ms.
The affricates distinguish three places of articulation: bilabial, alveolar and alveolopalatal. The bilabial affricates [pɸ, pɸʰ, bβ], as mentioned above, can only occur before /ɨ/ and are in complementary distribution with the bilabial plosives. The distribution of alveolopalatal affricates /tɕ, tɕʰ, dʑ/ is restricted, and they can only precede /i/, /ɥ/, /j/ and /w/. By contrast, the alveolar affricates /ts, tsʰ, dz/ have a wider distribution in Kua’nsi, and only /i, ɥ, j/ cannot occur after them.
Kua’nsi has a large inventory of fricatives. They show five places of articulation: labiodental, alveolar, alveolopalatal, velar and glottal with a voiceless vs. voiced contrast, except for the glottal fricative /h/. Some of the fricatives have very limited distributions in Kua’nsi. The labiodentals /f, v/, except in Mandarin loan words, occur before /ɨ/. The voiced alveolar fricative /ʑ/ only occurs before /u/, and the voiced velar fricative /Ɣ/ only occurs before /a, ɘ/.
Figure 4 shows averaged Fast Fourier Transformed spectra for the four voiceless fricatives /f, s, ɕ, x/ of Kua’nsi. It can be seen that /x/ has the lowest spectral energy almost at any points. /s/ has more spectral energy above 6 kKz than any other fricatives, while within the range of around 3–6 kHz, /ɕ/ has more spectral energy. The spectral energy of /f/ fluctuates mostly between 20–30 dB in the whole spectral range of 1–12 kHz.
Similar to many other Ngwi languages, for example, Central Lisu (Tabain, Bradley & Yu, 2018), /h/ and /ʔ/ in Kua’nsi also cause the following vowels to be nasalised. This close relationship between nasalisation and glottal consonants is found in many languages and is termed rhinoglottophilia by Matisoff (Reference Matisoff1975).Footnote 2 In Kua’nsi, all vowels after /h/ are nasalised, regardless of the position of /h/ in the word as illustrated in the following examples.
On the other hand, the nasalisation of vowels after /ʔ/ is not prevalent in Kua’nsi. When /ʔ/ occurs word-initially, the following vowels are not nasalised. When it occurs intervocalically, the vowels are nasalised. However, this kind of nasalisation is only found in the male speaker JJZ’s speech, while vowels after /ʔ/ are not nasalised in the female speaker JXX’s speech as indicated in her pronunciation of the word bo 21 ʔo 33 [bo21ʔo33] ‘ant’ in the above example in Figure 3. It should be noted that this description of vowel nasalisation after an intervocalic /ʔ/ is based on only three words in the data. While it is clear that vowels are not nasalised after a word-initially /ʔ/, the exact status of vowels nasalisation after an intervocalic /ʔ/ is unclear given the limitation of the data.
Kua’nsi nasals distinguish four places of articulation: bilabial, alveolar, alveolopalatal and velar. Like the velar plosives, the velar nasal /ŋ/ also cannot occur before /i, ɛ, j/. This restriction does not apply to other nasal consonants. Vowels after nasal consonants tend to be nasalised, but there is no phonemic distinction between nasal and oral vowels in Kua’nsi.
There is one voiced lateral approximant /l/ in Kua’nsi. The distribution of this consonant is not restricted. Except /ɨ/, other vowels can follow the lateral approximant /l/.
There are three central approximants /j, w, ɥ/ in Kua’nsi. The distributions of approximants /j, w, ɥ/ are restricted. /j/ can occur with /a, o, u, ɛ/. /w/ can occur with /a, o, ɛ, i/. Compared with /w/ and /j/, the voiced labio-palatal approximant /ɥ/ has a more limited distribution in Kua’nsi. Only the front vowels /i, ɛ, a/ occur after /ɥ/, and it only occurs in a handful of words. The approximants /j, w/ alone can occur as the syllable onset more frequently, but only one word /ɥɛ3 3lwa3 3/ ‘bell’ in the data shows that /ɥ/ can occur as a syllable onset.
The approximants /j, w, ɥ/ can follow other consonants to form a consonant cluster as a syllabic onset, as illustrated by the examples below. Each of the approximants has different distributions in consonant clusters. /j/ forms clusters with bilabials (plosives and nasal), alveolars (plosives, nasal, fricative, affricates and lateral) and glottal (plosive and fricative): /p, pʰ, b, t, tʰ, d, ʔ, ɕ, tɕ, tɕʰ, dʑ, m, n, l, h/. /w/ can occur after all plosives, alveolar fricatives /s, z/ and affricates /tɕ, tɕʰ, dʑ/, velar fricative /x/, glottal fricative /h/ and three nasals /n, m, ŋ/. /ɥ/ only occurs after alveolopalatal fricatives and affricates /ɕ, ʑ, tɕ, tɕʰ, dʑ/. The vowels that can occur after these approximants are also restricted. /a, o, u, ɛ/ can occur after /j/; /a, o, ɛ/ can occur after /w/; and only the front vowels /i, ɛ, a/ can occur after /ɥ/.
Syllabic nasals
The nasal consonants /m, n, ɲ, ŋ/ in Kua’nsi can occur as syllabic consonants and form the nucleus of a syllable, but only when there is a homorganic consonant as the onset of the following syllable. This means that syllabic nasals only occur word-initially or inter-syllabically, and never word-finally. There is no restriction on the consonants after syllabic nasals, and it can be another nasal, for example, /u⁵⁵m̩21mo3 3/ ‘aunt (wife of father’s brother)’. The occurrence of syllabic /ɲ/ is restricted. Although it is theoretically possible, it has not been found that /ɲ/ occurs inter-syllabically in the lexical data. It is mostly found as the negative marker /N̩21-/, whose place of articulation is conditioned by the following syllable onset of the verb.
In Kuamasi, a closely related Central Ngwi language to Kua’nsi, the corresponding syllable in cognate words contains a vowel. According to the word list provided by Castro et al. (Reference Castro, Crook and Flaming2010), some cognate words of the above examples in Kuamasi are: /ʔa3 3ta3 3nɯ⁵⁵/ ‘tomorrow’, /ŋɯ21ka⁵⁵/ ‘head’, /mo21/ ‘not (negation)’. It seems that the syllabic nasals in Kua’nsi come from different sources and the historical development needs further examination.
The duration of syllabic nasals is significantly different from non-syllabic nasals (one-way ANOVA, p < 0.001), as shown in Figure 5. The left box shows the distribution of the duration of non-syllabic nasals (n=219), and the right one shows that of syllabic nasals (n=17). The boxplots reflect the interquartile range with the median indicated by a horizontal line in the middle of the box. Nearly three-quarters of non-syllabic nasals have durations of less than 100 ms, while most syllabic nasals are longer than 100 ms.
Preglottalised consonants
Kua’nsi has a series of preglottalised sonorants. The difference between preglottalised and non-preglottalised nasals is illustrated in the following (near) minimal pairs. Although the nasals /m, n, ŋ/ have preglottalised counterparts, there are no examples showing that the palatal /ɲ/ can also be preglottalised.
The evidence for the preglottalised nasals can be found in the waveform and spectrogram of the recordings. When JJZ pronounced the preglottalised nasals, the closing of the glottis is often signalled by a small pulse in the waveform as well as a visible shadow at the corresponding time in the spectrogram, as circled in the waveforms and spectrograms of two words, /ʔma̠21/ ‘old’ and /ʔmja21kwa21/ ‘donkey’ in Figure 6. Figure 7 shows the waveform and spectrogram of /mi55la55/ [mi55la̤55] ‘night’ produced by JJZ. As indicated in Figure 7, the small pulse before the preglottalised nasals as observed in Figure 6 is absent in the production of the non-preglottalised nasal.
Vowels
There are seven modal vowels in Kua’nsi, and each modal vowel has a tense counterpart as indicated by an underlined vowel symbol. The phonemic contrast between each vowel is established by their occurrence in sequence /tV/ when it is available in the data, as illustrated in the examples above. However, such a sequence is not possible for [z̩] and /ɨ/, as they have limited distribution in Kua’nsi and cannot occur after /t/. A diphthong [ɛi] is found in the data, but it is in free variation with /ɘ/; thus, it is not considered a phonemic vowel. For example, JXX produced /tsɘ21tsɘ33/ ‘thing’ [tsɘ21tsɘ33] while another speaker JHS produced this word [tsɛi21tsɛi33] as in the phrase /tsɘ21tsɘ33 ʔu̠21/ ‘borrow something’.
The vowel /ɨ/ has a fricative allophone, the syllabic high central fricative vowel [z̩], after alveolar fricatives and affricates /s, z, ts, tsʰ, dz/, while the high central [ɨ] allophone occurs after bilabial affricates [pɸ, pɸʰ, bβ], the bilabial nasal /m/ and labiodental fricatives /f, v/. Sequences of an alveolar affricate or fricative plus the high front vowel [i] do not occur in Kua’nsi. In the Chinese linguistics tradition, the symbol [ɿ] is often used to represent the fricative vowel (Handel, Reference Handel2015). The fricative vowel has less friction than the voiced alveolar fricative /z/. This is shown in the spectrogram of the word zɨ 55 [zz̩⁵⁵] ‘boat’ in Figure 8. Sometimes the frication is not strong and the vowel seems to be the voiced continuation of the preceding fricative, a similar case to the realisation of the front lingual fricativised /z/ and /z̠/ vowels in Nuosu Yi (Edmondson, Esling & Lama, Reference Edmondson, Esling and Ziwo2017).
Kua’nsi vowels show a contrast between modal (‘lax’) and laryngealised (‘tense’) phonation, which is also widely attested in many other Ngwi languages, for example, Central Lisu (Tabain et al., Reference Tabain, Bradley and Yu2019), Southen Yi (Kuang, Reference Kuang2011) and Nuosu Yi (Edmondson, Esling & Lama, Reference Edmondson, Esling and Ziwo2017), as well as in Bai varieties such as Heqing Bai (Zhao, Reference Zhao2010) and Jianchuan Bai (Edmondson, Esling & Li, Reference Edmondson, Esling and Shaoni2021). In Kua’nsi, each lax vowel has a corresponding tense vowel. The tense vowels are restricted to low and mid tones, while the lax vowels can occur with all tones. Figure 9 shows the waveforms and spectrograms of the CV syllable sequence /tsu/ with a low falling tone /21/ in two different words, /tsu21/ ‘tea’ and /ja21tsu̠21/ ‘loom, weaving machine’. The waveform and spectrogram of the tense vowel /u̠/ in /tsu̠21/ on the right shows aperiodic noise, especially near the end of articulation.
The cross-linguistic difference between tense and lax vowels has been complex to describe phonetically. The investigation of the phonation contrast relies on several phonetic measures, such as H1*–H2* (the difference between the first and the second harmonic, corrected by formant), and CPP (Central Peak Prominence), which have been useful for analysis of other languages. For example, Kuang (Reference Kuang2013: 38) shows that CPP and all H1*-related measures, both H1* itself and its relative strength (H1*–Hn*, H1*–An* measures), contribute to the phonation contrast significantly in three Ngwi languages, Southern Yi, Bo, and Hani. Figures 10 and 11 show the mean CPP and H1*–H2* of /a21/ and /a̠21/ in Kua’nsi at four stages of vowel duration. Both vowels show a similar range of CPP values around 65–80 dB. While the lax vowels show peak CPP values between T2 and T3, the tense vowels show lower CPP values at the same points. The measurements of H1*–H2* seem unable to distinguish the phonation types significantly, although the tense vowels tend to have a lower value of H1*–H2* during T1 and a higher value of H1*–H2* during T2. Since the dataset is limited in the number of tokens for different tense vowels with different tones, further study would be necessary to find out any differences between these vowels in more detail.
In addition, laryngealised phonation also occurs with the nasals, e.g., [n̠̩21, m̠̩21] are also found in the data, but the tense phonation on the nasals is not a phonemic distinction. All the syllabic nasals can be realised as either lax or tense phonation. Figure 12 shows two waveforms and spectrograms of /m̩21pɨ̠21/ ‘comb’. The right spectrogram does not show aperiodic noise on the first syllable /m̩21/, while the left spectrogram shows aperiodic noise on both syllables. This indicates that the syllabic nasals can either be laryngealised or not. This means that the tenseness on the syllabic nasals is possible but variable.
Figure 13 shows the plots of mean Kua’nsi vowel formants F1 and F2 for two speakers. The allophones of /ɨ/, [ɨ] and [z̩], are measured and plotted separately. F1 and F2 were measured at the midpoint. The measurements were taken only from monosyllabic words or the final syllable of polysyllabic words to avoid the effect of the following consonants. This analysis did not control for the place and manner of articulation of the syllable onset.
The vowel /ɘ/ can be realised by four vowels [ɘ, ə, ɯ, ɤ] but mostly [ɘ]. Duan (Reference Duan1998) proposes /ɯ/ and /ɤ/ as two different phonemes in Kua’nsi, but such a phonemic distinction is not found in the dataset presented here. Instead, it is found that /ɘ/ has great allophonic variation. The variation is not related to the place of articulation of preceding consonants, as it is constantly observed in the data that the formants of /ɘ/ in the same word can be of great difference. For example, Figure 14 shows the plots of formants of /ɘ/ in /tʰɘ21/ ‘bucket’ by JXX. These tokens fall into two different groups whose F2 are in different ranges. This suggests that /ɘ/ in these tokens is realised by two different surface vowels, [ɘ] and [ɤ].
Tone
Kua’nsi has three phonemic lexical tones: low-falling /21/, mid-level /33/, and high-level /55/, as exemplified by the minimal set given below, based on the sequence /ta/. The lax vowels can occur with all three tones, while tense vowels can only occur with /21/ and /33/. The f0 trajectories for these tones are shown in Figure 15.
Besides these three tones, there are two other contour tones found in the text data, a high falling tone /51/ and a low rising tone /13/. /51/ occurs when a word-final syllable with /55/ tone is contracted with the following syllable consisting of a single vowel with /21/ tone, for example, xu55 ‘pfv’ + =a21 ‘sfm’ → [xwa51] ‘pfv.sfm’. However, this is not systematic. The outcome of this tonal fusion sometimes will be /21/ or close to /21/, for example, ɲu 55 ‘1sg’ + =a21 ‘cm’ → [ŋwa21] ‘1sg.cm’. The other contour tone found in the data is a rising tone /13/, but it is only found in augmentative particle dwa 13 ∼lwa 13 ‘agmt, very’ and in reduplication of some words.
The tones in loanwords are not stable. The tone of the same word borrowed into Kua’nsi would have various tones among different speakers. Even within the speech of the same speaker, the tones in the loanwords are not consistent. For example, the Chinese word 办法 bànfă ‘method’ can be found in the transcription of the recorded passage. This word is borrowed into Kua’nsi, and its tone varies each time it occurs in the speech. It appears as [pan51fa213] the first time, and the original Mandarin tones are maintained. When the word occurs the second time in the recording, the tone on the second syllable becomes /13/. The tonal change does not indicate any syntactic or semantic function, but rather it reflects the inconsistent adaptation of loanwords. The loanwords from Chinese have been borrowed recently since it became the dominant language in the last few decades. On the other hand, those words borrowed from Heqing Bai, the more dominant local language before the prevalent use of Mandarin Chinese, have adapted the Kua’nsi phonology, and it is hard to distinguish between loanwords from Heqing Bai and native Kua’nsi words only based on the phonology.
Syllable structure
The canonical Kua’nsi syllable consists of an initial consonant, a vowel and a tone. However, there are other possible syllable structures. Minimally, a syllabic nasal with a low-falling tone can be a syllable nucleus and directly form a single syllable. Maximally, two consonants can occur together at the onset of the syllable, which is followed by a vowel associated with a tone. In the textual data, it is found some syllables can be contracted. This results in a new syllable that contains two vowels in the syllable nucleus. For example, tɘ 55 ‘return’ + =a 21 ‘sfm’ → tɘa 21 ‘return.sfm’. All possible syllable structures are listed below, where ‘N’ represents syllabic nasals or vowels. Syllables in Kua’nsi are all open, i.e., there is no syllable coda. The words below exemplify the possible syllable structures in Kua’nsi. Note that sequences like CCVV and CCCVV are not found in the data.
For word-initial syllables, the vowel nucleus does not always require a consonant syllable onset. The glottal stop /ʔ/ can occur word-initially as the syllable onset when there is no other word-initial consonant. There is no contrast between these syllables with or without the glottal stop; for example, both u 33 nu 21 and ʔu 33 nu 21 mean ‘dog’.
Syllables with the structure CVV are only found in text data. This type of syllable is usually formed as a result of syllable contraction. The syllable contraction in Kua’nsi happens when the case marking clitic /=a21/ or sentence-final marking clitic /=a21/, which share the same phonological shape, is attached to a word. When the vowel nucleus in the final syllable of the hosting word is one of the three vowels /ɛ, ɘ, ɨ/, the vowel is reduced to /ɘ/ and the contracted syllable has the structure CVV. More specifically, it will be /Cɘa/. If the vowel in the final syllable of the hosting word is not one of these vowels /ɛ, ɘ, ɨ/, other types of syllable contraction will occur but the structure of the contracted syllable does not produce a new type of syllable structure. See Huang (Reference Huang2024: 41–46) for more details about the syllable contraction in Kua’nsi.
Transcription of the recorded passage
This passage was produced by the female speaker JXX and is transcribed phonemically, using the symbols presented in the consonant and vowel charts unless it is a Chinese loanword which may have a different syllable structure, diphthong and/or tone. Note that many vowels in the word-finally syllables are lengthened. Vowel length is not a phonological contrast in Kua’nsi, and non-final syllables do not show a significant difference in terms of duration. Lengthening usually occurs at the final syllable of a phonological word, typically when speakers are thinking while speaking, listing different items or finishing up the speech.
Acknowledgement
I am grateful to the language consultants for sharing their knowledge of Kua’nsi with me. I would like to thank Prof. David Bradley, Dr. Bethwyn Evans, Dr. Rosey Billington, two anonymous reviewers and the journal editors, Prof. Marija Tabain and Prof. Marc Garellek, whose valuable comments on the earlier versions of this paper have greatly improved the quality of this paper. The funding support from the ARC Centre of Excellence for the Dynamics of Language [Project ID: CE140100041] for my field trips is gratefully acknowledged.
Appendix
Possible CV syllables and CC clusters in Kua’nsi
The table in this appendix shows possible CV syllable and CC clusters in Kua’nsi. The blank cells indicate the CV syllable is possible and attested in the data. Grey-shaded cells filled with a dash indicate the sequence is not allowed in Kua’nsi. Grey-shaded cells filled with a question mark mean that such a sequence is possible but is not confirmed in the data, given that a similar CV combination is found in which other consonants of the same place and manner of articulation occur. Black-shaded cells indicate the overlap of the approximants /j, w, ɥ/.