Academia.edu no longer supports Internet Explorer.
To browse Academia.edu and the wider internet faster and more securely, please take a few seconds to upgrade your browser.
2015, The Linguistic Review
…
27 pages
1 file
We argue that the cross-linguistic distribution of vowel systems is best accounted for by grammar-external forces of learnability operating in tandem with cognitive constraints on phonological computation, as argued for other phonological phenomena by
2nd North American Phonology Conference, …, 2002
Linguistics, 1984
It is frequently supposed that there is a balance, or correlation, between the number of vowels and the number of consonants in the languages of the world. We test this hypothesized balance in a genetically stratified sample of languages and show that there is no evidence for it. In fact, the sizes of vowel and consonant inventories are statistically independent; the statistical observations previously offered as evidence for balance have no linguistic significance but are artefacts of the definitions of the variables chosen for measurement. We also show that the probability distribution that describes both vowel and consonant inventories is the log-normal and that the lognormal distribution receives straightforward linguistic interpretation in terms of distinctive features. We discuss a number of implications for the cross-linguistic exploitation of phonetic features as a determinant of the segment-level statistical distributions of vowels and consonants.
Proceedings of the 12th International Conference on the Evolution of Language (Evolang12)
Since the seminal work of , a key testing ground for functional, evolutionary, or emergentist approaches to sound systems has been the typology of vowel inventories (for example, .
Perception & Psychophysics, 2008
Consonants and vowels may play different roles during language processing, consonants being preferentially involved in lexical processing, and vowels tending to mark syntactic constituency through prosodic cues. In support of this view, artificial language learning studies have demonstrated that consonants (C) support statistical computations, while vowels (V) allow certain structural generalizations. Nevertheless, these asymmetries could be mere byproducts of lower-level acoustic differences between Cs and Vs, in particular the energy they carry, and thus their relative salience. Here we address this issue and show that vowels remain the preferred targets for generalizations even when consonants are made highly salient or vowels barely audible. Participants listened to speech streams of nonsense CVCVCV words in which consonants followed a simple ABA structure. Participants failed to generalize this structure over sonorant consonants (Experiment 1), even when vowel duration was reduced to one third of that of consonants (Experiment 2). When vowels were eliminated from the stream, participants showed only a marginal evidence of generalizations (Experiment 4). In contrast, participants readily generalized the structure over barely audible vowels (Experiment 3). These results show that different roles of consonants and vowels cannot be readily reduced to acoustical and perceptual differences between these phonetic categories.
It is frequently supposed that there is a balance, or correlation, between the number of vowels and the number of consonants in the languages of the world. We test this hypothesized balance in a genetically stratified sample of languages and show that there is no evidence for it. In fact, the sizes of vowel and consonant inventories are statistically independent; the statistical observations previously offered as evidence for balance have no linguistic significance but are artefacts of the definitions of the variables chosen for measurement. We also show that the probability distribution that describes both vowel and consonant inventories is the log-normal and that the lognormal distribution receives straightforward linguistic interpretation in terms of distinctive features. We discuss a number of implications for the cross-linguistic exploitation of phonetic features as a determinant of the segment-level statistical distributions of vowels and consonants.
1979
Thus; whereas the .American .child everttually overcomes her substi-_tut.ion of Co.:! for C'PJ, the Nupe chiid confronts no foJ in his mother tongue, and the (Q.J-substitution manifests, itself wheen, in the inf'lex:.;. ibility of ~dulthood, be enc_ounters (:,J in :a foreign vord. ') If one could collect all the_sound-subst;ituticins of children, the dfacbr_onist Grammont (1965} _said, one would hf.I.Ye a. sort of grammar of all the possible sound-changes. The synchronist Baudouin de Courtenay {1895} would have added, also all the possible phonological alt_erna.-tic>ns. And, as the panchronist Jakobson argued in his monumental Child Lan@ase, Aphasia 1 and Phonological Universals (1968}, _also all the• • possible phoneme systems. Such systems are determined, a.s Greenberg (1966) concluded in his study of phonological universals, by the collective effect of phonological processes (particularly the 'unconditioned' processes vhich, since they affect a. sound)n all its oc,currences, thereby affect the inventory as well a.s the distribution of phonemes). But, pace Greenberg, such processes are not mere historical events: they are the living expression of the phonetic capacity of the individual. This dissertation presents my explorations in the world of vowels. Although I have occasion to mention aesimilatory processes, my focus is on those processes traditionall.y labeled 'unconditioned I or 1 sponta.-' neous 1-the ones pessimistically cWed unexplainable because they apply to segments regardless of their contexts (or worse, in dissimila:.;. tion, despite their contexts), and because, as is implied by their classification as 'strengthening' processes, they seem to defy the law of least effort. These are processes which, in the earliest speech of a child, ca.n scramble all his vowels into one, and yet can, in a language like Faroese, Juggle two dozen vowels with hardly a merger. A full account of the nature, operation, and causality of these processes is not in sight, Thorough and useful descriptions are unavailable for many languages: the brief phonemic sketches of vowel systems that appear in many surveys and in some monographs often fail to provide sufficient phonetic information on vowel quality; and conversely, some descriptions vhich do provide such phonetic information lack the phonological data that is crucial to the sort of analysis that the study of processes requires. For many little-know languages, historical studies, which provide much usef'ul data, are unavailable-and indeed impossible, given linguists' current knowledge of such languages. ' (Corresponding to this difference in the nature of processes and rules is a difference in the order of application in speech proce~sing: processes apply after the applications of secret-1anguage rules, and after unintentional slips of the tongue (Stampe 1973a:45; Donegan and Stampe 1978a, Sec. 2. 5). • (1,7) For example, the process that palatalizes [kJ to [cJ before a palatal vowel applies after the secret-language 9 Another way :irt ~hich children: •sometimes limit the surface effects of a process is by constrairting its natural iterative application (Stampe 1973,: 59-68). The fUnction of each process is to Efubstitute •. a'iess difficult class of sounds or sound-sequences for a ~ore difficult cla.ss~ But once a process has applied~ sveeping away~. as it vere, a.• certain class 'or difficulties, another process may (subsequently or simultaneously), in removing some entirely different difficulty, create new me~ rs of the very class the fir~t process got rid of. /' (1.14) For example, for a child who substitutes zero for CJJ, this CJJ-deletion eliminates a difficult segment. But suppose another process-delateralization, as in (1,10)-simultaneously substitutes CJJ !or CIJ. Unless the first process is alloved to apply again, the child vill have to produce CJJ's-for ti l's. If each process is to accomplish its function on the surface forms (the forms that are actually pronounced), then the (JJ-to-¢ process should apply again, after the CIJ-to-[JJ process, and again after any other process that creates [JJ's. Such absence of ordering restrictions-unconstrained iteration-is the natural state of process application. But there is a catch to this free-handed elimination of difficulties: the processes thus applied merge, in actual pronunciation, the distinction between between /1/ and /j/ in the child's underlying representations (corresponding to adult tlJ and CjJ); both become zero, so that e.g. less and yes would both be pronounced [€SJ. One way for the child to maintain a distinction without having to suppress either process is to restrict the iteration of [JJ-to-¢,.so that it may not apply again after [IJ-to-(JJ. Thus the child says [€SJ for~ but [JESJ for less-not the underlying or adult distinction, to be sure, but a distinction nevertheless. This seems to be the situation which holds vith children who initially substitute zero for both (jJ and ti] but who later produce zero for [JJ and (JJ for.ti] (cf, Jakobson 1968:15, and Donegan and Stampe 1978a, Sec. 3.3), Such apparently paradoxical sets of substitutions, described by Jakobson as 'sound shifts', may persist into adulthood. Thus, such constraints may account for some of the peculiar situations in language in which a speaker cannot pronounce a segm~nt or sequence when he tries to produce it, but produces that very segment or sequence when he is trying to pronounce something else. (1.15) For exe.mple, many speakers of English find it difficult or impos'sible to produce the seq'l,lence ta.9J in phrases like How now, brown cow? or in bo!rowed or foreign words like He.usfrau or Laut (cf, (1.6)). Instead they substitute Cais!J quite automatically. But many or the same speakers in whom the ta.2J-to-(6!12J process is active also make a substitution which produces the phonetic sequence Ca2J: de.rk, syllable-final C:lJ optionally becomes tyJ"'[S?J so that doll, Sol, etc. a.re pronounced (dC1.2J, Isa.2J, etc.
2018
When we speak, we do not use sound sequences arbitrarily. For instance, no English word ends in the sequence /mb/, while many words end in /ks/ (like in box), yet we do not find any words that begin with /ks/. The study of the rules and tendencies that determine which sound sequences are permitted, ruled out, or preferred in a language is called 'phonotactics'. Some of the constraints on the phonotactic setup of natural languages reflect weak biases in processing, whose effects accumulate when languages are transmitted in vast numbers of parallel and iterated acquisition and interaction processes, and as a consequence become visible in language history. Thus, history provides evidence not only of articulatory and auditory constraints on language production and perception, but also on cognitive constraints on the processing of language. In this dissertation project, I study how sound sequences like /mb/ and /ks/ replicate and spread through languages and populations of speake...
Rutgers Optimality Archive, 2004
This paper gives an Optimality-Theoretic formalization of several aspects of the acquisition of phonological perception in a second language. The subject matter will be the acquisition of the Spanish vowel system by Dutch learners of Spanish, as evidenced in a listening experiment. Since an explanation of the learners' acquisition path requires knowledge of both the Dutch and the Spanish vowel system, the 12 Dutch and 5 Spanish vowels are presented in Figure 1. Along the vertical axis we find the auditory correlate of perceptual vowel height (first formant, F1), and along the horizontal axis the auditory correlate of perceptual vowel backness (second formant, F2), whose articulatory correlates are tongue backness and lip rounding. A third auditory dimension, duration, is implicit in the length sign ("!") used for 4 of the 12 Dutch vowels.
Linguistic inquiry (Online), 2012
A recent computational model by Hayes and Wilson (2008) seemingly captures a diverse range of phonotactic phenomena without variables, contrasting with the presumptions of many formal theories. Here, we examine the plausibility of this approach by comparing generalizations of identity restrictions by this architecture and human learners. Whereas humans generalize identity restrictions broadly, to both native and non-native phonemes, the original model and several related variants failed to generalize to non-native phonemes. In contrast, a revised model equipped with variables more closely matches human behavior. These findings suggest that, like syntax, phonological grammars are endowed with algebraic relations among variables that support across-the-board generalizations.
Introduction
"That 'unnatural' systems, with attributes that in no way follow from the conflicting forces of 'ease of articulation' and 'maintaining contrasts', exist is, I hope, obvious to anyone who has examined the matter in any reasonable detail. Such 'unnatural' (but computationally tractable) systems reveal that the computational system of human phonology is more than the interaction of these two forces." (Hale 1997) Phonologists since at least Jakobson and Trubetzkoy (e.g., Trubetzkoy 1939) have been concerned with why languages display their particular array of sound patterns. The central questions raised in treatments of the distribution of vowel systems in particular are (i) why vowels fall into the particular zones of the perceptual/ articulatory space that they do, (ii) why some sounds are more common in vowel inventories, and (iii) why 'unnatural' vowel systems exist, such as the vertical ones of the sort we find in Abkhaz (Hewitt 1979) and Marshallese (Choi 1992).
The theories in (1) have been proposed to answer these questions concerning vowel inventories:
(1) Theories of vowel inventory typology i. Quantal Theory prioritizes perceptual invariance: the point vowels {a i u} are typologically preferred because they represent regions of the articulatory space in which variability in production has relatively small acoustic impact (Stevens 1972, Stevens 1989;Lang and Ohala 1996). ii. Traditional markedness theory derives its hierarchy largely from phonological behavior and cross-linguistic distribution, or by counting features. iii. Dispersion Theory (DT) maintains that the structure of inventories derives from functional principles such as the minimization of effort and maximization of contrast.
Our focus here will be on Dispersion Theory (1.iii). As has been noted in the literature, Quantal Theory (1.i) makes several incorrect predictions. First of all, the fourth most common vowel, [ε], is not stable as predicted by the theory (Carré 1996). Second, Livijn's (2000) survey of 28 differently-sized inventories showed no evidence for acoustically-favored hotspots, contrary to the core prediction of Quantal Theory. Third, vertical systems such as those of Abkhaz, Kabardian, and Marshallese are missed by Quantal Theory, which predicts that such small vowel systems should select their members from the quantal set {a i u}. Finally, Quantal Theory wrongly predicts that the point vowels should be invariant crosslinguistically (Disner 1983).
Traditional markedness theory (1.ii) has faced the objection that it merely formalizes the attested facts, rather than explaining them in terms of constraints on human articulation, perception, or processing. Consequently, Dispersion Theory has gained support as it incorporates these principles. One of the early dispersion-based approaches was Lindblom's purely computational phonetic theory of Adaptive Dispersion (Liljencrants and Lindblom 1972;Lindblom 1990, etc.), followed by Flemming's (1995) implementation of dispersion in Optimality Theory (referred to here as OT-DT; see (2.v) for references). Some advocates of dispersion limit its application to the diachronic axis; Lindblom and Maddieson (1988: 72) for example maintain that "consonant inventories tend to evolve so as to achieve maximal perceptual distinctiveness at minimum articulatory cost." OT-DT, however, incorporates the mechanisms of dispersion directly into synchronic phonological systems, as we shall see.
Here we present a number of empirical and theoretical arguments that Dispersion Theory cannot adequately describe the range of attested vowel systems, and we present an alternative based on Evolutionary Phonology (Blevins 2004). The main focus of our critique will be on OT-DT, since it attempts to account for phonological facts as well as static phonetic distributions. As McMahon (2000) and Jansen (2002) have pointed out, linguistic phenomena that incorporate products of history, including the vocalic patterns we discuss here, highlight the difficulties with universalist theories like synchronically-implemented DT. Our overarching point is therefore that the structure of vowel systems is largely an evolutionary and historical problem, not a synchronic one. Less is more, in this case: a more permissive theory than DT is required to account for the rich variation in vowel systems seen cross-linguistically.
Early dispersion theories
We begin by summarizing the range of forms that DT has taken, dating back to the pre-generative era. There are six major incarnations of DT in the literature, summarized in (2).
(2) Incarnations of Dispersion Theory i. Structuralist functionalism (Jakobson 1941;Martinet 1955): vowels are maximally dispersed. ii. Theory of Adaptive Dispersion: maximal dispersion (Liljencrants and Lindblom 1972), sufficient dispersion (i.e., articulatory economy balanced with perceptual distinctiveness; Lindblom 1975, Lindblom 1986, Lindblom 1990, Lindblom 1992, Lindblom 2000;Kawasaki 1982;Lindblom and Maddieson 1988;Carré et al. 1994;Yang 1996;Traunmüller 1998;Cho et al. 2000, and many others). iii. Lexically-based Dispersion: distance is proportional to the functional (¼ lexical) load borne by the contrast (ten Bosch 1991, Bosch 1995;cf. van Son et al. 1998). iv. Focalization Theory (a combination of Quantal Theory and Dispersion): Distinctive Regions Model (Mrayati et al. 1988;Carré andMrayati 1992, Carré andMrayati 1995;Carré and Mody 1997;Carré 1996); Dispersion-Focalization Theory (prefer vowels showing a convergence between two formants; Vallée 1994;Schwartz et al. 1997a, Schwartz et al. 1997b, Schwartz et al. 1999). v. Optimal Dispersion Theory (Flemming 1995, Flemming 1996, Flemming 2004;Ní Chiosáin and Padgett 1997;Boersma 1997;Riggle 1999;Ahn 2000, Ahn 2002a, Ahn 2002b;Harris and Lindsey 2000;Sanders 2002;Padgett 2004). vi. Self-organizing systems theory: eschews linguistic rules and constraints (de Boer 2001).
All variants of DT consider maximization of distinctiveness to be the main method by which languages select phonemic distinctions. The elements involved in this dispersion process have been claimed to include maximal contrast in the articulatory space (Jakobson 1941), minimization of effort (Lindblom 1975foot_2 ),foot_3 focalization (Stevens 1972;Schwartz et al. 1997b;Roark 2001), maximal overall contrast compensating for F 0 (Ryalls and Lieberman 1982;Diehl et al. 1996;Traunmüller 1998), lexical frequency (ten Bosch 1995), and formant normalization (Roark 2001). The plots in (3) show the basic idea of how vowels in three-and five-member systems can be maximally dispersed in a given acoustic space according to their F 1 and F 2 values.
(3) Maximal dispersion of 3-and 5-vowel systems in a given acoustic space acoustic space controllable by humans
The first major treatment of dispersion in the generative era was proposed by Liljencrants and Lindblom (1972), who used a computer model designed to maximize the distance between a given number of vowels within a formant space based on a theoretical construct from Lindblom and Sundberg (1971) merging F 2 and F 3 into an "effective second formant" conveying both backness and rounding. Perceptual effects were simulated using mels. This computer model was used to predict F 1 and F 2 /F 3 values, which were assigned to corresponding phonetic categories and compared to known typology. This procedure yielded what the authors deemed "approximately correct" results for systems of three to six vowels. This early model and its Adaptive Dispersion (AD) relatives fell short of predicting the attested range of vowel systems, for both small and large inventories. Overall, and perhaps most importantly, human languages permit more variation than the model predicts: it only accurately generates a small number of the attested inventories of languages with fewer than six vowels ( de Boer 2001), and it is limited to generating systems of 3-9 vowels. Secondly, it does not predict the most common four-vowel configuration (Roark 2001: 2). Roark's (2001) version of AD, which incorporates quantality and normalization, correctly predicts the four-vowel system but does not get the facts right for six-and sevenvowel systems. For systems with six or more vowels, AD wrongly predicts that inventories should commonly include an increasing number of high central vowels, contrary to fact, and in general it models the behavior of central vowels poorly. As Vallée et al. (2001) point out, AD also fails to generate the /i y u/ series within the high vowel set.
The unattested proliferation of vowels between /i/ and /u/ generated by AD in larger systems includes four contrasting high vowels in systems with as few as seven vowels. However, attested languages never contrast more than three degrees of backness (Lindau 1975: 13). There are moreover significant differences between the high vowels of real systems and the theory's predictions. Figure ( 4) exemplifies for seven-vowel systems how the inventory predicted by Lindblom (1990) is quite unlike the two most commonly attested inventories. The AD-predicted inventory is asymmetric and disorganized from a featural perspective, as Boersma (1997) has noted: the reason is that the distance function in the model favors a height asymmetry between front and back vowels, because an F 1 difference always contributes positively to the perceptual difference between a pair of vowels.
(4) Predicted and actual 7-vowel systems a.
Lindblom's predicted optimal system (1990
The most common 7-vowel systems according to Crothers (1978) i
Although in this case (4a) AD predicts a system that is too asymmetric, a certain degree of asymmetry may not be undesirable in general. Disner (1983) points out that symmetrical systems are the exception rather than the rule, and indeed, the systems predicted by DT tend to be less clustered than those found in nature. Swedish, for example, clusters vowels in the high front region. Disner further points out that AD "predicts that the vowels in one nine-vowel system ought to be phonetically similar to the vowels in another nine-vowel system. Yet not even such closely related vowel systems as those of Norwegian and Swedish are correctly predicted by the theory" (1983: 115).
Dispersion-Focalization Theory (DFT) attempts to address some of these shortcomings of Dispersion Theory by incorporating elements of Quantal Theory. The dispersion component of DFT proposes that sounds are dispersed so as to maximize the perceptual distance between them, and the focalization component (drawing on Quantal Theory) states that vowels which show a convergence between two formants are preferred. The general procedure in DFT is to associate an energy function to each system. The optimal system has the lowest energy function, either locally (in which case it is a stable system) or globally (in which case it is the best system).
While DFT is useful in explaining the appearance of the high round vowel /y/, it does not correctly generate mid front round vowels (Carré 1996). Like AD, it generates too many high vowels and not enough central vowels in larger systems (Carré 1996), and it is also limited to making predictions for three-to nine-vowel systems (Vallée et al. 2001).
Despite the surface differences between these variants of DT, they suffer from many of the same problems. First, there are many cases like that of Navajo, described by McDonough et al. (1993), which has a vowel inventory consisting of the unbalanced set {i e ae o}. Note that these vowels are not equidistant, thus contradicting the prediction of AD -/i/ and /e/ are too close together. DT in general fails to predict or account for the fact that /u/ shows more variance than the other vowels. McDonough et al. propose in response that the upper back corner of the vowel triangle may be less "sharp" or perhaps high back vowels are less defined. This implies that, while /i/ and /ae/ may be considered to be quantal, there is no obvious high back quantal vowel.
In summary, the forms of Dispersion Theory presented thus far have four significant shortcomings: (i) they cannot or do not generate sub-optimal systems, though such systems are widely attested; (ii) they do not perform well with central vowels; (iii) they work only for three-to nine-vowel systems; and (iv) they make particularly poor predictions regarding systems of six or more vowels. Flemming (1995Flemming ( , 1996Flemming ( , 2004) ) addresses the problems just mentioned and incorporates dispersion effects into OT, thereby using independently-required phonological machinery (i.e., ranked constraints on surface representations) to produce a theory of vowel systems.
The functional principles that drive OT-DT are the following (Flemming 1996, Flemming 2004): maximize the distinctiveness of contrasts (using the MINDIST family of constraints), minimize articulatory effort (*EFFORT), and maximize the number of contrasts (MAXCONTRAST). Markedness is treated as being paradigmatic (i.e., a property of contrasts), not syntagmatic (a property of features), which is claimed to account for vertical systems as well as predict the quality of epenthetic vowels. Also, unlike typical OT analyses of other phonological phenomena, inputs and outputs are sets of words rather than individual words. Figure ( 5) illustrates the possible interactions of these principles, formalized as MINDIST, *EFFORT, and MAXCONTRAST ("#").
(5) Interaction of MINDIST, *EFFORT, and MAXCONTRAST (Flemming 1996) Before we delve into empirical matters, it should be pointed out that these constraints are controversial at a basic level. McCarthy (2005: 40) argues that there is a fundamental difficulty with constraints like MAXCONTRAST and MINDIST, which represent "more of an intuition than a usable phonological principle. . . . Injunctions like 'minimize the differences' or 'be as metrically consistent as possible' are too ill-defined to serve as constraints." While such principles may have heuristic value, the difficulty in formulating them specifically enough to be incorporated fruitfully into EVAL must be appreciated. For example, Lang and Ohala (1996) note that it is difficult to evaluate the hypothesis that consonant inventories tend to evolve so as to achieve maximal perceptual distinctiveness at minimum articulatory cost (Lindblom and Maddieson 1988), due to uncertainty over how to measure articulatory cost. In the remainder of this section, we set these issues aside to focus on the predictions made by OT-DT.
Height systems
Let us examine a key case for DT, namely that of vertical vowel systems. Vertical systems do not employ an underlying [back] contrast, but instead oppose height among a series of (underlyingly) central or perhaps underspecified vowels. One example is provided by Kabardian, which has an underlying inventory consisting of the two short vowels {ə, a}; we will discuss the surface manifestations of these phonemes later in the text. In the case of small, vertical vowel systems like this, there is a difference between on the one hand Quantal Theory and traditional markedness theory, which predict that a subset of the cardinal vowels / ɑ i u ae/ should always be preferred in these systems regardless of size, and on the other hand DT, which does not make this prediction. Some languages putatively lacking back-front vowel contrasts in their underlying inventories include (according to Flemming 2004: 251) Marshallese (Bender 1968;Choi 1992), the Northwest Caucasian languages Kabardian (Colarusso 1988, Colarusso 1992), Shapsug (Smeets 1984), Ubykh (Colarusso 1988), and Abkhaz (Hewitt 1979;Vaux and Psiypa 1997), and some Ndu languages of Papua New Guinea, such as Iatmul (Laycock 1965;Staalsen 1966).foot_4 Flemming (1995Flemming ( , 2004) ) and Ní Chiosáin and Padgett (1997) observe that universal markedness rankings of the sort *ɨ >> *u >> *i, which have been postulated to account for the implicational hierarchies of vowel systems containing three or more vowels, incorrectly predict that a language with only one high vowel, such as Abkhaz, should select [i]. However, attested languages of this type choose [ɨ] rather than [i]. On the basis of this fact, Flemming and Ní Chiosáin & Padgett argue for a universal family of CONTRAST constraints requiring that segments maintain at least a specified perceptual distance. To account for the high vowel case, Ní Chiosáin and Padgett (1997: 22) postulate a constraint CONTRAST(COLOUR), which states "maintain a colour contrast with sufficient perceptual distance." Minimally sufficient distance is defined in this case as the perceptual distance between [i] and [u] on the scale i ↔ ɨ ↔ u. Given this scale, the pairs {i ɨ} and {ɨ u} violate CONTRAST(COLOUR), because the perceptual distance between the two members of each of these sets is smaller than the distance between [i] and [u]. By varying the ranking of CONTRAST(COLOUR) relative to the universal hierarchy of segmental markedness constraints, systems with one or two high vowels can be obtained.
An issue with the OT-DT system of constraints with respect to height systems is that the constraints described above must necessarily hold over surface representations, not underlying representations, as Flemming and Ní Chiosáin & Padgett explicitly note. 4 This is a logical requirement given that perceptual distance is a coherent notion only when applied to concrete surface forms with specific acoustic properties, and in an OT context, Richness of the Base prohibits constraints on underlying representations. However, the languages typically cited as having only two vocalic phonemes, Abkhaz and Kabardian, actually have rich surface vowel inventories: Abkhaz has 13 vocalic allophones (Hewitt 1979), while Kabardian has 22 (Halle 1970;Colarusso 1989). It is only at the most abstract level of phonemic analysis that these languages can be said to possess just two vocalic phonemes. In fact, this is true for all of the reduced-inventory languages that have been cited in the literature: the underlying inventory is limited, but the inventory of surface allophones is quite large (see Choi 1992; Hale and Reiss 2008: 150ff on Marshallese). Since OT-DT countenances only surface phonetic representations, this abstract level at which only two vowels exist is neither available nor relevant.
With respect to the surface inventories of these languages, the contextual nuances of vowel color that adorn the underlyingly two-way contrast can be explained as linear interpolations between the F 2 values of the preceding and following consonants, as argued by Choi (1992) for Marshallese. Flemming (2004) outlines a simplified version of such a system in OT-DT, using a simplified version of Kabardian (call it K´) based on the generalizations made by Anderson (1978) and Choi (1992). In K´, front vowels appear after palatalized consonants (C j ) and central vowels appear after labials and coronals. On the basis of these generalizations Flemming draws up the simplified phonetic scale in (6), with the gradient F 2 (≈ backness) dimension discretized into six values: 4 Flemming's theory of phonology more generally contains only surface phonetic representations, and no abstract underlying representations. This creates a number of additional problems. Underlying representations are needed to capture the relations between categories in different idiolects (e.g., to identify a man's /a/ as belonging to the same phonemic category as a woman's /a/, even though the two occupy different phonetic spaces) and to correctly generate effects referring to neutralized contrasts, such as final devoicing. Moreover, surface-only models ignore the ample phonological and psycholinguistic evidence for abstract underlying and intermediate phonological representations (see Hyman 1970;Campbell 1986;Baumann 1996;Pallier et al. 1999;Hallé et al. 2000;Itô and Mester 2003 inter alia, as well as the discussion in Section 3.2). ( 6)
The inventory of surface CVC syllables in K´can then be derived via the tableau in ( 7), provided that one considers only extra-short vowels, secondary articulations of consonants, and F 2 contrasts among vowels. The basic idea in ( 7) is that backness contrasts separated by three or fewer columns in the chart in ( 6) are assigned a violation of the perceptual dispersion constraint MINDIST¼F 2 :4, but violations of this constraint are trumped by violations of the articulatory effort constraint *HIGHEFFORT, which penalizes sequences of a consonant followed by a vowel not drawn from its column in (6). In other words, any CV sequence other than C j i, C j e, C ɤ ɯ, or C ɤ o violates this constraint. MAXCONTRAST assigns a single asterisk for each member of a system, so for example the candidate system in (7d) receives two asterisks by virtue of containing two members, and the number of asterisks is potentially infinite. For reasons discussed by Karttunen (1998) and McCarthy (2003), gradient constraints of this sort are computationally intractable in a finite-state system; space considerations prevent us from considering this issue further here.
(7) Selection of a CVC inventory for K´ (Flemming 2004)
A further problem with Flemming's analysis of K´is that in real Kabardian, plain laryngeals are followed by back vowels (Colarusso 1992), not central vowels as predicted by (7). Furthermore, both Abkhaz and Kabardian contrast front and back vowels in loanwords, which freely contain {a e i o u}. One could propose a different ranking of constraints for the loanword stratum in which most tokens of these vowels occur, but then the vowel distribution would not be synchronically derivable from just the inventory of surface vowelsand with no underlying representations in OT-DT, surface vowels are all one has to work with. On top of this, Abkhaz contrasts front [ej] and back [aj] in native words (Hewitt 1979).
Elaborating on Choi's interpolation theory, Flemming (2004) states that "backness and rounding are governed by minimization of effort. This means that they are realized as smooth transitions between preceding and following consonants, which frequently results in central or centralized vowel qualities." Deriving centralization via smooth articulatory transitions is not the same as generating central vowels from acoustic dispersion constraints, though. The restriction imposed on very short vowels by *HIGHEFFORT also forces these "smooth transitions" of tongue body and lip positions between the two flanking consonants, but this should mean that a vowel between two plain labials should be round and central (with the tongue in the default rest state, since the consonants specify no tongue position), rather than non-round and high. OT-DT essentially predicts systems like the Kabardian onemodulo the laryngeal/ back vowel problem mentioned earlierwhere the vowels receive all features except height from neighboring consonants. Such a system should never produce central vowels except perhaps with labials; coronals should yield front vowels, dorsals should yield back vowels, and so on. The Abkhaz system is therefore a problem, as Flemming rightly notes, because there is no coloring by plain consonants. Moreover, if an unspecified vowel occurs between two noncoloring consonants (e.g., [h]), why does it not surface as a copy of the preceding and following vowels, especially if they are identical?
The length of vowels in Abkhaz provides another challenge. Flemming (2004) ties reduction of contrast to shortness, saying "a given limit on effort effectively makes a smaller portion of the vowel space available in a short vowel than in a long vowel," which wrongly predicts that languages should always have more contrasts in long vowels than in short vowels. Because of historical changes, the opposite is often true, as in the Shamaxi dialect of Armenian (Vaux 1998). Harris and Lindsey (2000) also observe that vowel neutralization is not necessarily linked to stress or length; Chumash for example has a 6-vowel contrast in roots but a 3-vowel contrast in suffixes. This is independent of stress, which in Chumash is typically penultimate. The same holds for Turkish and many other familiar cases. The effects of vowel shortening also produce unpredictable results which can go against Flemming's claim: only short front vowels centralize in Nawuri (Casali 1995) and shortening significantly affects only F 2 (i.e., backness, not height) in Chickasaw (Gordon et al. 2000).
Finally, as we saw in the previous section, OT-DT predicts that two-vowel systems without a phonological [back] contrast should not have to surface with central vowels; ranking MINDIST above *EFFORT should produce a diagonal system of {a u} or {i ɑ}, invalidating the OT-DT claim to a correct prediction of centralization in two-vowel systems. Flemming (1996) includes the possibility of an {i ɑ} system (cf. our figure (5a), reproduced here as (8a)), but the problem that this poses with respect to predicting two-vowels systems is not noted. It should also be mentioned that the same constraints could generate {a u}, as in (8b).
(8) Two-vowel systems in OT-DT (Flemming 1996) A diagonal system in effect involves enhancement (Stevens and Keyser 1989), which Flemming (2004) allows for other contrasts such as voicing. It is not clear in this system why enhancement would be invoked in the preferred 3-vowel system, {a i u}, but not in the preferred vertical 2-vowel system. It is important to notice that OT-DT does not actually predict that 2-vowel systems will contain central vowels; free ranking of the relevant dispersion constraints yields both vertical and diagonal systems, and even horizontal systems. Many linguists assume horizontal systems to be impossible, and have proposed a variety of patches to rule out such systems and answer the related question of whether distinctness of F 2 universally has precedence over distinctness of F 1 . Ten Bosch (1991) achieves such an F 1 /F 2 asymmetry by stipulation, assigning a value of 0.3 to the importance of F 2 distance vs. F 1 distance; proponents of DFT similarly use a weighting of 0.25. Traunmüller (1998: 4) suggests that speakers are more concerned with contrast in F 1 than in F 2 because F 2 is less sensitive to variations in F 0 than F 1 is. Lindblom attributes the disparity to proprioception: jaw height can be felt more accurately than tongue-body position. Boersma (1997: 14) asserts that F 1 is more important because it is louder: "the second spectral peak has a larger chance of drowning in the background noise." Until one can show that a horizontal system is not learnable by children, though, the hypotheses just reviewed here are purely speculative. The important points are that the factorial typology yielded by OT-DT both overgenerates and misses the target with respect to the attested two-vowel systems, the surface orientation of the system is also problematic given the dramatic disparity between underlying phonemes and surface phones, and on the implementational level, there are considerable problems with formulating a number of the constraints that are crucial to the system.
Faithfulness problems in OT-DT
In the previous section we were chiefly concerned with height systems, since DT models differ from Quantal Theory and traditional markedness theory in the predictions they make regarding such inventories. We now move on to evaluating OT-DT on matters of faithfulness and markedness more generally. Faithfulness poses a challenge for OT-DT since, as Flemming correctly observes,
"it is not possible to combine dispersion constraints with the faithfulness-based account of allomorphic similarity [,] because the two are fundamentally incompatible . . . The inclusion of faithfulness constraints subverts the intended effect of the MINDIST and MAXIMIZE CONTRASTS constraints, because it makes the selected inventory of vowel height contrasts dependent on the input height under consideration." (Flemming 2004) As a result, OT-DT uses surface-driven Output-Output correspondence. While this remains a contentious issue, as we discussed in the previous section (see also Hale et al. 1998;Kiparsky 2000;McCarthy 2007), a phonological theory that includes only comparison of output forms without any consideration of underlying or intermediate representations cannot adequately account for a variety of generalizations (e.g., opacity). Boersma argues further that "inventory grammars . . . do not explain how a random input is filtered into a well-formed utterance . . . Flemming's global inventory evaluation procedure is not a model of grammar; it just shows that inventories can be described with a strict ranking of principles" (1997: 16).
Markedness problems in OT-DT
The OT-DT model has a conceptual problem with markedness as well as faithfulness, arising from its assumption that markedness is based on paradigmatic contrast rather than intrinsic properties of segments. Flemming (2004) claims that "there is no analogous notion of effort involved in perceiving a soundperceptual difficulties don't arise because particular speech sounds tax the auditory system, the difficulty arises in categorizing sounds." However, there are many instances where auditory difficulties are obviously involvedvery complex sounds, quiet sounds, and vowels that are fully or partially devoiced, for example. As Rice (1999) notes, and as should already be clear from previous sections, the number of attested systems that are unexpected in DT is vast. Rice summarizes the issue as follows:
There is a simplicity to this theory which does not always meet with phonetic facts. . . . [O]ne would expect maximal differentiation of contrasts in phonetic terms. However, this is often not the case. For instance, some languages have a three-way place contrast [along the F 2 dimension] between i-ü-u rather than i-ɨ-u (e.g. Rukai; Austro-Tai 417); some have a two-way contrast between i-ɯ (Adzera, Austro-Tai 419; Japanese, Ural-Altaic 71) rather than the expected i-u; some have a single high vowel which is front rather than central (e.g. Navajo, Athapaskan; Klamath, Penutian). These facts call into question the claim that articulatory simplicity governs in the absence of contrast and maximal distance in the presence of contrast." (Rice 1999: 414) One reason for this, as Rice notes, is that markedness can vary even among identical systems. This is incompatible with the paradigmatic notion of markedness employed by OT-DT, tied as it is to the existence of particular contrasts.
As mentioned earlier, one claimed advantage of OT-DT is that it can predict the quality of epenthetic segments in a particular inventory by "minimization of effort or other contextual markedness constraints" (Flemming 2004). As Rice (1999) notes, though, it is not necessarily true that articulatory concerns win out in positions of reduced contrast, such as epenthetic contexts. Vaux and Samuels (2003) and Blevins (2008) discuss a wide variety of epenthetic segments crosslinguistically, and though their focus is on consonants, their overarching point remains valid: epenthetic segment choice is highly idiosyncratic and often owes to reanalysis of historical deletion patterns. For example, the arbitrariness of epenthetic segments is illustrated by the difference between Standard Armenian and the Muslim Hamshen dialect, which have identical phonemic inventories but epenthesize different vowels: Standard Armenian selects [ə], whereas Hamshen selects [ɛ] (Vaux 1998). In short, the choice of epenthetic vowel is not predictable from the vowel inventory. 5 OT-DT also suffers because its relational notion of markedness rules out traditional markedness constraints like *ə. How then can OT-DT generate vowel systems with gaps, or choose between {a i u} and {a i o} for example? Instead of traditional markedness constraints, Flemming (2004) appeals to "constraints favoring less confusable contrasts over more confusable contrasts." Innate constraints for this purpose seem superfluous given that the principles they embody are well-motivated properties of general cognition independent of any encoding in the grammar (cf. Boersma 1997;Ohala 2005).
5 Dupoux et al. (2011) note that the epenthetic vowel in Japanese is /u/, which is the shortest vowel in the language, while the epenthetic vowel in Brazilian Portuguese is /i/, which is the shortest vowel in that language. However, the identity of the shortest vowel in a language is itself not predictable from the inventory.
We have seen so far that pre-OT and OT-based implementations of DT each encounter a number of problems, some distinct to the particular implementation in question. Before concluding, we will remark on some other, more general concerns that are common to all forms of DT.
One major empirical issue is that DT is based on generalizations about vowel systems that are not phonetically reliable. The familiar figures from the UPSID database (Maddieson 1984) cited by Flemming (2004) and others, such as the generalization that 94% of front vowels are unrounded and 93.5% of back vowels are rounded, are now rendered outdated by newer and more reliable phonetic data. For instance, we now know that contrary to UPSID, English back vowels are [-round] and centralized for many speakers (Joanisse and Seidenberg 1998: 4), that American English /u/ nearly overlaps with Korean /ɨ/ (Yang 1996: 259), and that there is a consistent difference in height and backness between the high front unrounded vowels of German and of Norwegian, although both are transcribed as [i:] in the Stanford archive (Disner 1983: 4). Recasens and Espinosa (2009) present a literature review as well as new data, which point in general to a conclusion that undercuts DT: there is typically not a correlation between the number of vowels in an inventory and the size of its phonetic categories. The DT principle of minimal distance predicts that an inventory with more vowels should have smaller categories, entailing more precise phonetic realization. This is not borne out by the cross-linguistic data. Nor is it clear that larger inventories cover a larger acoustic space, as would seem to be demanded by the DT principle of maximal contrast (see Becker-Kristal 2009: 14ff for an overview). Since Lindau and Wood (1977), it has been recognized that even closely related languages with the same vowel systems do not distribute their vowels the same way in the acoustic space. To take one example, in (13) it can be appreciated that the same seven-vowel system is implemented very differently in Italian and Yoruba (Disner 1983: 17ff). Each of these seven-vowel systems derived from an earlier nine-vowel inventory with the additional categories of *ɪ and *ʊ. However, they differ in the historical developments by which they lost these two categories. In Yoruba, *ɪ and *ʊ merged with the high vowels and a pull chain raised /e/, /ɛ/ and /o/, while in Italian *ɪ and *ʊ merged with /e/ and /o/. Analysis of variance and Duncan post hoc analysis confirm that the F1 values of /e, o, ɔ, a/ are significantly different in the two languages, and there is a significant interaction between the language and vowel variables, which Disner (1983: 22) calls a "pattern effect." Even within a single language, there are differences that dispersion theories do not predict. Indeed, it has been known since Quillis and Esgueva (1983) that vowel qualities differ across varieties of Spanish; O'Rourke (2010) has shown that the same {a e i o u} system in speakers of Peruvian Spanish from Lima even differs significantly from that of their countrymates in Cuzco. Yang (1996: 259) notes that Korean males and females have the same phonemic systems, but "Korean female speakers produced vowels with a wider range of jaw movement than the male speakers." Traunmüller (1984) discusses how the proportionally longer pharynx in males can produce differences in F 2 , while Ryalls and Lieberman (1982) and Diehl et al. (1996) attribute female peripherality to compensation for higher F 0 . Traunmüller (1998) in turn attributes this higher F 0 to the relation of speaking energy and energy supply to linear body size (1:5 and 1:2 respectively). It has also been shown that bilingual speakers often implement the same phoneme in their two languages differently, which proves that the difference cannot be attributed to the variation between individual vocal tracts (Disner 1983;Bradlow 1995). Dispersion theories also wrongly predict the non-existence of languages of the sort in ( 14), which lack some or all of the point vowels.
(14) Systems lacking point vowels: a. Tagalog and Berber have none of the point vowels (Disner 1984) b. Squamish, Alabama, and Amuesha have only [a] of the point vowels (Disner 1984) c. Many languages have [o] rather than [u]: Chickasaw (Gordon et al. 2000), Mura-Pirahã (Sheldon 1974), Hupa (Golla 1970;Gordon 1996), Navajo (McDonough andAustin-Garrison 1994), Banawá (Ladefoged et al. 1997), Nuu-chah-nulth (Sapir and Swadesh 1939) and Wari' (MacEachern et al. 1997).
These facts are particularly problematic for OT-DT, which requires that all vowel systems include high vowels. Systems consisting entirely of point vowels also pose problems for DT. The fact that {i e a u} is favored over {i a o u} is not derivable within DT, as Joanisse and Seidenberg (1998: 3) point out: both systems are equally dispersed, and thus they should be equally favored. Joanisse and Seidenberg propose that {i e a u} is preferred because the ability to stiffen the genioglossus muscle while making high front vowels reduces variation of [i], making the [i]~[e] contrast easier than the [u]~[o] contrast. They then argue that the system with the easier-to-distinguish contrast is more likely to be learned successfully, which results in the front-heavy system being favored cross-linguistically. To preview our discussion in the next section, this is precisely the type of approach that we espouse: it is an argument about learnability, not synchronic grammar, and it is able to decide between two systems that are not differentiable by DT.
DT also suffers from a conflation of acoustic distance with confusability. Diehl et al. (1996) have shown that, for a given set of formants, higher F 0 increases confusability. By virtue of this fact, two vowel pairs wherein the acoustic distance between the two members is identical can differ in confusability, if the F 0 for the vowels is not identical. Also, the potential for confusion is not symmetrical; short vowels tend to be perceived with lower first and second formants, which means that misinterpreting [ɑ] as [ɔ] is more likely than vice versa (ten Bosch 1995). The asymmetry of confusability and its uncertain relationship with acoustic distance is problematic for DT because maximum perceptual distinctiveness can no longer be equated with maximum dispersion, which calls the whole DT enterprise into question.
Pulling back from the details, one can see that the basic problem is that DT, as a surface-driven, functionalist typological analysis, misses both underlying generalizations and idiosyncratic subtleties. In the first case it misses deep generalizations of the sort that link the underlyingly identicalfoot_5 but superficially different inventories of Italian and Yoruba shown in (13), for example, and in the second it fails to deal with gaps in languages like Navajo, which typically result from accidents of history. The notion that synchronic sound systems are best understood through their diachronic origins dates back to the Neogrammarians, Otto Jespersen, Joseph Greenberg, and perhaps most notably Baudouin de Courtenay. It has long been understood that sound changes can produce a vowel system with no felicitous synchronic explanation. One such example involves an ATR contrast that spreads from consonants to adjacent vowels and then disappears entirely in the consonants, as happened in many dialects of Armenian (Vaux 1998). This change resulted in an F 1 contrast produced not from the structure of the vowel system itself, but from a now-lost property of the consonant system. As Lang and Ohala (1996) state, "[w]e should not ignore a purely historical factor in the shaping of languages' segment inventories: less common sounds such as glottal stops, aspirated stops, etc. may evolve fromsplit off fromthe more common ones."
Evolutionary phonology as an alternative to dispersion theory
Taken together, the arguments presented in the previous sections and throughout several decades of literature on various incarnations of DT combine to suggest that DT is inherently too restrictive to account for the observed variety of vowel systems. We may nevertheless retain the basic intuition that a more dispersed system, or an easier to pronounce system, is easier to learn than one that is less so, as ten Bosch (1995) and Joanisse and Seiderberg (1998) have suggested; however, this does not entail that dispersion needs to be built into rankable synchronic constraints, nor does it entail that a less dispersed or harder to pronounce system cannot be learned, as Joanisse and Seiderberg claim. In fact, the entire array of attested vowel systems can be explained solely via the acquisition process, as argued by Ohala (1992), Hyman (1998), Hale andReiss (2000a, 2000b), Harris and Lindsey (2000), and Blevins (2004). If we can explain the attested linguistic patterns with independently needed acquisition processes, there is no need to duplicate them in the grammar (Hale andReiss 2000a, Hale andReiss 2000b;Blevins 2004;Samuels 2011). With respect to vowel systems in particular, Harris and Lindsey (2000) observe that "it is not immediately obvious how positional asymmetries in vowel distribution can simultaneously 'arise from' grammar-internal constraints (Beckman 1997: 7) and be motivated by grammar-external speech functioning. The constraints in question could have at best some intermediate place in a chain of causation." De Boer (2000: 2) adds that synchronic constraints do not make acquisition easier, or help explain it (see also Hale and Reiss 2008):
"These functional explanations [NB: of the distribution of vowels] are not the full explanation, either. They assume that the systems of speech sounds one finds are the result of an optimization of one or more of the proposed criteria. However, it is not clear who is doing the optimization.
Certainly children that learn a language do not do an optimization of the system of speech sounds they learn. Rather, they try to imitate their parents (and peers) as accurately as possible."
How then should the attested range of vowel systems be accounted for? We believe that the answer lies in Evolutionary Phonology (EP; Blevins 2004), which we view an extension of Ohala's listener-based analyses of sound change, Kiparsky's (1972) account of apparent phonological conspiracies, Pierrehumbert's ( 2001) exemplar-based model, and to some degree Boersma's (1997) Functional Phonology. 7 The core of EP is simple and intuitive: a system that is easier to learn than a competitor is predicted to become more common than that competitor over evolutionary time. This is a theory of "probable languages" in the sense of Newmeyer (2005); it is essentially a form of survival of the fittest. Crucially, just as application of survival of the fittest in the animal kingdom does not require that a less fit creature disappear, so its linguistic analogue does not require that harder-to-learn systems disappear (or fail to appear in the first place); they are merely predicted to become less common over evolutionary time.
At the heart of EP is an exemplar-based model of sound change (based on Pierrehumbert 2001) overlaid on a continuous psycho-acoustic vowel space, thus permitting gradual change (as opposed to theories that permit only discrete changes in terms of phonological features). In this model, when a speaker wants to produce a vowel -/u/, for examplehe attempts to produce the "best" exemplar of /u/ that he has previously heard. Crucially, "best" in this context means "most likely to be categorized as /u/." The likelihood of being categorized as [u] comes 7 The model proposed here diverges from Boersma's (1997) theory of Functional Phonology, which purports to be able to explain the symmetries and gaps in inventories of vowels and consonants. In Boersma's model, the attested symmetries are language-specific results of general human limitations on the acquisition of perceptual categorization and motor skills. The gaps are results of local hierarchies of articulatory effort, perceptual contrast, and perceptual confusion. The proposal developed here is largely in agreement with Boersma with regards to how asymmetries may arise, but departs from his theory's explanation of the gaps, for which we seek no functionalist synchronic explanations. from the exemplar's summed similarity to other exemplars of [u] taken as a fraction of its summed similarity to exemplars of all vowels (Pierrehumbert 2001). The other members of the vowel system thus become crucial to the categorization process. In a five-vowel system of /i, e, a, o, u/, /u/ may begin to front because it is closer in the acoustic space to /o/ than to /i/. This may explain why /u/ exhibits some degree of fronting in a wide variety of languages and dialects (e.g., French, Greek, Swedish, São Miguel Portuguese, and most varieties of American English; see Haudricourt and Juilland 1949;Labov et al. 1997), as it is to be expected when /i/ is not the closest neighbor to /u/.foot_6 This creates a feedback loop enhancing distinctness in perception and production, with the effect of distributing the vowels evenly throughout the perceptual space, because the system will reach equilibrium only when the best exemplar of each category is also its mean (see Samuels 2007 on the implications of this view for vocalic chain shifting).
EP categorizes phonetically-motivated sound changes into three types: CHANGE, CHANCE, and CHOICE. CHANGE includes cases in which a learner analyzes an utterance as having an underlying representation that is perceptually similar to but distinct from the underlying representation of the speaker, due to mishearing. For example, /anpa/ may be heard as [ampa] and therefore analyzed by the listener as /ampa/ due to the weak cues indicating the place of the preconsonantal nasal. CHANCE changes are those in which the listener analyzes an ambiguous signal in a way that differs from the speaker's underlying form. For example, [ʔ ̰aʔ] may be analyzed as /ʔ ̰aʔ/, /ʔa/, /aʔ/, or / ̰ aʔ/. Finally, CHOICE can occur when there is variation in the pronunciation of a particular underlying form and the listener posits a different underlying form along with a different phonological rule governing the variation. An example would be an underlying form /kakata/ with an optional vowel shortening/deletion rule producing [kăkata] and [kkata] instead being analyzed as /kkata/ with an optional epenthesis rule.
To take an example of how CHOICE can operate in the evolution of vowel systems, consider the case of Yurok /e/ and /a/ (Blevins 2003). The former *e phoneme in Yurok has split into /e/ and /a/ following the loss of the conditioning environment for a rule lowering /e/ to [a]. Specifically, the loss of /h/ before laryngealized consonants rendered opaque the complementary distribution of these erstwhile allophones in some classes of verbs, resulting in phonemicization of /a/. Blevins further reports on a sound change in progress that lowers /e/ to /a/ before rhotics, /w/, and the fricatives /s, c, ɬ/. This is a gradient phenomenon that produces considerable allophonic variation on the continuum from [e] to [a] through [ɛ] and [ae], sometimes even produced as a diphthong in careful speech. In the aftermath of the /e/~/a/ phonemic split, the opportunity for CHOICE emerged: listeners can now posit either /e/ or /a/ as an underlying representation for the variable vowels. This may explain the variation observed across speakers. Blevins hypothesizes that the establishment of a phonemic contrast between /e/ and /a/ opened the door for more instances of *e to shift into the /a/ category. Note that /a/ did not emerge out of any dispersion-related principles such as maximal distinctiveness of contrasts, but rather due to loss of the conditioning environment for a predictable allophonic distribution of [e] vs. [a] in one small corner of the grammar.
It is also emphasized in EP that common synchronic sound patterns are those that arise from common sound changes, while rare sound patterns arise from unusual sound changes or rare sequences of sound changes. We emphasize in this regard that what is "diachronically possible" (i.e., what phonological patterns can be produced by possible linguistic changes) may be a subset of what is "computationally possible" (i.e., see Hale andReiss 2000a, Hale andReiss 2000b). The overarching factor that we have called "ease of learnability" crosscuts this distinction, since cognitive factors are intimately involved in the language acquisition process, from which linguistic change results. As our understanding of perception and cognition increases, the exact interplay of these factors will become better defined.
EP has the advantage over all-or-nothing synchronic models of providing a straightforward account for cross-linguistic tendencies, while simultaneously allowing a place for accidents of history. This problem of how to deal with 'unnatural' patterns is endemic in phonology, as has been recognized and debated for decades. Idiosyncrasies of history inevitably intrude in language; the synchronic phonological component must be able to accommodate these, but need not incorporate them as synchronic functional constraints.
Conclusions
We have argued here that Dispersion Theory is insufficient to characterize the variety of vowel systems found cross-linguistically. The issue at the core of many of the arguments we have presented hereabove and beyond the issues we have raised concerning the implementation of various types of dispersion theoriesis that the properties of vowel systems are not predictable simply on the basis of opposed articulatory and perceptual concerns. That is to say, at least partially due to accidents of history, inventories are not always symmetrical, do not make ideal use of the articulatory/perceptual space, and so forth. Attempts to force synchronic explanations onto phenomena that are historical or evolutionary miss important phonological generalizations, fail to account for the peculiarities of individual languages, produce unnecessary redundancy, and obscure the actual driving force behind both the similarities and the differences that we observe in human language. An Evolutionary Phonology account of the evolution of vowel systems is more permissive than Dispersion Theory, which is desirable, yet still predicts that some vowel systems will be more common than others for reasons of articulation, perception, and other properties that may affect their learnability.
(
Bert Vaux and Bridget Samuels Authenticated | bv230@cam.ac.uk author's copy Download Date | 9/9/15 11:49 PM
Authenticated | bv230@cam.ac.uk author's copy Download Date | 9/9/15 11:49 PM
Cf. alsoTraunmüller (1998), who asserts that men have less dispersed formants than women because it requires more energy for them to speak, and they do not need the extra contrast to be understood, thanks to their lower fundamental frequency.
Cf.Lang and Ohala (1996) on the difficulty of measuring articulatory cost.
One should not be too quick to form generalizations about two-vowel systems since our data come from only about six languages, four of which are closely related. However, let us assume for the sake of argument that the patterns found in these languages reveal a general trend.
A reviewer suggests that Yoruba, which has ATR harmony, may utilize different features in the mid vowels than Italian, which does not (and, e.g., could use [tense/lax] instead). Even if these features are indeed different, and if it can further be argued that Italian uses [tense/lax], this does not explain all the differences observable in (13).
Note that /u/ fronting to [y] is unexpected from a markedness perspective; seeCalabrese (2005) on this diachronic development as an instance of "emergence of the marked" and on the possibility that this fronting process stems in part from the articulatory difficulty of producing a [ þ ATR] back vowel.
Working Papers, 2007
Annals of Regional Science, 1998
Astrophysical Journal, 1994
Journal of Measurement and Evaluation in Education and Psychology , 2020
International Review of Sociology, 2011
2009
ECONOMÍA PARA ENTENDER A UN MUNDO INNECESARIAMENTE DESDICHADO , 2017
Cadernos de Campo, 2024
The British journal of radiology, 2018
Springer eBooks, 2018
Caligrama: Revista de Estudos Românicos, 2007
arXiv (Cornell University), 2012