Szentgyörgyi Szilárd
UR’s in OT



0. Introduction

This paper addresses some very basic assumptions in standard Optimality Theory1 (OT) (Prince and Smolensky 1993, McCarthy and Prince 1995; for a detailed description of Optimality Theory, see for instance Kager 1999); namely, it discusses the consequences of the principle of the Richness of the Base (ROTB), Lexicon Optimization (LO) with respect to underlying representations (UR’s). We will try to show how the above notions lead to some most disturbing contradictions concerning UR’s.

We will start with the original claim that there are no restrictions or constraints on underlying representations in OT. As a result, any language may have any kind of segment or structure in the Lexicon, and it will be the language-particular ranking of the universal constraint set, CON, that determines the segment inventory of the particular language. However, many linguists have suggested that ROTB is just an inconvenience which may be quite irrelevant. If we want to keep to the original assumption that ROTB is a principle present in the grammar, then we should set up CON so that it produces possible output forms independent of the inputs. It does not mean, of course, that all input forms should yield existing forms but it does mean that all output forms should be well-formed.

The second problem to be addressed is that of Lexicon Optimization addressed by Beckman and Ringen (2003), Inkelas (1995), McCarthy (2003), and Szentgyörgyi (2004, to appear), among others. As the original formulation of LO only works on single forms, Inkelas (1995) suggested an alternation sensitive version (LO’), which takes into consideration all the possible contexts of a form and determines the underlying representation on the basis of the sum of violations by the output forms in all the possible environments. Inkelas claims that the application of LO’ will yield underspecified inputs whenever there is a surface alternation. However, this is only true if we assume that faithfulness constraints are not violated by the filling in of a binary feature. On the other hand, if we assume the opposite, as many do, then inputs will be fully specified but it will always be the structure occurring in the majority of the environments that will be posited as underlying. A significant consequence of this is the fact that in the case of x alternating with y, it is very much possible that both x and y will be found underlyingly as demonstrated in Siptár and Szentgyörgyi (2002) and Szentgyörgyi and Siptár (2004) for Hungarian H-type segments and in Szentgyörgyi (to appear) for RP English /l/ variants.

For instance, Hungarian [x] and [h] will both occur as underlying segments in the language as there are non-alternating forms with both of these segments. To make matters a bit more complicated, we have to note that Hungarian has two more variants of so-called H-types segments: a fronted variant of the voiceless velar fricative and a voiced variant of the glottal one. For the sake of simplicity of the argumentation and the tableaux to be presented we will only concentrate on [x] and [h] and will leave the other two versions out of the discussion. Note though that none of the arguments presented hinges upon the number and exact quality of the surface variants of the underlying H-type segment(s), i.e. the same arguments are valid for an analysis taking all four allophones into consideration. As a result of both [x] and [h] appearing underlyingly, these segments normally considered to be allophones of the same phoneme will have independent underlying status, which is clearly against the usual assumptions made by generative phonological theories. The paper attempts to show how such undesirable results may be avoided.

1. Richness of the Base: setting up the constraint system for predictable variation

The basic assumption of phonological theories concerning predictable alternating and non-alternating segments that appear in complementary distribution is that they should be represented with the same underlying segment. Then it is the responsibility of the constraint hierarchy of an OT grammar to select which of the possible surface forms will occur in a given environment.

The example that we are going to use throughout the paper is a well-known alternation of Hungarian H-type segments: that of the voiceless velar, [x], and voiceless glottal [h] variants:

(1) Alternating and non-alternating Hungarian H-type segments

The usual assumption concerning the above data is that the UR of these segments is always a glottal /h/ and in codas it surfaces as [x]. To achieve this, the following constraints are necessary:

(2) *Coda-h Glottal fricatives are prohibited in codas.
(3) ID Place Correponding segments have the same specification for Place features.
*Coda-h >> ID Place

Note that only the absolutely relevant constraints are shown throughout the paper. For the same reason, candidates involving deletion, insertion, metathesis etc. will not be indicated in tableaux as they are not relevant for the present discussion.

As can be seen in tableau (4), candidate (a) with the glottal [h] in coda position violates the higher ranked markedness constraint while candidate (b) with the velar [x] in the same position violates the lower ranked identity constraint and thus is judged to be optimal. Similarly, in tableau (5) the phonetically correct candidate containing an onset [h], (a), is selected as optimal as it does not violate either constraint, while candidate (b) does violate the identity constraint because of an underlying glottal /h/ corresponding to a surface velar [x].
Because of the Richness of the Base, however, other input forms must also be considered:

Tableau (6) demonstrates that an input with a velar /x/ in coda position will also yield the same optimal candidate as in (4) above. However, in tableau (7) it can be seen that a UR with an onset velar /x/ will result in the selection of a phonetically incorrect output form as optimal. This form, candidate (7b) does not violate either constraint as it does not contain a glottal [h] in a coda and it does not involve any change with respect to the input form. Note that it is not necessary that the same constraint hierarchy yield the same optimal output form for a different input: there is only one requirement, namely that all optimal output forms should be phonologically and phonotactically possible forms, which is clearly not the case here. Thus a new constraint must be introduced to exclude a velar fricative, [x], in onsets:

(8) *Onset-x Velar fricatives are prohibited in onsets.
*Coda-h, *Onset-x >> ID Place

The tableaux (9)-(12) demonstrate that adding the new markedness constraint to the hierarchy will always result in the attested being forms selected as optimal, a desired outcome. The same surface forms are achieved regardless of the choice of the underlying H-type segment.
Another possible solution would be to collapse the two markedness constraints:

(13) H-Place H-type segments are velar if and only if in the coda.

As is clear from tableaux (14)-(17), the new markedness constraint does the very same job just as well as the two separate constraints above. In both cases there is a (set of) constraint(s) which prohibits one surface variant in some environments and other variants in others. If there were no such constraint(s), in some environments, it would not be the actual surface form that would be selected as optimal, i.e. such a constraint (set) is necessary for the treatment of predictable variation.

2. Lexicon Optimization: selecting the underlying representation

The next question to be answered in this section is how we can treat those cases where two or more different input forms yield phonetically identical optimal outputs. Prince and Smolensky (1993) adopted the mechanism of Lexicon Optimization to deal with the problem:

Suppose that several different inputs I1, I2, ..., In when parsed by a grammar
G lead to corresponding outputs O1, O2, ..., On, all of which are realized as
the same phonetic form ? — these inputs are all phonetically equivalent with
respect to G. Now one of these outputs must be the most harmonic, by
virtue of incurring the least significant violation marks: suppose this optima
one is labelled Ok. Then the learner should choose, as the underlying
form for ?, the input Ik.
(Prince and Smolensky 1993:192)

That is, we have to compare the phonetically identical optimal cases arising from different inputs and select the one with the fewest violations; it is the input form belonging to this optimal candidate that will be selected as the UR of the lexical item.

In non-alternating cases it is the input belonging to some of the surface variants that is selected in some environments and the input belonging to other surface variants that is selected in others as demonstrated by the tableaux below.



Tableaux (18)-(21) contain only the input forms and the selected optimal candidates from tableaux (14)-(17). If we compare (18) and (19), where both the input with a glottal /h/ and the other one with a clear /x/ result in the optimal output containing a surface velar [x], it is the input with the velar segment that scores the fewest violations as, besides not violating the markedness constraint, it does not violate the lower ranked faithfulness constraint either. In cases of onset /h/, it is just the opposite; therefore, the input with a glottal /h/ will be selected as the UR of the optimal output.

However, this kind of mechanism clearly leads to a contradiction: although we started out by assuming that the same UR is used for H-type segments in all cases – and that is also the assumption made by most phonological theories – we arrived at a conclusion that contradicts this. Both surface segments underlie, i.e. they contradict, the traditional notion of underlying segments/phonemes. Let us now turn to alternating cases and see how a modified version of LO can or cannot treat those.


3. Alternation-sensitive Lexicon Optimization: full specification or underspecification?

In alternating stem final cases, the surface realization of the H-type segment depends on the following segment: before vowels it can be syllabified into the onset, i.e. it will be realized as a glottal [h], while before a consonant or a pause it will be syllabified into the coda, i.e. it will surface as a velar [x].

Unfortunately, traditional LO would predict two different UR’s for the same morpheme, i.e. it would suggest a morpheme with at least two different underlying forms, a clearly unwanted result:


/1/
(To be more precise, we have to note that intervocalic and intersonorant occurrances of H-type segments are realised as a voiced [?]. However, this is one of the two surface realisations that we are not discussing here.)

For the stem doh~dohos [dox]~[do.ho?] ’musty smell’~’with a musty smell’, (23) with an underlying velar /x/ is selected by normal LO if no suffix follows (or before a consonant-initial suffix), while (24) with an underlying glottal /h/ is selected before a vowel-initial suffix, i.e. the two different underlying forms would be: /dox/~/doh/.
A possible solution may be presented by underspecification: if in both cases the input with an H-type segment underspecified for place features won, we would have one UR for lexical items involving alternations. Let us now turn to such underspecified inputs and see whether they can be selected as the UR by LO. An H-type segment underspecified for place features is indicated with a capital H in the paper.

Tableaux (26) and (27) raise a very important question in connection with underspecified inputs: does the filling in of a binary feature count as a violation of identity constraints? If it does, then (23) is preferred over (26), and (24) is preferred over (27), i.e. fully specified inputs will be preferred to underspecified ones.

However if filling in a binary feature does not count as a violation of ID constraints, then (23) and (26), and (24) and (27) are equally well-formed. But then how can they be distinguished? SPECIFY, a constraint requiring outputs fully specified for binary features, clearly does not work as it cannot refer to inputs, just like any other constraint cannot solely refer to inputs. Instead, DEP [feature] constraints will prefer fully specified inputs over underspecified ones, as they penalise those output features that do not have a correspondent in the input representation.

(28) DEP Place. Place features in the output have correspondents in the input.

This way even if the filling in of a binary feature counts as a violation of an IDENTITY [F] constraint, fully specified inputs are still preferred to underspecified ones by LO. Thus we may conclude that whether IDENTITY [F] constraints are violated by filling in a binary feature or not, underspecified input forms will be dispreferred by Lexicon Optimization.

Consequently, the forms in (23) and (24) are selected as the optimal underlying representations of the word forms in question, i.e. in all such cases we have one morpheme with two underlying forms on the one hand, and we also have two segments in complementary distribution which will both be found in underlying forms as seen earlier.

Another way of approaching this problem would be to change LO in a way to enable it to address alternations. Such a version of LO, let us call it LO’, has been suggested by Inkelas (1995):

Given a grammar G and a set S = {S1, S2, ... Si} of surface phonetic forms
for a morpheme M, suppose that there is a set of inputs I = {I1, I2, ... Ij},
each of whose members has a set of surface realizations equivalent to S.
There is some Ii ? I such that the mapping between Ii and the members of S
is the most harmonic with respect to G, i.e.it incurs the fewest marks for
the highest ranked constraints. The learner should choose Ii as the underlying
representation for M.
(Inkelas 1995:7)

That is, we have to compare the phonetically identical optimal outputs corresponding to different input forms, evaluate them in all the relevant environments, and see which one fares best on the constraint hierarchy all in all. Let us take a look at such a complex evaluation for one of the above examples. Markedness constraints are not indicated in these tableaux as they are equally violated by the candidates regardless of the input specifications.

DEP Place and ID Place may be ranked whichever way; it will not influence the selection of the optimal candidates in any of the earlier tableaux. However, it does influence the choice of the winning candidate in (29). If DEP Place dominates ID Place then it is clearly candidate (b) with the underlying velar fricative that is the most optimal input form as it causes no violation of the higher ranked DEP Place, while candidates (a) and (c) do. Note that for this we assume that glottal segments do not have Place features. As a result changing an underlying velar into a surface glottal does not violate DEP Place but it does violate ID Place. However, should the ranking be the reverse, i.e. should ID Place dominate DEP Place, then the choice of the optimal input, yet again, depends on whether ID constraints are violated by the filling in of a binary feature. If they are, then candidate (b) is selected again as it causes the fewest violations of ID Place. To be more precise, it causes one violation of ID Place just like the first input candidate, but while in (b) there is no violation of the other constraint, DEP Place, there is a violation for candidate (a).On the other hand, if ID constraints are not violated by filling in binary features, candidate (c) with the underspecified input will be selected as optimal, as it causes no violation of ID Place, while the outputs of the other two input candidates do.

Whichever the case, it is still not a desired result at all. Let us remember that in some non-alternating cases – in those lexical items where the H-type segment is always in onset position – a glottal /h/ will be selected as the optimal input form, while in others – in those with the H-type segment always in the coda – an underlying velar /x/ is selected by LO. Note that Inkelas’s (1995) LO’ will not have any influence in these non-alternating cases, i.e. these segments remain stable regardless of the environment. Thus there are at least two, but possibly three, different underlying representations for surface H-type segments: two for the non-alternating cases; and, depending on the ranking of ID Place and DEP Place in (29) and depending on whether the filling in of binary features violates ID constraints, even underspecified underlying H-type segments may appear, which is the third possible input form. It seems then that regardless of how we treat filling in underspecified features and regardless of the ranking of constraints, there must be at least two different UR’s for H-type segments.

One might wonder whether changing the interpretation of the relevant environments would have any effect on the selection of the ideal input forms. In the above discussion we considered just two environment types: onset and coda positions. However, one might suggest two more subtypes of the latter that matter: before consonants and before a pause.

However, it can be seen that the very same arguments hold for the input and output forms in tableau (30) as in tableau (29). As a result, we can safely claim that whatever the relevant environment(s), the number of underlying representations suggested by LO or LO’ is at least two. That is, this way it is not possible to reduce the number of UR’s for allophonic variation. Let us now turn to the possible solutions that may be proposed to answer this challenge.


4. Possible solutions to the problems

In the above discussion we have seen that if we take the Richness of the Base seriously, and if we apply either Prince and Smolensky’s LO or Inkelas’s LO’, we get a solution which contradicts the usual assumption that alternating segments have a common underlying representation. Thus either it is the ROTB that is incorrect, or LO (or LO’), or the general claim concerning UR’s. Let us take a look at these three factors - and one more at the end – and see how they could be modified and what kind of consequences these modifications would entail.

Richness of the Base Abandoned

One possibility is to give up the idea of the Richness of the Base. However, this would lead to some undesirable problems that would challenge the very core of the theory: on the one hand giving up ROTB would mean imposing restrictions on UR’s. Since not everything is a possible UR, UR’s are limited; moreover, they have to be stipulated as there are no clearcut mechanisms to calculate them from output forms. This would lead to a grammar that is far more powerful than the current version of OT allows: it is very much possible that a grammar flawlessly works on the stipulated input forms, but fails to select grammatically possible optimal output forms in cases where a different input form occurs.

Lexicon Optimization (or LO’) Modified

Another possible amendment could be to give up or change Lexicon Optimization – or Inkelas’s alternation sensitive version. What are the necessary modifications that we have to apply to make LO work?
We have to ensure that allophonic variation results from one underlying segment – it was this assumption that our results above contradicted. That is, if two surface segments are in complementary distribution, then they have to come from the same input form, i.e. even if traditional or alternation sensitive LO suggests two (or three) different input segments, another mechanism has to undo this. For instance, we could suggest that the underlying forms in such cases will always contain the underlying segment found in alternating forms – i.e. either /dox/ or /doH/ depending on the ranking of ID Place and DEP Place in (29) and (30). The choice between these two may be governed by Grammar Optimization (GO):

The optimal grammar is the most transparent, i.e. the one in which
alternations are maximally structure-filling. In terms of Optimality Theory,
this means that PARSE
[i.e. members of the MAX constraint family
in the Faithfulness version of OT] is ranked as high as possible.’ (Inkelas 1995:11)

On this basis, MAX or ID constraints must be ranked higher than DEP constraints to maximize transparency in the grammar. If this is applied to our case, then in (30) DEP Place must be ranked below ID Place. Moreover, if ID constraints are violated by feature filling, (30b) is selected as optimal while if ID is not violated by feature filling, (30c) with an underspecified input is selected to be the UR. That is if we assume that ID constraints are not violated by feature filling and we also apply Grammar Optimization, then we will get underspecified inputs in the case of alternating forms. Then we may also assume that GO goes one step further and the underlying form in alternating cases will be extended to non-alternating cases by analogy as described above. This way the system can be made to work without one principle contradicting the other.

No Underlying Representations

A third possibility is to give up the idea of underlying representations, i.e. the traditional notion of underlying segments/phonemes would not be used any more. This way we would have to claim that UR’s do not exist, and as a result, there would be no need to calculate them by applying LO. Thus Lexicon Optimization would be deemed unnecessary as well. Also, if there are no UR’s, there could be no input-output faithfulness constraints, i.e. the MAX and DEP constraint families would cease to exist. Even in the case of ID constraints, ID input-output constraints would be ruled out and only output-output correspondence would play a role in determining faithfulness. This way underlying representations would be replaced by paradigms along the lines suggested by Burzio (1998), Kenstowicz (2001), Padgett (2001) and (Rebrus and Törkenczy 2004).

Only Unary Features

As some of the problems have been caused by a question concerning violations of ID constraints by filling in binary feature specifications, one might suggest that this kind of problem could be avoided if we had only unary features. Note also that in such a case, the abovementioned modification of LO is not an option because it is based on the proposal that in cases of allophony and alternation, an underspecified input form should be preferred. As underspecification does not make sense in the case of unary features, the modification of LO would not be an option. Also note that unary features result in a much more constrained grammar along the lines of element theory proposed in Government Phonology, CV-phonology and VC-phonology.

5. Conclusion

In this paper we wanted to show how the different assumptions made by generative phonology in general and OT in particular may lead to contradiction in the treatment of very simple allophonic data, like the H-type segments in Hungarian. Also, we showed that there are three possible options for avoiding these problems: either Lexicon Optimization must be modified and complemented by Grammar Optimization, or the use of underlying representations should be given up and grammar should be based on output-output correspondence relations in paradigms, or feature representations should be more restricted, i.e. only unary features should be allowed. Of course, any combination of the above three suggestions is also a viable option.



Bibliography
Beckman, J., Ringen C. 2003. Lexicon Optimization and redundant feature specification. University of Iowa. Paper presented at MCWOP 2003.
Burzio, L. 1998. Multiple correspondence, Lingua, 103: 79-109.
Inkelas, S. 1995. The consequences of optimization for underspecification. In Beckman, J. (ed), Proceedings of the North East Linguistic Society 25, Amherst: GLSA,. 287-302.
Kager, R, 1999. Optimality Theory. Cambridge: CUP.
Kenstowicz, M, 2001. Paradigmatic uniformity and contrast. Paper presented at 3émes Journées Internationales du
GDR Phonologie. Nantes: University of Nantes.
McCarthy, J. J. 2003. Richness of the Base and the determination of underlying representations. (Unpublished manuscript), Amherst: University of Massachusetts.
McCarthy, J. J., Prince, A. 1995. Faithfulness and reduplicative identity, In: Beckman, J., Dickey, L., Urbanczyk, S. (eds.) UMOP 18: Papers in Optimality Theory, Amherst, MA: GLSA, 249-384.
Padgett, J. 2001. Systemic Contrast and the Diachrony of Spanish Sibilant Voicing. ROA-634. [http://roa.rutgers.edu/files/574-0203/574-0203-PADGETT-0-0.PDF]
Prince, A., Smolensky, P.1993. Optimality Theory: Constraint interaction in generative grammar, Technical Report of the Rutgers Center for Cognitive Science, Rutgers University.
Rebrus, P., Törkenczy, M. 2004. Uniformity and contrast in the Hungarian verbal paradigm. In: Downing, L.J., Hall, T.A., Raffelsiefen, R. (eds.) Paradigms in Phonological Theory, Oxford: Oxford University Press, pp. 294-328.
Siptár, P., Szentgyörgyi, S. 2002. H as in Hungarian. Acta Linguistica Hungarica 49:427–56.
Szentgyörgyi, S., Siptár, P. 2004. Hungaria H-type segments in Optimality Theory. In: Siptár, P., Pinon, C. (eds.) Approaches to Hungarian: Vol. 9, Budapest: Akadémiai Kiadó, 261-83.
Szentgyörgyi, S. 2004. How rich is the Lexicon phonologically? Paper presented at the 12th Manchester Phonology Meeting.
Szentgyörgyi, S. 2006. How rich is the Lexicon phonologically? In: Bárdos, J. (ed.) Proceedings of the 2005 HUSSE Conference. Veszprém: Veszprém University Press.