1. Introduction
One of the strongest points of the architecture of Lexical-Functional Grammar (LFG), and one of the ways in which it differs most strikingly from the Chomskyan approach, is in its recognition that there are different types of information associated with linguistic elements, e.g. syntactic information, functional information and information about semantic roles and that each type of information needs to be represented independently at c-structure, f-structure and a-structure, respectively. The constraints on what is a possible information structure may vary between the different components, and there may be "discrepancies" between the levels. The limits of these discrepancies can then be defined in terms of constraints on correspondences, or mapping principles. The most well-defined of these are the mapping principle between f-structure and a-structure (Bresnan and Kanerva 1989, Bresnan and Moshi 1990, Bresnan and Zaenen 1990, Levin 1989). This separation of types of information has led to insightful analyses of "raising" and control phenomena (going back to Bresnan (1982)), and has also shed new light on problems which arose because syntactic arguments and arguments relating to functional structure appeared to point in different directions. Kroegers (1993) discussion of subjects in Tagalog is an excellent example of this.
Another attractive aspect of LFG is that morphology can create f-structure in much the same way that constituency can. A lot of work in this area has been done recently (see for instance Börjars and Chapman (1998), Bresnan (1997b, In press) and Nordlinger (1997a, 1997b). The exact status of morphological information still remains unclear; the architecture of LFG in principle allows separation between c(onstituent)-structure and m(orphology)-structure - explicit proposals to this effect is made in Butt, Niño and Segond (1996) and Frank and Zaenen (1998) - but most accounts appear to assume that morphological information is, in fact, part of c-structure. A discussion workshop held at LFG98 in Brisbane made clear that a consensus view does not exist on this issue within the LFG research community. The difference about what information should be represented in m-structure also varies between Butt, Niño and Segond (1996) and Frank and Zaenen (1998).
The purpose of this paper is to discuss a dataset which to my mind could be interpreted as an indication that there can be discrepancies between the morphological and the syntactic (or constituent) structure of the kind which would motivate a separate morphological level. The dataset involves clitics, and this kind of element is, of course, a prime candidate for a treatment in terms of different morphological and syntactic structure (see Autolexical Syntax accounts of such elements in Sadock (1991) Lapointe (1992) and Börjars (1997a)). Other candidates are so-called superlexemes, like the German or French preposition+determiner combinations vom (von+dem from the.M/N.DAT) and du (de+le of the.M), respectively. Ackema and Neeleman (1998) give further examples of morphology-syntax discrepancies from separable verbs in Germanic.
2. Clitics and how to deal with them
Clitic is a notoriously vague category, and I will make no attempt here at defining it. I will just assume that there are some characteristic properties which a typical clitic has; it is bound and its position is defined in terms of a phrasal rather than lexical category. The distinction between an affix and a clitic then lies mainly in position, but a number of associated characteristics follow from this, e.g. degree of boundedness and degree of morpho-phonological irregularities. Syntactic positioning is then a characteristic of all clitics, some clitics are even more syntactic in that they need to be represented as a separate category in the syntactic representation. This distinction can be illustrated by comparison with the possessive s in English - which positions as a clitic, but to which one would not want to assign a syntactic category - and the clitic auxiliaries in English - which also position in relation to a syntactic unit but which one would assume to belong to a syntactic category (V or I). Clitics can then be said to be elements which appear to participate in both the syntactic and the morphological components and hence share properties with both independent words and affixes.
In spite of this, many theoretical analyses of clitic phenomena play down either the morphological or the syntactic properties and place the element squarely in the other component. An example is Anderson (1992, 1993, 1996), in whose view any element which is bound, be it an affix or a clitic, has no representation in the syntax; clitics (including those that clearly manifest syntactic information, such as pronominals and auxiliary markers of tense and aspect) are analyzed as phonological material introduced into the PF-representations of phrases by rules belonging to the same broad class as those of Word Formation. (Anderson 1996:165) To Anderson, clitics represent information associated with "functional categories", which is represented as a feature at the phrase level. Word Formation rules then place the phonological material associated with the feature in its correct position. These rules may make reference either to phonological units (e.g. word) or syntactic units (e.g. syntactic object). In Anderson (1996), the positional differences between affixes and clitics is accounted for in terms of Optimality Theory. An approach similar to that of Anderson has been formulated within Generalized Phrase Structure Grammar and Head-driven Phrase Structure Grammar by Lapointe (1990) and Miller (1992a).
Work in the Chomskyan tradition on the other hand gives ample examples of how clitics are dealt with entirely in the syntactic component. In such analyses, the distinction between affix, clitic and weak pronoun becomes difficult to make since all these types of words differ from other categories mainly in being syntactically dependent (cf. work in van Riemsdijk (1991), Hellan (1993), Rizzi (1993) and van Riemsdijk and Hellan (1995), and analyses like Haegeman (1993)).
Any formalism with parallel modules, like LFG and Autolexical Syntax should be able in principle to analyse elements as having independent category status in the syntax, but forming part of another word in the morphology. If we can establish that elements exist for which such an analysis is motivated, then we have found another argument in favour of modular architecture for grammars. In the remainder of this paper, I will consider potential candidates for such an analysis.
3. The problem
The Balkan languages (except Greek) and the Scandinavian languages all have a definite ending which may allow the noun to which it is attached to function as a full referential noun phrase (for further data and discussion see Börjars (In press: Ch 3)):
| (1) | a. | får-et | Swedish |
| sheep-DEF | |||
| the sheep | |||
| b. | seyDur-in | Faroese | |
| sheep-DEF | |||
| the sheep | |||
| c. | c&ovek-ot | Macedonian | |
| man-DEF | |||
| the man | |||
| d. | om-ul | Romanian | |
| man-DEF | |||
| the man |
I will now compare these elements - which I shall refer to as DEFs - with respect to a number of properties.
The apparently morphological elements in (1) appear to contribute to the phrase that which it needs to function as a full referential noun phrase, much in the way that English the does. Within LFG, where selection is not defined in terms of category, but in terms of functional information, this does not force an analysis of these elements as a syntactic D (contrary to much work in the Chomskyan tradition, see for instance (Delsing 1989, Delsing 1991a, Delsing 1992, Dobrovie-Sorin 1987, Nilsson 1968, Santelmann 1992, SigurÍsson 1992, Taraldsen 1990)). Instead, we can assume that what allows a nominal to function as a full referential noun phrase in these languages is the definiteness marking, [±DEF]; an analysis along similar lines for English noun phrases has been proposed within Categorial Grammar by Payne (1995b). A value for the feature [±DEF] can then be contributed either through the syntax or by morphological means. We can suggest lexical entries like those in (2).1
| (2) | a. | the | D, | (^DEF) = + | English |
| b. | -ot | CL (D), | (^DEF) = + | Macedonian | |
| c. | -en | AF, | (^DEF) = + | Norwegian |
I follow Grimshaw (1982) here in using the notation CL for the clitic, but obviously this does not solve the issue of how to represent the necessary distinction between affix behaviour and clitic behaviour, and as far as I can tell, this problem is still not resolved in Frank and Zaenen (1998) who use the same way of distinguishing affixes and clitics. I have also added D, since if -ot is associated with a syntactic category it would have to be D. I will return to the issue of syntactic categories for clitics in §3.3.
For all these elements holds that they cannot be written as a separate word, but there are also other signs of boundedness, like morphophonological irregularities and arbitrary gaps in the distribution. This to my mind does not indicate affix (as opposed to clitic) status; the kind of irregularities I refer to are to be expected of any bound element and its host. The more selective an element is with respect to its host - the more affix-like an element is - the more likely we are to get irregularities. Hence any account which assumes that clitics are dealt with entirely by syntactic rules is inappropriate.
Morphophonological irregularities are more common in Mainland Scandinavian than they are in Insular Scandinavian and the Balkan languages, but can be found in all languages. Examples are provided in (3) to (5).
| (3) | a. | hestur+inn | Æ | hesturinn | Icelandic |
| horse(M).DEF | |||||
| b. | lifur+inn | Æ | lifrin | ||
| liver(F).DEF | |||||
| c. | hreiDur+iD | Æ | hreiDriD | ||
| nest(NT).DEF |
| (4) | a. | staDur | staDur-in | staD-num | Faroese |
| place(M) | place-DEF.NOM | place-DEF.DAT | |||
| b. | seyDur | seyDur-in | seyD-inum | ||
| sheep(M) | sheep-DEF.NOM | sheep-DEF.DAT | |||
| (5) | a. | teatar+ot | Æ | teatrot | Macedonian |
| theatre.DEF | |||||
| realisam+ot | Æ | realismot | |||
| reality.DEF | |||||
| turisam+ot | Æ | turisamot | |||
| tourism.DEF | |||||
| b. | familii+i | Æ | familiei | Romanian | |
| family(F).GEN/DAT.DEF | |||||
| c. | copiii | ||||
| children.DEF |
The kind of arbitrary gaps which might be expected if the DEF element is a morphological element are clearly there in Mainland Scandinavian as the examples in (6) show. Fakta appears not to be able to combine with the definite ending, even though as (6b) shows, there is no objection to using fakta within a definite noun phrase where no definite ending is required.
| (6) | a. | *Fakta-na | pekar | inte | åt | det hållet. | Swedish | |
| facts-DEF | point | not | at | that direction | ||||
| b. | Dessa | fakta | pekar | inte | åt | det hållet. | ||
| these | facts | point | not | at | that direction | |||
| The/These facts do not point in that direction. | ||||||||
All the DEF elements appear to be unable to take scope over a co-ordinated nominal. Examples from Macedonian and Swedish are provided in (7) and (8).
| (7) | a. | *[maz&i | i | z&eni] -te | Macedonian | |
| husbands | and | wives -DEF | ||||
| b. | *[maz&i-te | i | z&eni] | |||
| husbands-DEF | and | wives | ||||
| c. | maz&i-te | i | z&eni-te | |||
| husbands-DEF | and | wives-DEF | ||||
| the husbands and wives | ||||||
| (8) | a. | [hink | och | spade] | -n | Swedish |
| bucket | and | spade | DEF | |||
| b. | [mamma | och | pappa]-en | |||
| mother | and | father | DEF | |||
I am unsure what conclusion can be drawn about the morphological status of DEF on the basis of these facts. It is difficult to make a definite article take scope over a co-ordinated nominal even when it is clearly syntactic, as in English. A sentence like I dropped the children off at the school and nursery seem to imply that the school and the nursery are the same place. However, a definite article in English can at least determine co-ordinated nominals which form close combinations or collocations, like the fish and chips and the mother and father. As the examples in (7) and (8) show, this is not possible with DEF in either Macedonian or Swedish.
When it comes to syntactic status, there are at least two aspects to consider; firstly whether or not the elements position is defined in terms of a phrase, and secondly, whether there are arguments for assuming that the element should be assigned to an independent category in the syntactic component. If we assume that full argument status is not defined in terms of syntactic structure, as discussed in §3.1, examples like those in (1) do not in themselves form evidence in favour of DEFs status as determiner (D) in the syntax.
Consider placement first; in the Balkan languages the DEF elements appear to be second position elements as shown in (9) for Macedonian (see Lunt (1952), Elson (1976) and Scatton (1980) for Bulgarian and Macedonian; Dobrovie-Sorin (1987), Grosu (1988) and Renzi (1989) for Romanian; Morgan (1984) for Albanian, Halpern and Zwicky (1996) for theoretical articles on the issue, for a survey of the exceptions to this generalisation, see Börjars (In press: §3.6)).
| (9) | a. | c&ovek-ot | Macedonian | ||
| man-DEF | |||||
| the man | |||||
| b. | dobr-iot | c&ovek | |||
| good-DEF | man | ||||
| the good man | |||||
| c. | dobr-iot | mal | c&ovek | ||
| good-DEF | little | man | |||
| the good little man | |||||
| d. | *dober | c&ovek-ot | |||
| good | man-DEF | ||||
| e. | *dober | mal-iot | c&ovek | ||
| good | little-DEF | man | |||
In all the Scandinavian languages, DEF will always occur on the head noun, and hence it shows affix-like behaviour in that respect. Even though the positioning of DEF in (9) can be defined as second position, it can also be stated in terms of the initial element requiring marking for definiteness. The exact position then follows from the fact that DEF is enclitic (or suffixal) rather than proclitic (or prefixal). This view has lead to the Edge Feature accounts, and weakens the use of data like (9) as an argument for the syntactic status of DEF. However, it should be pointed out that even within an Edge Feature account, the "edge affix" needs to have syntactic knowledge in the sense that it needs to know about the nature of the phrase on whose edge it is to attach.
I will now consider arguments in favour of DEF requiring an analysis in which it actually fills the position of D in the syntactic structure. The first argument is based on co-occurrence; under the assumption that a noun phrase has at most one D position, if DEF co-occurs with another determiner, then we can assume that it does not fill a D position in the syntax. This is the case in Norwegian and Albanian, as (10) and (11) show.
| (10) | a. | den | gamle | mannen | Norwegian |
| this | old | man.DEF | |||
| b. | den | mannen | |||
| that | man.DEF | ||||
| c. | denne | mannen | |||
| this | man.DEF | ||||
| (11) | a. | ky | djalë / | djal-i | Albanian |
| this | boy | boy-DEF | |||
| b. | ai | djalë / | djal-i | ||
| that | boy | boy-DEF |
So, even though Norwegian and Albanian behave differently with respect to the positional criterion, their two DEFs follow the same pattern here.
Danish and Macedonian - which again differ diametrically with respect to the positioning of DEF - do not allow co-occurrence at all, as in (12) and (13).
| (12) | a. | toj | c&ovek / | *c&ovek-ot | Macedonian | |
| that | man | man-DEF | ||||
| b. | ovoj | c&ovek / | *c&ovek-ot | |||
| this | man | man-DEF | ||||
| (13) | a. | den | unge | mand / | *mand-en | Danish |
| the/that | young | man | man-DEF | |||
| b. | den | mand / | *mand-en | that | man | man-DEF |
| c. | denne | mand / | *mand-en | |||
| this | man | man-DEF |
Under the assumption that in order for a nominal to function as a full referential noun phrase it needs to contain a D projection, the data in (12) and (13) would lead to the conclusion that DEF must be a syntactic D itself. An alternative, in an account which accepts zero elements, would be to assume that DEF "licenses" a zero determiner in D. This can be restated within LFG as the inflection allowing N to be mapped into D lexically. Under either of the two latter analyses, the data in (12) and (13) do not force an analysis of DEF itself as a D in the syntax. Furthermore, under our functional interpretation of what makes a nominal a full referential noun phrase, we could just assume that the syntactic category of the phrases in (12) and (13) is DP and that the ones lacking a syntactic determiner but containing DEF are NPs, but that the two are functionally equivalent (in being specified for [±DEF]) and hence both able to be selected, say, by a verb requiring an object. The conclusion is then that, depending on your assumptions, the data in (12) and (13) need not force any conclusions about DEFs syntactic status.
I turn now to the second possible argument for syntactic status for DEF. There appears to be a constraint in Germanic languages, that when there are pre-modifiers, there must be a syntactic determiner. This constraint shows up most clearly in the Scandinavian languages, becausethey permit a definite noun to function as a full noun phrase without a syntactic determiner, and previous analyses have generally worked on the assumption that it holds only for definite noun phrases (Delsing 1992, Holmberg 1987, Svenonius 1992a, 1992b). Consider the data in (14).
| (14) | a. | *sömniga | katt-en | Swedish | ||
| sleepy.DEF | cat-DEF | |||||
| b. | den | sömniga | katt-en | |||
| the | sleepy.DEF | cat-DEF | ||||
| the sleepy cat | ||||||
The prenominal adjective forces the presence of a syntactic determiner, in Swedish and Norwegian DEF is maintained in these constructions, but in Danish the presence of a syntactic determiner excludes the presence of DEF.
This constraint appears to hold also for indefinite noun phrases. There are a few environments in which indefinite singular count nouns can occur without a syntactic determiner, particularly in predicative position, as in (15a). As (15b) shows, if a prenominal adjective is added a syntactic determiner is required, as in (15c).
| (15) | a. | August | är | författare. | Swedish | |
| August | is | author | ||||
| August is an author. | ||||||
| b. | *August | är | intressant | författare. | ||
| August | is | interesting.INDEF.C | author | |||
| c. | August | är | en | intressant | författare. | |
| August | is | a.C | interesting.INDEF.C | author(C) | ||
| August is an interesting author. | ||||||
The constraint turns out not to be limited to the Scandinavian languages. The data in (16) demonstrates that the same generalisation can be made for Dutch.
| (16) | a. | Oscar | is | leraar. | Dutch | |
| Oscar | is | teacher | ||||
| Oscar is a teacher. | ||||||
| b. | *Oscar | is | goede | leraar. | ||
| Oscar | is | good | teacher | |||
| c. | Oscar | is | een | goede | leraar. | |
| Oscar | is | a | good | teacher | ||
| Oscar is a good teacher. | ||||||
This appears then to be a very general constraint at work in the Germanic languages. The question is how to state it. In Börjars and Donohue (1998) we argue that this is due to a requirement for a headed D-projection whenever a prenominal adjective is present. We provide an analysis in terms of Optimality Theory in which the presence of the syntactic determiner is ensured by a high ranking of OB-HD, requiring an obligatory lexical filler for the head. If we assume that this criterion can indeed be stated in terms of an obligatory D-projection, then some of the Scandinavian DEFs are indeed D-fillers, and in this sense, syntactic elements. Consider the Icelandic data in (17), similar data may be found in Faroese, though the status of such noun phrases is somewhat unclear (Barnes 1994:207).
| (17) | a. | mikli | maDur-inn | Icelandic |
| great | man-DEF | |||
| the great man | ||||
| b. | gamli | bátur-in | Faroese | |
| old | boat-DEF | |||
| the old boat | ||||
The fact that DEF in Icelandic does permit the presence of an adjective can, under these assumptions, be seen as evidence that it should have the status of D in the syntactic representation. Analyses to this effect have been proposed (Börjars 1997a, Sadock 1991:113-5).
At first DEF in the Scandinavian languages appeared to be similar in that they have all the characteristic properties of an affix, and hence should be relatively unproblematic. Now we have two parameters along which they vary; firstly some of the Scandinavian languages allow (Faroese) or require (Norwegian) co-occurrence of DEF and a syntactic determiner and some forbid it (Danish); and secondly, the Mainland Scandinavian languages do not permit DEF to be the sole determiner when a prenominal adjective is present, whereas the Insular Scandinavian languages do. These two distinctions then need to be captured in any representation. The latter of the two, I have interpreted here as the Insular Scandinavian DEF having some degree of independent status in the syntax, whereas for the Mainland Scandinavian languages, there is no such argument. There is one further difference between the two sub-groups of Scandinavian languages which strengthens this view. Consider the paradigms from Icelandic given in (18), where hest horse is a masculine noun of what is usually referred to as one of the strong declinations.
| (18) | a. | indefinite forms of hest horse | Icelandic | ||
| singular | plural | ||||
| nominative | hest-ur | hest-ar | |||
| accusative | hest | hest-a | |||
| dative | hest-i | hest-um | |||
| genitive | hest-s | hest-a | |||
| b. | definite forms of hest horse | |||
| singular | plural | |||
| nominative | hest-ur-inn | hest-ar-nir | ||
| accusative | hest-inn | hest-a-na | ||
| dative | hest-i-num | hest-um-num | ||
| genitive | hest-s-ins | hest-a-nna | ||
The pattern which clearly emerges here is that a morpheme indicating
case and number precedes DEF, but at the same time DEF itself
- whose basic form we can assume to be (i)n - appears
to be inflected for the same features. Compare this with the element
which we can refer to as the demonstrative (translated by Kress
(1982:107) into German as jener that or der andere
the other), given in (19).
| (19) | paradigm for hinn in masculine | Icelandic | |||
| singular | plural | ||||
| nominative | hinn | hinir | |||
| accusative | hinn | hina | |||
| dative | hinum | hinum | |||
| genitive | hins | hinna | |||
Similar comparisons can be made for other genders and declinations, and also for Faroese (see Kress (1982:56-84) for Icelandic and Lockwood (1977:28-46) for Faroese), in many cases the kind of irregularities typical of inflectional affixes occur. The most plausible analysis of this is to assume that DEF, which developed from a syntactic determiner by grammaticalisation, has maintained its inflectional pattern, but that the noun itself also subject to regular inflectional rules before combining with DEF. This seems a far more economical solution, given the regularity, than assuming that, say DEF.M.SG.NOM is instantiated as -urinn whereas INDEF.M.SG.NOM is instantiated as -ur. Such an analysis would make the similarities appear accidental. This then also points towards DEF having a more syntactic status in Insular Scandinavian than it does in Mainland Scandinavian. This behaviour of grammaticalised clitic elements is not uncommon, and if the clitic is not itself inflected, the inflection will often externalise, leading to what can be referred to as reordering of affixes (see Haspelmath (1993) for an account of this phenomenon). In this case, the grammaticalised element was already inflected, and whereas one might have expected the internal inflection to disappear, parallel to the development from the intermeiate stage in the externalising examples provided by Haspelmath (1993), the Insular Scandinavian system appears to be stable, a further sign that DEF is not an affix in these languages.
4. Conclusions
In this paper I have used data from a small set of languages which
share one property - they all have a DEF element - to show that
a comparative analysis of the data brings out a number of distinctions
which have to be mirrored in any analysis. The properties can
be summed up somewhat simplified as in Table 1.
| Macedonian | |||||
| Albanian | |||||
| Norwegian | |||||
| Danish | |||||
| Icelandic |
Table 1: A comparison of the properties of DEF
As Table 1 shows, the different properties are not related in the sense that the fact that two elements share one property makes no prediction as to their behaviour with respect to another property. Hence we need to be able to represent variation in many dimension. One way of achieving this would be to explore the possibility of morphological information representing a separate level of information, thus adding m-structure to c-structure, f-structure and a-structure in the LFG architecture, as proposed by Butt, Niño and Segond (1996) and Frank and Zaenen (1998). In such an architecture, morphological arguments can be separated out from purely syntactic ones, in much the same way that arguments relating to c-structure are distinguished from those relating to the f-structure. In such a system, a feature could belong uniquely to m-structure or uniquely to f-structure, or to both. One could also imagine that constituency could vary between m-structure and c-structure. Even though I do not provide any formalisation here of m-structure representation, it is clear that such a separate level would give more scope to do justice to the subtle distinctions which exist.
There are, of course, alternative accounts, at least for aspects of the data presented here. The requirement that a syntactic determiner be present with prenominal adjectives, for instance, has been given a number of different analyses and some of these could possibly be extended to the Icelandic data (for an overview, see Börjars and Donohue (1998)). However, all previous accounts of which we are aware rely to some extent on the assumption that it is a constraint that applies only to definite noun phrases in the Mainland Scandinavian languages. As I have shown, this is not the case and hence accounts which rely specifically on definiteness are not acceptable.
An alternative could be to assume that the positional constraint on definiteness which clearly holds for the Balkan languages can be extended to the Scandinavian ones. Under such an analysis, DEF would be analysed as a morphological element, and lexical integrity would require it to form part of the word to which it is attached in the syntax. We could then assume that there was an ordering constraint for the Scandinavian languages requiring the first constituent to represent definiteness. If the noun phrase consists only of a noun or a noun with postmodification, then this constraint is satisfied by the noun. When premodifying adjectives occur, a syntactic determiner must be introduced to satisfy the constraint. Some independent constraint would then decide whether or not DEF was maintained in such constructions. This would neatly account for why a syntactic determiner is not normally required when there is only post-modification of the noun. It would, however, have to assume that the marking of the adjective which differs between definite and indefinite noun phrases is in fact not definiteness marking. Otherwise, we would predict that (14a) and (15b) would be grammatical. Under this analysis, we would have to ignore predeterminers for the purposes of position; they would be "extrametrical", and we would also have to account for why the definite noun cannot simply precede the adjective in Scandinavian, as it can in the Balkan languages.2 Furthermore, the difference in behaviour between the Mainland Scandinavian languages and the Insular ones, as illustrated in (18) and (19) is still unaccounted for. It is quite possible that solutions to all these issues can be found and the necessary distinctions argued for, but it is by no means obvious.
Sadler (1997) analyses another data set whose behaviour causes tension between the urge to maintain lexical integrity and the need to account for the positioning of the elements. She considers the distribution of a set of clitic pronouns in Welsh; these elements share a number of properties with the DEF elements that I have considered here, particularly in that they occur in a non-canonical position within their phrase - this is also the case for DEF in the Scandinavian languages compared to the syntactic determiners -, the fact that they show signs of not being syntactically independent elements, for instance by participating in morphological blocking (Börjars and Donohue 1998), and finally, but probably less importantly, they cannot have scope over co-ordinated structures. The solution provided in Sadler (1997) relies on the use of a structural type - introduced also in Sadler and Arnold (1994) - which is syntactically transparent, but which shares a number of features with morphological structures; small constructions. The result for a Welsh noun phrase which contains a pronoun, one adjective from a small set of possible prenominal ones and a noun is found in (20) (Sadler 1997:18 longer version).
(20)
In principle, this type of solution might be used for some subset of the distinctions required by the data discussed here. In particular, one might think of representing the clitic-like elements in this way, and the affix-like ones in a more conventional structure. One property that holds for the elements with which Sadler deals is that they all attach to the head of their phrase, and in this sense they are untypical clitics. The Welsh pronominal element can attach to different lexical categories, but only because it is associated with different phrasal categories; when it occurs in an NP it attaches to the head N, when it occurs within a VP it attaches to the V and within an IP it attaches to I. In order to capture the distinction between the Balkan DEF and the Scandinavian one, on the other hand, some parameter of placement would still need to be introduced, possibly in the form of an edge feature, a theoretical notion of which Sadler is critical (1997). However, if this could be done within the assumptions of this solution, then the distinction between the Balkan and the Scandinavian languages could be captured in terms of the placement parameter, and the distinction between the Insular Scandinavian languages and the Mainland Scandinavian languages could be stated in terms of whether the DEF element attaches in a small construction or as a conventional affix. There is, however, an important theoretical issue to be dealt with before such a solution is explored in more detail, and that is the exact status of the lexical elements involved in small constructions; is (20) a syntactic or a morphological representation? Sadler describes these small constructions as a structural type which is syntactically transparent, but which shares a number of features with morphological structures, but this description still leaves questions unanswered. The head category used is N, normally a lexical category to which morphological processes may apply. This is, however, not assumed to be the case with the two higher Ns in (20) (Sadler, pc). Still, the lowest N must be assumed to be an ordinary noun which could be the input to, say, an inflectional rule, or a compounding rule. Such issues as these seem to me to be of crucial importance for an evaluation of the solution to be possible.
There are then at least two promising paths within current LFG analyses, along which a detailed analysis of this dataset could be developed. My point with this preliminary paper has been to show that datasets can be found which will force us to develop in more detail the use of morphological information within LFG, and in particular to evaluate the introduction of a separate m-structure
Notes:
* I am grateful to Joan Bresnan, Louisa Sadler, Peter Sells and Nigel Vincent for helpful discussions on the topics touched upon in this paper. I have not had the time to follow up suggestions for possible solutions before the deadline for this paper, but this certainly does not mean that I do not find them worth pursuing. I am also grateful to Martin French and Mirjana Koc&oska for their help with the Balkan data. Back to text
1 I use a caret (^) instead of an up-arrow in the html version of this document. Back to text
2 Peter Sells has suggested that this may be due to the fact that if the noun was found in the D position (in the same way that an inflected verb can be mapped lexically to the I node), then there would be no lexical filler of the N position, and assuming that we do not allow the adjective to adjoin to a headless projection, the ordering would be ruled out. This solution seems well worth pursuing. Back to text
References:
Ackema, P. and Neeleman, A. 1998. Competition between syntax and morphology. Ms. University of Groningen/University College London.
Anderson, S.R. 1992. A-morphous morphology. Cambridge: Cambridge University Press.
Anderson, S.R. 1993. Wackernagels revenge: clitics, morphology, and the syntax of second position. Language 69:68-98.
Anderson, S.R. 1996. How to put your clitics in their place, or why the best account of second-position phenomena may be something like the optimal one. The Linguistic Review 13:165-191.
Barnes, M.P. 1994. Faroese. In E. König and J. van der Auwera (eds), The Germanic languages. London: Routledge. 190-218.
Börjars, K.E. 1997a. One (more) reason why we need morphology. In W.U. Dressler, M. Prinzhorn and J.R. Rennison (eds), Advances in morphology. Berlin: Mouton de Gruyter. 111-130.
Börjars, K.E. In press. Feature distribution in Swedish noun phrases. [Publications of the Philological Society] Oxford: Blackwell.
Börjars, K.E. and Chapman, C. 1998. Agreement and pro-drop in some dialects of English. Linguistics 36:71-98.
Börjars, K.E. and Donohue, M. 1998. Much ado about nothing - features and zeroes in Germanic noun phrases. Ms. University of Manchester (Available from http://lings.ln.man.ac.uk/html/keb/papers.html).
Bresnan, J. (ed) 1982. The mental representation of grammatical relations. Cambridge, Ma: MIT Press.
Bresnan, J. 1997b. Morphology competes with syntax: explaining typological variation in weak crossover effects. In P. Barbosa, D. Fox, P. Hagstrom, M. McGinnis and D. Pesetsky (eds), Is the best good enough? Proceedings from the workshop on optimality in syntax. Cambridge, Mass: MIT Press.
Bresnan, J. In press. Explaining morphosyntactic competition. In M. Baltin and C. Collin (eds), Handbook of contemporary syntactic theory. Oxford: Blackwell.
Bresnan, J. and Kanerva, J.M. 1989. Locative inversion in Chichewa: a case study of factorization in Grammar. Linguistic Inquiry 20:1-50.
Bresnan, J. and Mchombo, S.A. 1987. Topic, pronoun and agreement in Chichew_a. Language 63:741-782.
Bresnan, J. and Moshi, L. 1990. Object assymetries in comparative Bantu syntax. Linguistic Inquiry 21:147-186.
Bresnan, J. and Zaenen, A. 1990. Deep unaccusativity in Lexical-Functional Grammar. In K. Dziwirek, P. Farrell and E. Mejías-Bikandi (eds), Grammatical relations: a cross-theoretical perspective. Stanford, Ca: CSLI. 45-57.
Butt, M., Niño, M.-E. and Segond, F. 1996. Multilingual processing of auxiliaries in LFG. Paper presented at Natural language processing and speech technology: results of the 3d KONVENS conference, Bielefeld.
Delsing, L.-O. 1989. A DP analysis of the Scandinavian noun phrase. Paper presented at The NP Colloquium, Manchester University.
Delsing, L.-O. 1991a. Quantification in the Swedish noun phrase.
Delsing, L.-O. 1992. On attributive adjectives in Scandinavian and other languages. In A. Holmberg (ed), Papers from the workshop on the Scandinavian noun phrase. [DGL-UUM-Report 32] Department of General Linguistics, University of Umeå. 20-44.
Dobrovie-Sorin, C. 1987. A propos de la structure du groupe nominal en roumain. Rivista di Grammatica Generativa 12:123-152.
Elson, M.J. 1976. The definite article in Bulgarian and Macedonian. Slavic and East European Journal 20:273-279.
Frank, A. and Zaenen, A. 1998. Tense in LFG: syntax and morphology. Ms. Xerox Research Centre Europe.
Grimshaw, J. 1982. On the lexical representation of Romance reflexive clitics. In J. Bresnan (ed), The mental representation of grammatical relations. Cambridge, Mass: MIT Press. 87-148.
Grosu, A. 1988. On the distribution of genitive phrases in Rumanian. Linguistics 26:931-949.
Haegeman, L. 1993. The morphology and distribution of object clitics in West Flemish. Studia Linguistica 47:57-94.
Halpern, A.L. and Zwicky, A.M. (eds) 1996. Approaching second. Second position clitics and related phenomena. Stanford, Ca: CSLI Publications.
Haspelmath, M. 1993. The diachronic externalization of inflection. Linguistics 31:279-309.
Hellan, L. (ed) 1993. Eurotyp Working Papers VIII.4: Clitics in Germanic and Slavic. Katholieke Universiteit Brabant, The Netherlands.
Holmberg, A. 1987. The structure of NP in Swedish.
Kress, B. 1982. Isländisches Grammatik. München: Max Hueber Verlag.
Kroeger, P. 1993. Phrase structure and grammatical relations in Tagalog. Stanford, Ca: CSLI Publications.
Lapointe, S.G. 1990. EDGE features in GPSG. In M. Ziolkowski, M. Noske and K. Deaton (eds), Papers from the 26th Regional Meeting of the Chicago Linguistic Society. Chicago: Chicago Linguistic Society. 221-235.
Lapointe, S.G. 1992. Life on the edge: argument in favor of an Autolexical account of edge inflections. In C.P. Canakis, G.P. Chan and J. Marshall Denton (eds), Papers from the 28th Regional Meeting of the Chicago Linguistic Society. Chicago: Chicago Linguistic Society. 318-333.
Levin, L.S. 1989. Operations on lexical forms: unaccusative rules in Germanic languages. New York: Garland.
Lockwood, W.B. 1977. An introduction to Modern Faroese. Tórshavn: Føroya Skúlabókagrunnur.
Lunt, H.G. 1952. A grammar of the Macedonian literary language. Skopje.
Miller, P.H. 1992a. Clitics and constituents in phrase structure grammar. New York: Garland.
Morgan, J. 1984. Some problems of agreement in English and Albanian. In C. Brugman and M. Macaulay (eds), Proceedings of the Tenth Annual Meeting of the Berkeley Linguistics Society. [BLS10] Berkeley: Berkeley Linguistics Society. 233-247.
Nilsson, K.G. 1968. Noun and article in Swedish. Studia Linguistica 22:51-63.
Nordlinger, R. 1997a. Constructive case: dependent-marking nonconfigurationality in Australia. Dissertation, Stanford University.
Nordlinger, R. 1997b. Morphology building syntax: constructive case in Australian languages. In T.H. King and M. Butt (eds), Proceedings of LFG 1997. Stanford, Ca: CSLI On-line.
Payne, J.R. 1995b. English noun phrases in Categorial Grammar. Ms. University of Manchester.
Renzi, L. 1989. The Rumanian article as a Balkanism. In U. Klenk, K.-H. Körner and W. Thümmel (eds), Variatio Linguarum. Beiträge zu Sprachvergleich und Sprachentwicklung. [Festschrift zum 60. Geburtstag van Gustav Ineichen] 217-225.
Rizzi, L. (ed) 1993. Eurotyp Working Papers VIII.3: Clitics in Romance and Germanic. Katholieke Universiteit Brabant, The Netherlands.
Sadler, L. 1997. Clitics and the structure-function mapping. Paper presented at LFG 97 Conference, University of California, San Diego.
Sadler, L. and Arnold, D.J. 1994. Prenominal adjectives and the phrasal/lexical distinction. Journal of Linguistics 30:187-226.
Sadock, J.M. 1991. Autolexical syntax. A theory of parallel grammatical representations. [Studies in Contemporary Linguistics] Chicago: The University of Chicago Press.
Santelmann, L. 1992. Den-support: an analysis of double determiners in Swedish. In A. Holmberg (ed), Papers from the workshop on the Scandinavian noun phrase. [DGL-UUM-Report 32] Department of General Linguistics, University of Umeå. 100-118.
Scatton, E.A. 1980. On the shape of the Bulgarian definite article. In C. Chvany and R. Brecht (eds), Morphosyntax in Slavic. Columbus, Oh: Slavica. 204-211.
SigurÍsson, H.Á. 1992. Aspects of the DP analysis of the Icelandic NP. In A. Holmberg (ed), Papers from the Workshop on the Scandinavian Noun Phrase. Umeå: Department of General Linguistics, University of Umeå. 119-144.
Svenonius, P. 1992a. The structure of the Norwegian noun phrase. Paper presented at The Seventh Workshop on Comparative Germanic Syntax, Stuttgart.
Svenonius, P. 1992b. The extended projection of N: identifying the head of the noun phrase.
Taraldsen, K.T. 1990. D-projections and N-projections in Norwegian. In J. Mascaró and M. Nespor (eds), Grammar in progress, GLOW essays for Henk van Riemsdijk. Dordrecht: Foris. 419-431.
van Riemsdijk, H. (ed) 1991. Eurotyp Working Papers VIII.2: Clitics and their hosts. Katholieke Universiteit Brabant, The Netherlands.
van Riemsdijk, H. and Hellan, L. (eds) 1995. Eurotyp Working
Papers VIII.6: Clitics: their origin, status and position.
Katholieke Universiteit Brabant, The Netherlands.