A Companion to Chomsky. Группа авторов
on this approach, the space of linguistic variation is part of UG.
An essential part of the framework was clustering, that is, parameters should represent clusters of properties. Chomsky (1981, p. 6) put it as follows: “[I]deally we hope to find that complexes of properties […] are reducible to a single parameter, fixed in one or another way.” A prominent example of this reasoning is provided by the Null Subject Parameter (Rizzi 1982; see also D'Alessandro 2015 for a review), which governs the realization of silent subjects, like in Spanish and many other languages. An example is provided in (11), where the subject does not have to be pronounced. This contrasts markedly with the same sentence in English, where the subject cannot be left out.
1 (11)(Voi) state leggendo un libro.(Italian)you.PL are reading a book“You are reading a book.”(D'Alessandro 2015)
Rizzi (1982) argued that being a null subject language is correlated with several other properties: i) the subject can be moved out of a finite embedded clause that is headed by an overt complementizer, (12), ii) they allow for subject inversion (Kayne 1980), (13), iv) these are languages with so‐called rich agreement on the verb (Taraldsen 1980; Alexiadou and Anangostopoulou 1998), in which both referential and nonreferential (so‐called expletive) null subjects are licensed, unlike in non‐null subject languages, (14).
1 (12)¿Quiéni dijiste que ti salió temprano?(Spanish)who say‐PRET.2.SG THAT leave‐PRET.3.SG early‘Who did you say that left early?”(Perlmutter 1971, p. 103)
2 (13)a.È arrivato Gianni.(Italian)b.*Est arrivé Jean.(French)c.*Has arrived John.(English) (Roberts 2007, p. 28)
3 (14)a.It rains frequently in April.b.*Rains frequently in April.c.*Ello/Lo llueve a menudo en abril.(Spanish)it.STRONG/it.CLITIC rain.3SG frequently in Aprild.Llueve a menudo en abril.‘It rains frequently in April.’(Judy and Rothman 2010, pp. 200–201)
This clustering highlights another virtue of the clustering idea, namely that properties that are hard to observe for the child are linked to properties that are easy to observe. It is easy for a child to determine whether the language in question is a null subject language or not, but it is not easy to observe the relevant data that will tell her that extraction out of a finite embedded clause with an overt complementizer is licit or not, as in (12). Such data are not very frequent, and this may create problems when it comes to extracting the right generalization from them when they are observed. If the clustering claim is true, the model really would have solved Plato's problem in abstraction: Learners do not have to compare whole grammars; they need to learn one parameter after the other. This would be a small number of choices, each of which would not involve many options. However, this only holds in abstraction because the model was never supplemented with an actual theory of how the child would learn the relevant parameter settings from the input.
We said “if true” because a lot of work since has cast a lot of doubt on the particular clustering that Rizzi (1982) argued for (e.g. Haspelmath 2008, Jaeggli and Hyams 1988, Newmeyer 2005, Rothman and Iverson 2007; Rothman 2009a, Sheehan, Chapter 11 of this volume). Baker (2008, p. 352) claims that “[h]istory has not been kind to the Pro‐drop Parameter as originally stated.” Since Chomsky (1981), the mainstream view on parameters has changed quite a bit, as Sheehan (Chapter 11) illustrates and which we will return to briefly below.
The first part of Principles and Parameters (P&P) was immensely productive, in particular from a cross‐linguistic point of view. Hornstein (2013, p. 399) puts this well in the following quote:
It is important to appreciate how fecund and productive this period of research [up to the 1990s] was. Arguably, we learned more new facts about more typologically diverse languages than ever in the history of the study of language. We learned a tremendous amount about how languages and grammars operate, what they have in common and how they differ.
This is an important result that is often underappreciated by Chomsky's critics.
The Principles and Parameters theory came to have two different instantiations or models (cf. Freidin and Vergnaud 2001, Hornstein, Nunes and Grohmann 2005, Lasnik and Uriagereka 2005, Boeckx and Uriagereka 2006, Lasnik and Lohndal 2010, 2010, Freidin 2012). The first one was called Government and Binding, and this was the model that was developed and used throughout the 1980s. In the beginning of the 1990s, Chomsky developed a new model, called the Minimalist Program. We will discuss the essentials of Government and Binding in the next section, and then in section 3.5, explain the transition to the Minimalist Program.
3.4 Government and Binding
The main idea behind the model of grammar labeled Government and Binding (GB) was developed by Chomsky in several publications (1981, 1982, 1986). According to Chomsky (1981, p. 5ff.), the theory of grammar consists of a rule system and its subcomponents, depicted in (15), and a system of principles and its subcomponents, shown in (16). The overall architecture is provided in (17).
1 (15) Rule system1.lexicon2.syntaxa.categorial componentb.transformational component3.Phonetic Form4.Logical Form
1 (16) Principle systembounding theorytheta‐theorybinding theorygoverment theorycase theorycontrol theory
1 (17) Y‐model
In constructing a sentence (what is typically called the derivation), the grammar machine starts out with lexical and functional items from the lexicon. The lexicon contains all idiosyncratic information about lexical items. In turn, these items feed the syntax by way of assembling a D‐structure where argument structure properties are assigned, i.e. a transitive verb will be in a configuration with an object and a subject, and an intransitive verb with just a subject. Then transformations (called Move α, where α stands for any constituent) map this structure into an S‐structure (e.g. in the case of wh‐movement as in What did Jane read?, which starts out with the structure Jane read what), which fairly closely resembles the surface order of words. At this point, the resulting structure is transferred to the two interpretive interfaces: Phonetic Form (sound) and Logical Form (meaning). Note also that the principal subsystems in effect apply wellformedness constraints on the other levels of representation. Chomsky (1981, pp. 5–6) summarizes the organization of the grammar as follows:
The lexicon specifies the abstract morpho‐phonological structure of each lexical item and its syntactic features, including its categorial features and its contextual features. The rules of the categorial component meet some variety of X‐bar theory. Systems 1.1 and 1.2a constitute the base. Base rules generate D‐structures (deep structures) through insertion of lexical items into structures generated by 1.2a, in accordance with their feature structure. These are mapped to S‐structure by the rule Move‐alpha, leaving traces coindexed with their antecedents; this rule constitutes the transformational component 1.2b, and may also appear in the PF‐ and LF‐components. Thus the syntax generates S‐structures which are assigned PF‐ and LF‐representations by components 3 and 4 of 1. Bounding theory poses locality conditions on certain processes and related items. The central notion of government theory is the relation between the head of a construction and categories dependent on it. Theta‐Theory is concerned with the assignment of thematic roles such as agent‐of‐action, etc…. Binding theory is concerned with relations of anaphors, pronouns, names, and variables to possible antecedents. Case theory deals with assignment of abstract Case and its morphological realization. Control theory determines the potential for reference of the abstract pronominal element PRO.
Much more can be said about each of these components, and Haegeman (1994) provides an excellent and detailed exposition. Many other textbooks also exist, so in the interest of space, we will not provide a more detailed outline of the GB architecture. Some necessary details will be covered in the discussion of the Minimalist Program in the next section, but rather than go through the entire system