
Saturday, February 7, 2015

The future of linguistics: two views

Addendum: I would like to apologize for systematically mis-spelling Peter Hagoort's name. Before I revised the post below, I called him 'Hargoort.' This was not due to malice (someone called 'Norbert' is not really into the name mocking business) but because I cannot spell. Sorry.

Peter Hagoort and I are both worrying about the future of linguistics (see here). [1] We both lament the fact that linguistics once played “a central role in cognitive science” but that “studies in language relevant topics are no longer strongly influenced by the developments in linguistics.” This is unfortunate, according to Hagoort, because “linguists could help cognitive (neuro)scientists to be more advanced in their thinking about the representational structures in the human mind.” He is right, of course, but why he thinks this is so is very unclear given what he actually proposes, as I show below.

I too lament these sad happenings. Moreover, I am saddened because I have seen that consistent collaboration between GGers and psychologists, computationalists and neuroscientists is not only possible but also very fruitful. Nor is it that hard, really. I know this because I live in a department that does this every day. Of course doing good work is always difficult, but doing good work that combines a decent knowledge of what linguists have discovered with techniques and insights from nearby disciplines (psych, CS, neuroscience) is quite doable. It is even fun.

Unfortunately, as Hagoort’s piece makes clear, he has no idea that this is the case. He appears to believe that linguists have very little to offer him. I suspect that this might indeed be true for the kinds of questions he seems interested in. But I would conclude from this that he is missing out on some really interesting questions. Or, to be more charitable: what is sad about cognitive neuroscience of the Hagoort variety is that it has stopped investigating the kinds of questions that knowing something about linguistics would help answer. Why is this? Hagoort’s diagnosis of linguistics’ fall from cogneuro grace offers an explanation. He identifies three main problems with linguistics of the generative variety. I will review them and comment seriatim.

First, Hagoort believes that the sorts of representations that GGers truck in are just not right for the brain. In other words, he believes that cogneuro has shown that brains cannot support the kinds of mental representations that linguists have argued for. As Hagoort puts it: “language-like structures do not embody the basic machinery of cognition” (2). How does he know? His authority is Paul Churchland, who believes that “human neuronal machinery differs from that of other animals in various small degrees but not in fundamental kind.” The conclusion is that the language-like representations that GGers typically use to explain linguistic data are not brain-structure compatible.

Unfortunately, Hagoort does little more than baldly state this conclusion in this short piece.[2] However, the argument he points to is really quite bad. Let’s assume that Hagoort is right and that non-linguistic cognition (he illustrates with the imagery debate between Kosslyn and Pylyshyn) does not use language-like structures in executing its functions (I personally find this unlikely, but let’s assume it for the sake of argument). Does Hagoort really believe that linguistic behavior does not exploit “language-like structures” (what Hagoort calls “linguaform”)? Does Hagoort really believe that not even sentences have sentential structure? If he does believe this, then I await the dropping of the second shoe. Which one? The one containing the linguaform-less reanalyses of the myriad linguistic phenomena that have been described using linguaforms. To my knowledge this has never been seriously attempted. Paul Churchland has never suggested how this might be done, nor has Hagoort so far as I know. The reason is that sentences have structure, as 60 years of linguistic research has established, and so far as we can tell, the structures that phrases and sentences have (and linguistic sounds and words and meanings) are unlike the structures that scenes and non-linguistic sounds and smells have. And as linguistics has shown over the last 60 years of research, these structural features are important in describing and explaining a large array of linguistic phenomena. So, if linguists have been wrong in assuming that “linguaforms” are implicated in the description and explanation of these patterns, then there is a big empirical problem waiting to be tackled: to reanalyze (viz. re-describe and re-explain) these very well studied and attested linguistic data in non-“linguaform” terms. Hagoort does not mention this project in his short speech. However, if he is serious in his claims, this is what he must show us how to do. I very much doubt that he will be able to do it. In fact, I know he won’t.

Let me go further still. As I never tire of mentioning, GG has discovered a lot about natural language structure, both its universal properties and its range of variation. GGers don’t understand everything, but there is wide consensus in the profession that sentences have proposition-like structure and that the rules of grammar exploit this structure. This is not controversial. And if it is not, then however much our brains resemble those of other animals, the fact that humans do manipulate “linguaforms” implies that humans have some mental/neural capacities for doing so, even if other animals do not.[3] Moreover, if this is right, then Hagoort’s finger is pointing in the wrong direction. The problem is not with linguistics, but with the cogneuro of language. It has decided to stop looking at the facts, something that we can all agree is not a good sign of scientific health within the cogneuro of language.

So Hagoort is ready to ignore what GG has discovered without feeling any obligation to account for this “body of doctrine.” How come? He actually provides two reasons for this neglect (though he doesn’t put it this way).

The first reason he provides is that linguists are a contentious lot who not only (i) don’t agree with one another (there is “no agreed upon taxonomy of the central linguistic phenomena”) but (ii) have also “turned their backs to the developments in cognitive (neuro) science and alienated themselves from what is going on in adjacent fields of research” (2). I somewhat sympathize with these two points. A bit, not a lot. Let me say why.

Let’s address (i): Contrary to the accepted wisdom, linguistics has been a fairly conservative discipline, with later work largely preserving the insights of earlier research and then building on these. This may be hard to see if you are an outsider. Linguists, like all scientists, are proud and fractious and argumentative. There is a bad habit of pronouncing revolutions every decade or so. However, despite some changes in theory, GG revolutions have preserved most of the structures of the ancien regime. This is typical for a domain of inquiry that has gained some scientific traction, and it is what has taken place in GG as well. However, independently of this, there is something more relevant to Hagoort’s concerns. For the purposes of most of what goes on in cogneuro, it really doesn’t matter what vintage theory one adopts.

Let me be blunter. I love Minimalist investigations, but for most of what is studied in language acquisition, language processing and production, and neurolinguistics it really doesn’t matter whether you adopt the latest technology or the newest-fangled concepts. You can do good work in all of these areas using GB models, LFG models, HPSG and GPSG models, Aspects models, and RG models. For many (most?) of the types of questions being posed in these domains, all these models describe things in effectively the same way, make essentially the same distinctions and adopt more or less the same technology.

I’m not making this up. I really do know this to be true, for I have seen it at work in my own department. There may be questions for which the differences between these various approaches matter (though I am pretty skeptical about this, as I consider many of these notational variants rather than differing theories), but for most everything I have personally witnessed, this has not been the case. This indicates that, contrary to what Hagoort reports, there is a huge overlapping consensus in GG about the basic structure of natural language. That he has failed to note this, IMO, suggests that he has not really taken a serious look at the matter (or asked anyone). Of course, life would be nicer were there less pushing and pulling within linguistics (well, maybe; I like the contention myself), but that’s what intro texts are for, and by now there are endless numbers of these in linguistics that Hagoort could easily consult. What he will find is that they contain more or less the same things. And they are more than sufficient for many of the things he might want to investigate, or that’s my guess.

Hagoort’s claim (ii) is that linguists ignore what is going on in cogneuro. Is this correct? Some do, some don’t. As I noted, my own department is very intellectually promiscuous, with syntacticians, phonologists and semanticists mixing freely and gaily with psycho, computational and neuro types on all sorts of projects. However, let’s again assume that Hagoort is right. The real question is, intellectually speaking, who needs whom more? I would contend that though checking in with your intellectual neighbors is always a good thing to do, it is currently possible (note the qualifier, please) to do fine work in syntax while ignoring what is happening in the cogneuro of language. The opposite, I would contend, is not the case. Why? Because to study the cogneuro of X you need to know something about X. Nobody doing the cogneuro of vision would think that ignoring what we know about visual perception is a good idea. So why does Hagoort think that not knowing anything about linguistic structure is OK for the study of the cogneuro of language? All agree that the cogneuro of language aims to study those parts of the brain that allow for the use and acquisition of language. Wouldn’t knowing something about the thing being used/acquired be useful? I would think so. Does Hagoort?

This is not apparent from his remarks, and that is a problem. Imagine you were working on the cogneuro of vision, and suppose that the people who work on visual perception were rude and obstreperous. Would it be scientifically rational to conclude that it’s OK to ignore their work when working on the cogneuro of vision? I would guess not. Their results are important for your work. Even if it might be hard to get what you need from a bunch of uncivilized heathens, that doesn’t make getting what you need any less critical. So even if Hagoort is right about the lack of interest among linguists in cogneuro, that’s not a very good reason not to claw your way to their results (is it, Peter?).

This said, let me admonish my fellow linguists: If a cogneuro person comes and asks you for some linguistic instruction BE NICE!! In fact be VERY, VERY NICE!!! (psst: it appears that they bruise easily).

So, IMO, the first two reasons that Hagoort provides are very weak. Let’s turn to his third for, if accurate, it could explain why Hagoort thinks that work in linguistics can be safely ignored. The third problem he identifies concerns the methodological standards for evidence evaluation in linguistics. He believes that current linguistic methods for data collection and evaluation are seriously sub-par. More specifically, our common practice is filled with “weak quantitative standards” and consists of nothing more than “running sentences in your head and consulting a colleague.” I assume that Hagoort further believes that such sloppiness invalidates the empirical bases of much GG research.[4]

Sadly, this is just wrong. There has been a lot of back and forth on these issues over the last five years, and it is pretty clear that Gibson and Fedorenko’s worries are (at best) misplaced. In fact, Jon Sprouse and Diogo Almeida have eviscerated these claims (see here, here, here and here). They have shown that the data that GGers use in everyday practice are very robust and that there is nothing lacking in the informal methods deployed. How do we know this? It can be gleaned from the fact that using the more conventional statistical methods beloved of all psychologists and neuroscientists yields effectively the same results. Thus, the linguistic data that GG linguists have collected in their informal way (consulting their intuitions and asking a few friends) are extremely robust, indeed more robust than those typically found in psych and cogneuro (something that Sprouse and Almeida also demonstrate). Hagoort does not appear to know about this literature.[5] Too bad.[6]
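
To make concrete what the formal alternative amounts to, here is a minimal sketch of the genre of judgment experiment at issue. Everything in it is illustrative: the sentences, rating distributions and effect size are simulated assumptions, and this is not Sprouse and Almeida’s actual pipeline.

# A simulated acceptability-judgment experiment: participants rate a
# grammatical baseline and an island-violating counterpart on a 7-point
# scale, and we test whether the informally reported contrast replicates.
# All numbers are made up for illustration.
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
n = 40  # hypothetical number of participants

# Simulated ratings, e.g. "Who did you say that Mary saw __?" (baseline)
# vs. "Who did you wonder whether Mary saw __?" (island violation).
baseline = np.clip(np.round(rng.normal(6.0, 1.0, n)), 1, 7)
island = np.clip(np.round(rng.normal(2.5, 1.2, n)), 1, 7)

t, p = stats.ttest_rel(baseline, island)  # paired t-test across participants
diff = baseline - island
d = diff.mean() / diff.std(ddof=1)  # Cohen's d for the paired contrast
print(f"t({n - 1}) = {t:.2f}, p = {p:.2g}, d = {d:.2f}")

# If the informal judgment is as robust as Sprouse and Almeida report, the
# formal experiment simply reproduces it, usually with a very large effect.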

In light of his (as we have seen, quite faulty) diagnosis, Hagoort offers some remedies. They range from the irrelevant to the anodyne to the misinformed to the misguided. The irrelevant is to do “proper experimental research” (i.e. do what Sprouse and Almeida show linguists already do). The anodyne is to talk more to neuroscientists. This is fine advice, the kind of thing that deans say when they want to look like they are saying something stimulating but really have nothing to say. The misinformed is to work more on language phenomena and less on “top heavy theory.” The misguided is to embed linguistic theory in “a broader framework of human communication.” Let me address each in turn and then stop.

The irrelevant should speak for itself. If Sprouse and Almeida are right (which I assure you they are; read the papers) then there is nothing wrong with the data that GGers use. That said, there is nothing inherently wrong with using more obsessive methods when greater care is called for (though they are more time consuming and expensive, and no more accurate). My colleagues in acquisition, processing and production use these all the time. Sprouse has also used them when the data as conventionally gathered have failed to be clear cut. As linguistics develops and the questions it asks become more and more refined, it would not surprise me were we forced to the more careful methods that Hagoort prizes. There is nothing wrong with using these methods where useful, but there is nothing good about using them when they are not (and, to repeat, they are far less efficient). It all depends, like most things.

The anodyne should also be self-evident. Indeed, in some areas (e.g. phonology and morphology) cogneuro techniques promise to enrich linguistic methods of investigation. But even in those areas where this is currently less obvious (e.g. syntax, semantics) I think that having linguists talk to neuroscientists will help focus the latter’s attention onto more interesting issues and may help sharpen GGers’ explanatory skills. So, in addition to it just being good to be catholic in one’s interests, it might even be mutually beneficial.

The third suggestion above is actually quite funny. Clearly Hagoort doesn't talk to many linguists or read what they write or go to their talks. Most current work is language based and very descriptive. I’ve discussed this before, lamenting the fact that theoretical work is so rarely prized or pursued (see here). Hagoort has already gotten his wish. All he has to do is talk to some linguists to realize that this desire is easily met.

The last point is the one to worry about, and it is the one that perhaps shows why Hagoort is really unhappy with current linguistics. He understands language basically from a communication perspective. He wants linguists to investigate language use, rather than language capacity. Here we should resist his advice. Or, rather, we should resist the implication that the communicative use of language is the important problem while limning the contours of human linguistic capacity is at best secondary. From where I sit, Hagoort has it exactly backwards. Language use presupposes knowledge of language. Thus, the former is a far more complicated topic than the latter. And all things being equal, studying simple things is a better route to scientific success than studying more complicated ones. At any rate, the best thing linguists who are interested in how language is used to communicate can do is keep describing how Gs are built and what they can do.

That’s what I think. I believe that Hagoort believes the exact opposite. He is of the opinion that communication is primary while grammatical competence is secondary, and here, I believe that he is wrong. He gives no arguments or reasons for this view and until he does, this very bad advice should be ignored. Work on communication if you want to, but understanding how it works will require competence theories of the GG variety. It won’t replace them.

OK, this has been far too long a post. Hagoort is right. Linguistics has gone into the shadows. It is no longer the Queen of the Cognitive Neuro-Sciences. But this demotion is less for intellectual than for socio-political reasons, as I’ve argued extensively on FoL. I am told that among cogneuro types, Hagoort is relatively friendly to linguistics. He thinks it worth his time to advise us. Others just ignore us. If Hagoort is indeed our friend, then it will be a long time before linguistics makes it back to the center of the cogneuro stage. This does not mean that good work combining neuro and GG cannot be pursued. But it does mean that for the nonce this will not be received enthusiastically by the cogneuro community. That is too bad: sociologically (and economically) for linguistics, intellectually for the cogneuro of language.





[1] Actually, it entered my e-mail. Thx Tal and William.
[2] This is not to fault him, for this was an address, I believe, and in all good addresses brevity is the soul of wit.
[3] Some readers may have caught a whiff of the methodological dualism discussed here and here.
[4] He refers to Gibson and Fedorenko’s 2010 TICS paper, and this is what it argues.
[5] Those interested in a good review of the issues can look at Colin Phillips’ slides here.
[6] There is a certain cargo-cult quality to the obsession with the careful statistical vetting of data. I suspect that Hagoort insists on this because it really looks scientific. You know: the lab coats, the button boxes, the RSVP presentation all look very professional. Maybe we should add ‘sciency’ to Colbert’s ‘truthy’ to describe what is at issue sociologically.

75 comments:

  1. Is there an example of a case where the results of a psycholinguistic experiment were used to decide between two theories of syntactic representations in a way that became generally accepted among syntacticians?

    Replies
    1. I can think of one along these lines off the top of my head: Sprouse, Wagers & Phillips (2012): A test of the relation between working-memory capacity and syntactic island effects. They attempted to rule out processing-based accounts of islands in favor of grammatical accounts due to the lack of a correlation between working-memory capacity and acceptability.

      But this isn't exactly what you asked; I am unaware of any psycholinguistic or neuroimaging experiment that impinged on disputes between linguistic theories.

      http://ling.umd.edu/~colin/wordpress/wp-content/uploads/2014/08/sprousewagersphillips2012.pdf

    2. There are various examples of studies that have been presented as evidence for one representational theory over another, but in my experience: (i) few of them really stick, and (ii) the first point is irrelevant, as folks often pay attention to the experimental result that confirms their preference, and then tune out. Case in point: it's surprising how many linguists of a certain vintage continue to think that "trace reactivation" findings from 25 years ago provide evidence for movement/trace-based accounts of unbounded dependencies. I'm mildly sympathetic to such accounts, but it has been clear for 20+ years that such evidence is indecisive. For discussion of attempts to resolve representational disputes using experimental evidence we have a couple of overviews: Phillips & Wagers 2007 (on wh-movement); Phillips & Parker (on ellipsis).
      One case that I think was fairly influential -- I know that it influenced me -- was the argument by Chien & Wexler (1990) that preschoolers show a dissociation between coreference and bound variable anaphora in their application of Principle B. The so-called "Quantificational Asymmetry." Alas, when we tried to defend this finding against concerns raised by Paul Elbourne (2005), we found that he was right (Conroy et al., 2009, Ling. Inq.).

      http://ling.umd.edu/~colin/wordpress/wp-content/uploads/2014/08/phillipswagers2007.pdf
      http://ling.umd.edu/~colin/wordpress/wp-content/uploads/2014/08/phillips-parker2014.pdf
      http://ling.umd.edu/~colin/wordpress/wp-content/uploads/2014/08/ctlp2009.pdf

    3. The best example I have for where psycho tests have altered one's views of representational properties of the grammar is the work on the meaning of 'most' carried out by Hunter, Pietroski, Lidz and Halberda. I discussed this here:
      http://facultyoflanguage.blogspot.com/search?q=Halberda
      Similar results in syntax proper are hard to come by. Colin's remark seems apt to me. In retrospect, the DTC (Derivational Theory of Complexity) arguments deployed against certain transformational derivations now look pretty accurate (e.g. deriving short from long passive). At the time they were not decisive. However, logically, it looks like the arguments were on the right track. So, as a matter of fact, psycho results have not been particularly influential, though I believe that as the field matures, this could change. There are limits to the resolution of our informal methods.

    4. It doesn't meet the criterion of "generally accepted among syntacticians", but one piece of work that makes the kind of argument that we might hope for is this paper by John Hale (2006):
      http://dx.doi.org/10.1207/s15516709cog0000_64
      He compares the classical adjunction analysis of relative clauses with the promotion analysis. Given a particular linking hypothesis (the Entropy Reduction Hypothesis), these two analyses of relative clauses make different predictions about the difficulty of comprehending relative clauses where the gap is in subject position (SRCs) versus in object position (ORCs) versus in indirect object position, etc. It turned out that the predictions of the promotion analysis fit better with the empirical comprehension-difficulty facts than those of the adjunction analysis.

      Now I'm not suggesting that this paper should convince syntacticians to throw the adjunction analysis of relative clauses out the window. In fact I'm happy to admit that, in the debate between those two analyses of relative clauses, John's finding should probably carry much less weight than the traditional kinds of evidence that syntacticians use. (Indeed its weight is probably negligible.) The reason is that the link from a syntactic analysis to a comprehension-difficulty prediction is only as good as the linking hypothesis (i.e. the Entropy Reduction Hypothesis), which, although I happen to like it, certainly can't be taken as a given; and even if one does take the ERH as a given, then there are further nitty-gritty choices to be made about exactly how one supplements a grammar with the probabilistic information that the ERH uses to derive its predictions. So although the choice of grammar (e.g. adjunction vs. promotion) is one factor that contributes to the overall comprehension-difficulty predictions, it is only one of a handful of such factors, so it's hard to know where to assign blame or credit. Having said all that, I think the important point is that, well, the choice of grammar is one factor that contributes to the overall comprehension-difficulty predictions. So one has to shore up all the other contributing factors before drawing strong conclusions about the grammar, but I think this provides the right kind of model for work of the sort that Tal is asking about.
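
      To make the linking-hypothesis point concrete, here is a toy illustration of the Entropy Reduction idea (a constructed sketch, not Hale's actual model or grammar): difficulty at a word is the drop in uncertainty about which complete analysis is being built, and the "grammar" below is just an assumed distribution over four complete strings.

      import math

      grammar = {  # hypothetical probabilities over complete sentences
          ("the", "boy", "who", "saw", "me", "left"): 0.4,  # subject relative
          ("the", "boy", "who", "I", "saw", "left"): 0.1,   # object relative
          ("the", "boy", "saw", "me"): 0.3,
          ("the", "boy", "left"): 0.2,
      }

      def entropy(dist):
          return -sum(p * math.log2(p) for p in dist.values() if p > 0)

      def entropy_reductions(sentence):
          """Per-word difficulty: max(0, H(before) - H(after)) as the prefix grows."""
          prev_h = entropy(grammar)
          for i in range(1, len(sentence) + 1):
              consistent = {s: p for s, p in grammar.items() if s[:i] == sentence[:i]}
              total = sum(consistent.values())
              cond = {s: p / total for s, p in consistent.items()}  # renormalize
              h = entropy(cond)
              yield sentence[i - 1], max(0.0, prev_h - h)
              prev_h = h

      for word, drop in entropy_reductions(("the", "boy", "who", "I", "saw", "left")):
          print(f"{word:>5}: {drop:.2f} bits")

      Swapping in competing relative clause analyses (adjunction vs. promotion) amounts to swapping in differently structured grammars, and hence different probability distributions; that is where the two analyses come to make different word-by-word difficulty predictions.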

      As for the debate between grammatical and "processing-based" accounts of island effects, I think Colin's 2006 paper using (potential) parasitic gaps is one of the more convincing arguments. Extremely convincing, actually.
      http://ling.umd.edu/~colin/wordpress/wp-content/uploads/2014/08/phillips2006-islands.pdf

    5. Another instance is the Kobele et al. paper on the psycholinguistic predictions of head movement versus remnant movement. Some students of mine and I are doing similar work right now on relative clause analyses. The problem with all of them is that they require a linking hypothesis between syntactic analyses and observed processing effects, and there isn't enough evidence at this point to decisively favor one over the other. But at least you can get results like "syntactic analysis S works with linking hypothesis L, and S' works with L', but the other two combinations fail", which is better than nothing.

  2. This comment has been removed by the author.

  3. In my opinion, adverting to Sprouse and Almeida in response to Hagoort's worry about methodological standards is a bad idea, almost as bad as his (ii) suggestion to "do proper experimental research".

    Before continuing, let me flag the fact that I haven't actually read the Sprouse and Almeida papers myself. I disagree with the way that their work has often been presented to me by other linguists, but I honestly have no idea what Sprouse and Almeida actually have to say for themselves.

    Anyway, the reason that I think both Hagoort's (ii) suggestion and adverting to Sprouse and Almeida in response are bad ideas is that doing so legitimizes the worry that we haven't been doing proper experimental research. And I think this worry is indicative of a fundamental misunderstanding of the I-language hypothesis about the nature of language. If the I-language hypothesis is correct—and, as far as I can tell, there is absolutely no reason to doubt this—then judgments from any one speaker are useful for investigating both properties of grammars (Gs, as Norbert calls them) and properties of UG/FoL. Attempting to legitimize our field, then, by either doing 'proper experimental research' or by dismissing the methodological concern by adverting to Sprouse and Almeida validates the concern that judgments from a single speaker cannot tell us anything interesting about language.

    When such methodological concerns are raised I think we should instead be pushing back against the concern itself, not saying that it's okay because everybody has these same judgments and go look at Sprouse and Almeida. Instead, we should be explaining the I-language hypothesis and the evidence for it as well as explaining the untenability of an E-language hypothesis about the nature of language.

    Two last clarifying points:

    First, this is not at all to say that 'proper experimental research' isn't interesting or useful. It's certainly interesting and useful. But, as a strategy for increasing the impact and visibility of linguistics within cognitive science, I think it is completely counterproductive. Again, doing so validates a concern that comes from not understanding what linguists take the nature of language to be, which I think will make it even harder to get cognitive scientists to understand and appreciate linguistics if we are validating their misunderstanding of our central claim about the very nature of language.

    Second, I am also not saying that the Sprouse and Almeida work is uninteresting or not useful. I think it is, but I think it is interesting and useful for a very different reason. What it tells us is that all speakers more or less converge on the same grammar despite having completely different primary linguistic data. And this is something that quite easily might not have been true. It certainly seems to be the case from basic inspection, but it might not have been the case for the 'weird' example sentences that linguists invent. So I think it is interesting and useful to see proof of this hunch that speakers really do converge on the same grammar, more or less, even for the very obscure constructions invented by linguists. I just don't think it's an interesting or useful response to the methodological worries that have been raised by non-linguists, which is how their work has always been presented to me. (Again, I have no idea what they actually say for themselves.)

    Replies
    1. Hmm, I think it really would be worrying if certain types of judgment proved not to be consistent between speakers. I'm thinking particularly of judgments on structures which would not be in PLD. For example, the fact that no English speaker is ok with *[How_i did you ask [who fixed the car_i]]? is important, because we know that this regularity can't derive from any regularity in the PLD. If, on the other hand, it turned out that English speakers aren't at all consistent in their judgments on these structures, then that would undermine the case for the relevant universal constraints.

      @Alex: I agree, and that is more or less what I was trying to get at with my second clarifying point. I am also not suggesting that one should not mention Sprouse and Almeida's work at all. What I was trying to suggest is that mentioning Sprouse and Almeida's work to simply assuage the worries of poor experimental methodology might not be all that helpful—counterproductive even, at least in my opinion—because it could validate the concern that the current informal data collection methodology is bad methodology.

      It's not bad methodology precisely because the I-language hypothesis about the nature of language is true. Language is internal to an individual and arises as the product of UG and PLD, so judgments from a single speaker are useful for investigating the properties of grammars and the properties of UG. This is a fundamental aspect of linguistics, and my suspicion is that the concern from non-linguists about our methodology comes from not fully appreciating this core aspect of the nature of language. Their concern really just seems to be that it is bad methodology, whereas the concern you point out—which I agree would be concerning were it true—seems to be that the hypothesis informing the methodology could have been wrong.

      So anyway, in my opinion what would be more useful (whether it comes with a mention of Sprouse and Almeida or not) is trying to explain why this really isn't bad methodology given the nature of the object of inquiry.

    3. @Adam: I think I agree with Alex D here. My objection to Hagoort's view is not that getting reliable data is a bad idea, or that I-language hypotheses by their very nature are exempt from standard practices of data collection, but that many methods can lead to reliability. Linguists have been pretty lucky in that much of what we have done can be done using very quick and dirty methods. These are fast, cheap, and, if Sprouse and Almeida are right, quite good. At least by the standards in psychology and neuroscience they seem to be very very good. This provides more than sufficient reason for continuing to use them, understanding that they might not suffice in some cases and that more careful methods might be called for.

      Last point: the problem is not only judgments across speakers, but even from a single one. I assume that judgments that change from trial to trial, even within one individual, would be treated gingerly. So the problem is not one person vs. many so far as I can see. Even if we are all investigating a single speaker/hearer, the reliability of the acceptability data is an issue.

      Really the last point: I agree with your last statement, but I think that this is what S&A have done. They have shown that the way that linguists gather data, at least over a very large range (see Colin below), is fine. It works and we should keep doing this. Or, put another way, data so collected is prima facie accurate. This is defeasible, as all data are, but a good way to proceed.

    4. @Norbert: I suspect we all ultimately agree, but I think there is something still floating under the surface. Let me try to be a little bit more careful by talking specifically about the data that I have in mind when making these points.

      For the purposes of this discussion, we can roughly separate linguistic data into three types: (i) the judgments are clear and speakers of the "same language" (note the scare quotes) agree on the judgments, (ii) the judgments are clear and some speakers of the "same language" disagree on the judgments, and (iii) the judgments are not clear.

      You have already said what I would have said about the first type of data in your response to Ted. You wrote "The need for more careful looks at the data used to be less frequent. Why? Because the data was pretty clear. Extraction effects from islands vs non-islands are not that hard to judge."

      Regarding data of the third type, I think more rigorous experimental methodology can be useful here. If there is some interfering factor that is making the judgment difficult then rigorous methodology can probably help triangulate what the 'correct' judgment is.

      But with data of the second type we have something that I think gets glossed over when simply rejoindering with Sprouse and Almeida. If judgments are clear and speakers of the "same language" don't agree, it's not the case that doing experimental work will lead to us determining what the 'correct' judgment is; both judgments are correct. And we have an interesting fact to explain: how did speakers of the same "language" end up with slightly different grammars? Variation work that takes this issue seriously seems to be on the rise, and I think that's great.

      However, if we took Hagoort's suggestion seriously to just do more rigorous experimental work, we might not be aware of the need to make this distinction between data of the second and third types. We could, depending on the sample, lose sight of the interesting fact that variation does happen and speakers of the same "language" sometimes have slightly different grammars; instead, our experimental methodology might have us converge on one of the sentences being 'correct', discounting the judgments of speakers who said otherwise.

      And furthermore, I think simply appealing to Sprouse and Almeida as a response to methodological concerns doesn't do enough to highlight this difference either. Though again, I haven't read the papers. Maybe this is just how they have been presented to me. Or maybe I'm at fault: it very well could be that I've misunderstood how other linguists have tried to present Sprouse and Almeida's work to me. (It's time I read the papers myself.)

      So again, I think it's counterproductive to simply rejoinder with Sprouse and Almeida to these concerns as a strategy for increasing the impact and visibility of linguistics within cognitive science. Instead, we should be trying to defuse the concern and motivate the concepts that show that the concern is unwarranted. And it seems that motivating other conceptual distinctions might be necessary, too. Ted's comments show the need to be clear about the competence/performance distinction and the related grammaticality/acceptability distinction in addition to the I-language/E-language distinction.

      Lastly: it's interesting that Ted notes that "one needs about 7 people to agree unanimously on the judgment [... to] be reasonably confident that the contrast is real". This seems to be methodologically rigorous justification of the purportedly "bad methodology" we have been using. Ask 7 people and everybody agrees: we have data of the first type. Ask 7 people and the judgments are clear but there is disagreement: maybe we have data of type 2; maybe you want to run an experiment. Ask 7 people and the judgments are fuzzy: you should probably run an experiment.
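
      For what it's worth, the arithmetic behind that "about 7 people" figure looks like a simple sign test; here is a back-of-the-envelope sketch (my reconstruction, not necessarily Ted's actual calculation):

      # Under the null that each informant is equally likely to prefer either
      # sentence, the chance that all 7 agree in the predicted direction:
      p_one_sided = 0.5 ** 7       # = 0.0078125
      p_two_sided = 2 * 0.5 ** 7   # = 0.015625, still under the usual .05 cutoff
      print(p_one_sided, p_two_sided)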

    5. One can use the more careful methods to look for dialect variation. One will simply get bimodal distributions. So, I am not sure that the second point isn't also covered.
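
      Here is a minimal sketch of what that would look like (simulated ratings from two hypothetical sub-grammars; the mixture-model comparison is one standard way to detect bimodality, assumed here for illustration, not a prescription):

      import numpy as np
      from sklearn.mixture import GaussianMixture

      rng = np.random.default_rng(1)
      # Hypothetical split: 60% of speakers rate the sentence good (~6/7),
      # 40% have a grammar that rejects it (~2/7).
      ratings = np.concatenate([rng.normal(6, 0.7, 60), rng.normal(2, 0.7, 40)])
      X = ratings.reshape(-1, 1)

      for k in (1, 2):
          gm = GaussianMixture(n_components=k, random_state=0).fit(X)
          print(f"{k} component(s): BIC = {gm.bic(X):.1f}")  # lower BIC wins
      # A decisive win for two components, with means near the two dialects,
      # is the signature of genuine variation rather than noise.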

      Re "So again, I think it's counterproductive to simply rejoinder with Sprouse and Almeida to these concerns as a strategy for increasing the impact and visibility of linguistics within cognitive science. Instead, we should be trying to diffuse the concern and motivate the concepts that show that the concern is unwarranted"

      I guess I see S&A making just this point. If they are right, then the data is indeed good enough for our purposes. How do we know? Well, because doing it the more careful way leads to the same conclusions. This seems to me to disarm any useful criticism.

      The things that Ted pointed to we already do. Everyone asks more than 7 people and the judgments are generally ok. Always ok? No. We do have disagreements, but they are LOCAL not global. What S&A showed, IMO, is that GLOBAL skepticism concerning this method of data collection is unwarranted. Local skepticism does not undermine the enterprise. Global skepticism is intended to. What S&A demonstrate is that there is no ground for such global skepticism and so the general results linguistics has obtained are not questionable on data grounds. Of course, this or that analysis may be. But not the whole shebang. And that's what Hagoort is aiming for.

      That said, I agree that we need to keep making distinctions and setting them right about what is being done (and what is worth doing). But, S&A really does help.

  4. Norbert: I agree with some of this, but not all. You are reliably informed that Peter Hagoort is more receptive to linguistic findings than others of his ilk. He is more inclined than others in his area to pay attention to explicit (psycho)linguistic models.

    I take his first observation not to be an assertion that Churchland is right, but a descriptive comment that the language of thought is no longer in the ascendancy. That seems descriptively accurate, regardless of whether it is desirable.

    Hagoort's second observation [and you might want to do a global search/replace to spell his name right] is about PR. I agree with you, of course, that there's lots and lots that has been stable over time. But I'm sure that you also agree that we could do a better job of making that apparent on the outside.

    I of course think that the gripes about data quality are overstated, and are an excuse for simply dismissing a field. I don't believe for a minute that the adoption of quantitative methods would make the broader world pay closer attention -- I know because we use those methods in our work all the time, and it doesn't change the fact that many would rather not worry about the phenomena that we care about. The notion that they'd pay attention if only we used careful methods is a fraud. They don't pay attention because they think that the key to understanding language lies elsewhere. Simple as that.

    However, I would urge a bit of caution in characterizing the Sprouse & Almeida findings (recently more-or-less replicated by Gibson & colleagues). There is a sampling bias in the phenomena discussed in those studies. For practical reasons, they focus on cases where acceptability judgments involve just string acceptability, and do not rely on interpretations, prosody, focus structure, etc. Some of the most contentious cases involve phenomena that are excluded from the large sample studies because they're hard to test. Testing those additional phenomena either (i) yields a mess when done using the same methods as Sprouse & Almeida (Gibson et al. see this; we've seen this ourselves), or (ii) requires more laborious methods.

    Replies
    1. I should clarify: I'm not dismissing the Sprouse & Almeida findings. Merely cautioning that one needs to be aware of what they did and did not test. For their study based on sampling from Linguistic Inquiry, only 48% of the English example contrasts met the eligibility criteria for inclusion in their study, i.e., most examples couldn't be tested. I suspect that their conclusions would be generally correct for the cases that they excluded, but that their approach wouldn't work so well for testing that (as they recognize themselves).

    2. I do believe that we can do more to advertise our virtues. And given that these are doable and would be beneficial, we should do them. But I also think that Hagoort's (note the spelling) views explain why this will be a very hard slog and we should not expect much success (at least not quickly). The debate is an intellectual one. The mainstream GG tradition sees the central issues differently. What happened in the early days is that with the collapse of Behaviorism a window opened for a more coherent view. But behaviorism was a pretty crude version of Empiricism (complete discarding of mental states, rather than crude associationist views of mental states) and the empire struck back. There is a resurgent Empiricism that Hagoort has bought into and this is not going away because of some good ling PR. So, I agree with your second to last paragraph entirely.

      So what to do? I think that the only option is to pursue our program and show that it works. In the process, we should pick fights that clarify our goals and views and highlight the differences we have with others. This sharpening of the differences is important, I believe. We need to do all that we can to make Hagoort and those even less friendly defend what they do. They should not get a free pass.

    3. Yes, but apropos the other recent thread about dualism and unification, "pursuing the program and showing that it works" should involve taking the unification challenge seriously. Linguists need to show that they can rise to that challenge, and do it better than the alternative. If they instead issue an indefinite IOU on that challenge, then they should not be surprised to elicit the complaint that "this doesn't fit with what we know about X".

      I don't think that rehashing old complaints about empiricism etc. takes us very far. That's not what is at issue here. Mainstream GG is not typically engaging with cognitive/algorithmic levels of analysis, and that has nothing to do with behaviorism/empiricism. Folks in Hagoort's neck of the woods are legitimate in raising that complaint. And GGers are within their rights to claim that they think it's premature or unhelpful to venture into that domain at present. As usual, we place our bets on what will yield the most insights. Taking refuge in Chomsky-Skinner encourages an unfortunate complacency.

    4. Taking the challenge seriously entails respecting the "bodies of doctrine" that each term of the desired unification has unearthed. One of the problems with work in cogneuro that I know about is that it does not take what we have done seriously. As part of the rationalization for ignoring this work, they tend to adopt views that would discredit it in principle. After all, it's hard work debunking the details. So, you assume that the whole approach cannot be right. This is where the Empiricism comes in. It serves to justify ignoring the results. So, like it or not Colin, these issues must be engaged, for they stand in the way. Better to have them out there up front than in the background tacitly taking methodological potshots under the rubric of "good science" (viz. empiricism with a small 'e').

      As for who needs to rise to the challenge, not surprisingly, I think that I am not fully on board with you here. It's EVERYBODY's challenge, not just linguistics'. To make it linguistics' SPECIAL challenge is to suggest that what we have done up till now has no value unless this missing piece is added. I don't buy this, and putting things this way does channel the suspicion that linguistics has produced nothing of interest, the position that Hagoort more or less endorses.

      What about going forward? Here is how I see things: Linguistics has provided a pretty good description of a large array of G phenomena, and a not bad (albeit hazy at times) outline of the kinds of knowledge that Gs can contain (i.e. UG). How this knowledge is put to use algorithmically is currently a hot area, as you know. We have some results in parsing and acquisition, but research is in the early stages. We don't yet have many PRINCIPLES of parsing and/or acquisition comparable to the PRINCIPLES of G that we have in linguistics. This is not to disparage this work, but to recognize that it is currently less developed. Nonetheless, this is an ongoing enterprise with increasing payoffs to investment.

      The weakest area, IMO, is the embodiment problem: how does all (any?) of this get grounded in brains? Here the work is at a pretty primitive stage. We have some indices that correlate with some kinds of language effects, but it's all pretty meager stuff (IMO, and I suspect yours). There is some exciting stuff out there that may get further (especially in phonology and morphology and some very recent stuff on phrase structure), but right now the neuro part of the cogneuro of language looks to me pretty weak. The problems are hard, of course. But the results are pretty meager to date.

      So, what to do when people like Hagoort (see, I can be taught) say what they do? Well, first off, challenge them and do not concede that linguistics has done nothing of value and has discovered nothing worth knowing. Too much concession encourages them to ignore us. Second, outline areas where collaboration has been fruitful. Third, be as open to their concerns as seems reasonable given what we know. This is where I really found Hagoort's piece a failure: given what he wrote, there is little of value that I see coming out of any collaboration with him. And the saddest part is that he is linguistics' closest friend (or so I am told). That means that we are on our own. If there is complacency, it's not in linguistics. I know of places that take his concerns seriously. I know of almost none that take ours equally so. That's the problem.

  5. I have some comments on Hagoort's piece here:

    http://vasishth-statistics.blogspot.de/2015/02/quantitative-methods-in-linguistics.html

  6. Two more comments:

    1. Was Norbert deliberately mis-spelling Hagoort's name, perhaps to mock him?
    2. " If Sprouse and Alemeida are right (which I assure you they are; read the papers) then there is nothing wrong with the data that GGers use."

    One should never be 100% sure of anything. There is always uncertainty, and we should openly discuss the range of possibilities whenever we present a conclusion, not just argue for one position. That has been a problem in psychology, with overly strong conclusions, and that is a problem in linguistics, experimentally driven or not. But this is especially relevant for statistical inference. We can never be sure of anything.

    Replies
    1. Thank you for sharing the link to your blog. Interesting perspective. Being not terribly familiar with statistics myself, I take your word on the need for expertise before using these methods. Still, I think from your list the most severe issue is:

      9. Only look for evidence in favor of your theory; never publish against your own theoretical position

      If one is not willing to consider one's theory [or program] could be wrong, the most intimate knowledge of statistical methods is of little help ...

    2. @Shravan: No, Norbert was not. Note that I put up a mea culpa as an addendum to the post.

      But I think that I disagree with your second point about being sure. One way of taking your point is that one should always be ready to admit that one is wrong. As a theoretical option, this is correct. BUT, I doubt very much anyone actually works in this way. Do you really leave open the option that, for example, thinking takes place in the kidneys and not the brain? Is it a live option for you that you see through the ears and hear through the eyes? Is it a live option for you that gravitational attraction is stronger than electromagnetic forces over distances of 2 inches? We may be wrong about everything we have learned, but this is a theoretical possibility, not what in the 17th century was called a moral possibility. Moreover, there is a real downside to keeping too open a mind, which is what genuflecting to this theoretical option can engender. I find refuting flat earthers and climate science denialists a waste of intellectual time and effort. Is it logically possible that they are right? Sure. Is it morally possible? No. Need we open our minds to their possibilities? No. Should we? No. Same, IMO, with what GGers have found out about language. There are many details I am willing to discuss, but I believe that it is time to stop acting as if the last 60 years of results might one day go up in smoke. That's not being open minded; or if this is what being open minded requires, then so much the worse for being open minded.

      Let me say this another way: there are lots of things I expect to change over the course of the next 25 years of work in linguistics. However, there are many findings that I believe are settled effects. We will not wake up tomorrow and discover that reflexives resist binding or that all unbounded dependencies are created equal. These are now established facts, though there may be some discussion of the limits of their relevance. But they won't all go away. Yet this is precisely what Hagoort thinks we should allow for, and on one reading you are suggesting it as well. Maybe we are completely wrong! Nope, we aren't. Being open minded to this kind of global skepticism about the state of play is both wrong and debilitating.

      Last point: you are of course aware that your last sentence is a kind of paradox. Is the only thing we can be sure of that we can never be sure of anything? Hmm. As you know better than I do, this is NOT what actually happens in statistical practice. There are all sorts of things that are held to be impossible. In any given model the hypothesis space defines the limits of the probable. What's outside has 0 probability. The real fight, always, is over what is possible and what not. Only then does probability mean anything.

    3. Norbert, your blog's comments section does not allow more than 4096 characters, so I posted my response here:

      http://vasishth-statistics.blogspot.de/2015/02/another-comment-on-hornsteins-comments.html

    4. @Shravan
      Ahh, the price of hyperbole is literal confusion. Of course I agree with you that your claim taken literally is absurd. Good to know we agree. As you point out, the question then is what is "obvious"? And here is my point. I am more than willing to be LOCALLY skeptical about many linguistic results. As some might say, "mistakes have been made." So sure, local skepticism about this or that conclusion is fine with me. However, there is a reading of Ted's stuff (a reading that Hagoort, I believe, endorses) that moves from local skepticism to GLOBAL skepticism: all of linguistics is untrustworthy because of the data collection problem. In other words, for Hagoort, there is no there there. This I reject as absurd. And S&A's work speaks to this: it shows that this kind of global skepticism is silly even on Hagoort's own terms.

      Now, once we agree to this, the next question is whether doing data collection more carefully is worth the extra effort. It is an entirely empirical question. And for this my answer is that it depends on the question being asked. For many matters the answer is no. For some yes. Ted suggests that we can get more information out of the data if we are more careful. My question is what new questions this data addresses so that it is worth being more careful. I am willing to do this IF there is an obvious payoff and I don't consider assuaging psychologists' methodological fears to be a very good reason. Take that back: for propaganda reasons maybe we need to be more careful. But for scientific reasons it's less than clear to me that this is so. Our practice has been fine and the results discovered solid.

      Last point: Nothing I said implied that linguists are perfect or that their analyses are right. There is lots of room for data disputes. Interestingly, if I read you right, you were skeptical about Mahajan's data BEFORE you did a rating experiment. Great. So you did more careful work to show you were right. Yippee. And it seems that you even change your mind from time to time. Good for you. So do I, so do other linguists, so does everyone reasonable. But this is not what Hagoort was advocating. His opposition to the field was total, not piecemeal. It is the former that I find ridiculous and that S&A put to bed.

      Let me end with one more observation: you are no doubt aware of the misuse-of-stats discussion arguing that the results in psych/biomedicine/neuroscience etc. are largely bad stats. Imagine someone concludes from this that results based on stats cannot be trusted, so that no paper that uses them is scientifically kosher. You (and I) would argue that this global conclusion is nuts. It really depends on the cases. When done well, the stats methods are fine. True, they can be done badly, but who cares. There is nothing wrong with the METHOD, just individual applications thereof. My point exactly. And that was why and how I used the S&A results.

    5. This comment has been removed by the author.

    6. Norbert, I don't think the distance between your ideas and mine is too great. I am with you (and Colin) that not every single intuition-based judgement we make to build syntactic theories needs an experiment. I mean, even Ted (or Peter) would not say that we need to do an experiment to decide that "boy the came" is unacceptable/ungrammatical in English. (Although I *am* guessing here.)

      In my own case, I started out wanting to be a syntactician, and what happened to me was that the theoretically interesting moments (when one position could be distinguished from another) invariably were about some very delicate judgements. What led me to quit was that whoever had more tenure got to decide what was OK and what was not. I think that Ted and you and Colin would all agree that in such cases, we are better off getting some objective data, although I admit that even that is hard (when we design an experiment, we often unconsciously--or not--bias it to come out in our favor). But it's better than nothing and better than brandishing one's personal sensitivity to the construction at hand to deliver a judgement. That's how I read Peter's comments.

      Anyway, the reason I jumped into this fray was actually something different. My original point can be stated quite provocatively. Peter wrote: "Do proper experimental research (including the use of inferential statistics) according to the quality standards in the rest of cognitive science." It is not my impression that the quality standards in psycholinguistics and cognitive science are particularly high. I often ask myself why we even bother to do experiments; the results are always going to prove the experimenter right, right? It's a waste of taxpayer money. How often does an experimenter write a paper that says, folks, I got this wrong in my previous paper? So I started out turning away from intuition-based linguistics, and now here I am, ready to turn away from psycholinguistics (not really).

      I think Peter's point (not taken literally) is laudable: we should do more empirically grounded work in linguistics proper, empirical work of the sort I'm talking about above (i.e., where things get tricky and the potential for bias is high).

      What I really wanted to get across is that we can do better than the standard empirically grounded disciplines have done, by making statistics a serious part of the core curriculum of linguistics proper. The reason that many experimental researchers in (psycho)linguistics dismiss statistics as beside the point (by their actions or their words) is that they don't have much contact with what's at stake when you engage in statistical inference. It's very similar to the way that linguists' work is dissed by non-linguists. The less you know, the more contemptuous you are of the unknown. Non-linguists don't understand why syntacticians think they have finely tuned judgements; it takes serious training to come up with accurate judgements. It's the same with statistics (mutatis mutandis).

      Anyway, I just wanted to put this out there because I feel that people just don't acknowledge the problems in psycholx. But I'm not worried; nobody is going to die if one rating comes out one way or another. The same problems are unfolding in medicine, where the consequences are much greater.

    7. @Shravan
      I think we do agree, more or less. We likely have differences of detail rather than differences of principle. However, I believe that you generalize from your rational beliefs to others whose ambitions are far less generous. There really are people who think that Generative Grammar has been a tremendous failure and that there is no reason to consider what has been found as relevant to anything in cogneuro. And one of the expressed reasons for this view is that GG is built on suspect data. This view cannot be allowed to stand, nor, IMO, can even suggestions of this view be allowed to stand. That's why I came out hammer and tongs against Hagoort's (I'm not close enough to call him 'Peter') views.

      I sympathize with your experience wrt subtle judgments. This does occur. However, what often also happens is that over time the data clear up. How? By looking at similar phenomena in other languages where the judgments are much clearer (often due to morphosyntactic indicators of the relevant phenomenon). So, just as in other sciences, where the first data are massaged into clarity by later refinements of the experiments, so too in linguistics, much of the time. Also, much of the time, at least where theory has been most successful, the data are pretty clear from the outset and have continued as such. Think Islands, ECP effects, binding, Crossover, control. Here the data have not been particularly fuzzy, though, of course, there are controversies at the margins. IMO, the problem with linguistics has not been a data problem but a theory problem. Much of what we have called theory is pretty low level. Indeed, I think that this is the main problem in cogneuro as a whole. We are data rich and theory poor. And all of this toing and froing about the quality of the data misses the real problem: we have a rather shallow understanding of the relevant cogneuro mechanisms at play. And this, to repeat, is a problem of ideas, not a problem of data.

      Stats for linguists? Sure, why not. Many of our students at UMD already do this. Stats is the kind of thing any educated person today should understand a little of, if only so as not to be misled. From the little I see, it is incredibly hard to apply this stuff well. A good amount of the time it is misapplied. And when it is dressed up in fancy notation (something stats is very good at) it becomes harder rather than easier to see how misleading the data claims might be. That said, it is a fine technology and sure we should learn how to use it WHEN AND WHERE USEFUL. Recall your example above ("boy the came"), which you are sure (sort of) even Hagoort would deem unacceptable without stats backup.

      So, I think we are on the same page. I don't think we are ALL on this page. Let's hope that everyone comes to join us very soon. Then we can put this heated yet not particularly deep discussion to rest.

  7. This comment has been removed by the author.

  8. Hi Norbert:

    You raise many points that are worth discussing. One point that I will respond to:

    "Sadly, this is just wrong. There has been a lot of back and forth on these issues over the last five years and it is pretty clear that Gibson and Federenko’s worries are (at best) misplaced. In fact, Jon Sprouse and Diogo Almeida have eviscerated these claims (see here, here, here and here)."

    I am not sure if you have read Gibson & Fedorenko (2013) or Gibson, Piantadosi & Fedorenko (2013):

    http://tedlab.mit.edu/tedlab_website/researchpapers/Gibson_&_Fedorenko_2013_LCP.pdf
    http://tedlab.mit.edu/tedlab_website/researchpapers/Gibson_et_al_2013InPress_LCP.pdf

    (The second one summarizes the first and responds to many of Sprouse & Almeida’s claims directly.)

    Sprouse, Schutze & Almeida (2013) show that, in a random sample of 146 LI judgments, approximately 90-95% of them are right. Mahowald et al. (submitted; link below) replicate this number on 100 different LI contrasts from the same set of years: about 90-95% right, with 5-10% errors, depending on how conservative one is in deciding on significance levels. Perhaps you think that this shows that not doing quantitative work is ok. On the contrary, we argue that there is a real problem, for the following reasons:

    (a) If you don't do the experiment, you never know which 90-95% are ok. This is a serious problem. This means that some fraction of the critical contrasts for your theories are wrong, but you don’t know which ones. This problem is even more severe if you don't speak the language: then you don’t even have your own intuitions about which judgements are probably right, and which ones might be questionable (or wrong).

    (b) Effect sizes: you get no effect size information without a quantitative experiment. Sprouse et al. and Mahowald et al. show that the effect sizes lie on a continuum, from very tiny (and non-existent) to huge. The notion of grammaticality presupposes some threshold (between “grammatical” and “ungrammatical”) that probably isn't there. In real language, the effects are probabilistic. It's impossible to find this out from an armchair experiment, if one presupposes a threshold between grammatical and ungrammatical.

    (c) Relative judgements across many sentence types: without quantitative methods, you can’t compare judgments across experiments. So even if sentence a is better than sentence b, you won’t have judgment data for the comparisons of a and b relative to many other structures, without a quantitative experiment.

    (d) Interactions: there is no way to measure an interaction among multiple factors using a non-quantitative experiment. (A toy sketch follows below.)
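
    (A minimal sketch of point (d), with invented Likert means: the interaction is a difference of differences, which informal judgment collection cannot quantify.)

    # Invented mean acceptability ratings (1-7 scale) for a 2x2 design,
    # e.g., island vs. control crossed with gap vs. no gap.
    means = {
        ("island", "gap"): 2.1,   ("island", "no_gap"): 4.9,
        ("control", "gap"): 5.8,  ("control", "no_gap"): 6.0,
    }
    gap_cost_island = means[("island", "no_gap")] - means[("island", "gap")]
    gap_cost_control = means[("control", "no_gap")] - means[("control", "gap")]
    interaction = gap_cost_island - gap_cost_control
    print(f"interaction = {interaction:.1f}")  # 2.8 - 0.2 = 2.6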

    So, I would say that the state of the field is very different from your description (“it is pretty clear that Gibson and Federenko’s worries are (at best) misplaced”). Rather, it seems clear to me (and to many others) that quantitative experiments are clearly useful, and can move the field forward in a productive direction. Indeed, I was at a meeting not long ago attended by Jon Sprouse, Diogo Almeida, Carson Schutze and Colin Phillips (among others), and I think that they all agreed with the points above.

    cont’d in next comment: I seem to be limited to 4096 characters

    Ted Gibson

    Replies
    1. @Ted
      First off, thx for the comments. I will read the papers and maybe will have something to say down the road. But even without doing so, I have a couple of comments.

      First, the question is not whether these are worth doing, but when to do them and whether not doing them ipso facto invalidates the findings of GG. They are worth doing (I've even done them myself), especially when judgments are dicey. And they have indeed been used productively, IMO (see Sprouse's work on islands and Jurka's on DOCs). The need for more careful looks at the data used to arise less frequently. Why? Because the data were pretty clear. Extraction effects from islands vs non-islands are not that hard to judge. The effect sizes are pretty big. Even ranking effect sizes among the islands (e.g. wh-islands more acceptable than relative clauses) is often pretty easy. That said, there are cases where this is not so clear, and then using these methods is a good idea. So, I have nothing against these methods; I just don't see that failure to use them invalidates a claim. And I certainly don't see their not being used heretofore as invalidating GG's basic results. This is often the take-away message (one that Hagoort, for example, seemed attracted to), and this I believe is an invalid conclusion.

      There is now a second question: going forward should we all adopt these methods? Here I am again unconvinced. Again it depends on the claims being made and the "subtlety" (a technical weasel word in linguistics) of the data being deployed. I'm very pragmatic here; you measure what needs measuring. Would I reject a paper that didn't measure the "right" way? No. Would I hold it against one that did? No. It really depends.

      That said, I completely agree that such methods have been useful and can be used to "move the field forward in a productive direction." Where I am less convinced is that they are necessary for productive work, and whether they should become the standard in linguistics. Personally, I don't think we have yet squeezed all the juice from the messier techniques.

      Last point: You say:

      "The notion of grammaticality presupposes some threshold (between “grammatical” and “ungrammatical”) that probably isn't there. In real language, the effects are probabilistic. It's impossible to find this out from an armchair experiment, if one presupposes a threshold between grammatical and ungrammatical."

      So far as I know, none of these experiments can measure grammaticality. They at best measure acceptability. Grammaticality is a function of many things beyond acceptability. So, grammaticality can be categorical without acceptability being so. Whatever one does with the new or the old techniques will still require factoring out grammaticality, and this will necessarily be an inferential process, as grammaticality is an unobservable. We infer facts about grammaticality from judgments of acceptability, but they are different things. So, the fact that there is a continuum in judgments is well known and has been so for a long time. From what I understand, the newer methods won't change this, as they are more careful measures of the very same quantity: acceptability.

      Again, thx for the feedback. I will look at the papers you sent me.

  9. …cont’d from previous comment

    (Colin makes it clear in his comments here and elsewhere that he thinks that there are more important issues to be addressed in the field of linguistics. This may be so. But that seems orthogonal to the point that we are making. If we know how to solve this problem, then we should solve it. I agree with Colin that *just* adopting quantitative data analysis standards may not make a broader field more interested in the particular questions that one addresses (such as the questions that Colin feels he and his colleagues are addressing). But adopting more quantitative standards at least gives us all a common language to discuss whatever results we want, and it therefore makes it possible for people in other fields to pay attention to these questions. It is my hope that adopting more quantitative standards will make it clearer what the more important effects are, which more people should care about.)

    One further note on this issue:

    Colin and others have said that one reason that people may not want to use quantitative methods is that it may be too expensive: perhaps it’s hard to get a lot of native speakers for the language in question. As a result, Kyle Mahowald et al. worked out the math to see how many people one needs to be pretty sure that a difference between two constructions is a real difference. Assuming the same distribution of judgments as across the 100 judgments that we sampled from Linguistic Inquiry, the answer is about 7: one needs about 7 people to agree unanimously on the judgment (with a different version of the contrast given to each person). If you do that “mini-experiment” (which we call a “SNAP judgment”), then you can be reasonably confident that the contrast is real. If the judgments aren’t unanimous, then you probably need to do a full-blown experiment to test your claim.
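
    (One plausible reconstruction of the arithmetic, assuming a simple sign-test logic; this may not be Mahowald et al.'s exact derivation.)

    # Under the null that the two versions are equally acceptable, each
    # participant prefers version A with probability 0.5, so unanimous
    # agreement among n participants has probability 0.5**n.
    for n in range(4, 10):
        print(f"{n} unanimous judgments: p under null = {0.5 ** n:.4f}")
    # n = 7 gives 0.0078 < .01, hence "about 7 people".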

    Here’s a draft of this paper:
    http://web.mit.edu/kylemaho/www/SNAP.pdf

    Note that doing these mini-experiments solves problem 1 above but not problems 2, 3 or 4: you still don’t get effect sizes, relative judgments across sentence types or interactions. So this is not ideal. But it’s a step forward.

    Ted Gibson (egibson@mit.edu)

    Replies
    1. @Ted:
      "But adopting more quantitative standards at least gives us all a common language to discuss whatever results we want, and it therefore makes it possible for people in other fields to pay attention to these questions. It is my hope that adopting more quantitative standards will make it clearer what the more important effects are, which more people should care about."

      Color me very skeptical. What people in other fields should be paying attention to is not a presentation of the acceptability judgments but the generalizations that have been offered and the principles that have been unearthed. Like I've often said, I see no reason to believe that these are not more or less right as described. There are island effects, there are binding effects, there are ECP effects, phrase structure exists, fixed subject constraints hold, etc. I have seen nothing that invalidates these broad conclusions. Indeed, I would go further (though many others might not): were we to find that careful experimental techniques showed us that there are no, e.g., ECP effects, then this would indicate that these methods are untrustworthy. It would be a little like finding out that careful experimental procedures showed that the Müller-Lyer illusion is bogus. If that were to happen, we would go back and examine the techniques, not the effect. I believe that there are many results in linguistics like this.

      Moreover, these results are regularly reconfirmed experimentally. Colin and Jeff have to satisfy journals that prize such techniques. They thus run experiments to show that these judgments hold. To my knowledge, they almost always succeed. So, in addition to Jon's stuff, I get daily confirmation that the informal techniques largely pan out. My conclusion: it's not the lack of rigor in the data that is the root cause of people in other fields ignoring this work.

      I do have a question: I assume by effect sizes you mean how reliable the judgment is. I was led to believe that the methods being used are all ordinal, not cardinal. As I recall, Jon had a discussion of this in his thesis (or someplace). So, to know how A compares with B we need to compare A and B directly. Even comparing A and B both to a common C does not necessarily tell us how A and B will compare to one another. This is why 7-point scales are about as good as magnitude estimation studies (or so I've been led to believe). Have I been misled?

    2. Hi Norbert:

      1. My experience with checking the judgments of hypotheses in the field is not as positive as Colin's, but probably that is because we have explored different phenomena. I have always learned something from doing the experiment more rigorously, such as how big the effect is, and how it compares with other related effects. I agree that "informal techniques largely pan out". But as I commented above, if the judgment is one of the 5-10% that doesn’t, that’s very informative. (And without doing the experiment, one can’t know this.)

      2. “it's not the lack of rigor in the data that is the root cause of people in other fields ignoring this work.” I don’t think I have made the claim that lack of rigor in the data is the root cause of other fields ignoring work. What Ev and Steve and I had been saying in our work was that this is one area that is pretty easily fixable, and would help (a little) to connect linguistics work back to cognitive science more generally.

      If everyone did more quantitative work, we would get the following benefits:

      (a) We would find out whether a proposed effect is real or not. This is the issue that you and Sprouse and Phillips et al have been discussing, and that you doubt is of much value to the field. But there are many other benefits to doing quantitative work. See below.

      (b) We would find out how big the effects are. The field as a whole will probably want to explain the bigger effects before we explain the smaller ones, so this is valuable information.

      (c) We would find out how some effects relate to other effects.

      Part 1: maybe there is an acceptability difference between sentence type 1 and sentence type 2, but maybe both are much less acceptable than sentence types 3 and 4. Our theories need to explain this kind of result.

      Part 2: perhaps an acceptability difference between sentence types 1 and 2 is proposed to correlate with an acceptability difference between sentence types 3 and 4. E.g., long-distance-dependency formation is supposed to underlie both wh-questions and relative clauses. How do we measure this proposed correlation without a quantitative experiment? I don’t think you can.

      3. “I do have a question: I assume by effect sizes you mean how reliable the judgment is.”

      No: you can have very reliable effects that are tiny. You can be really sure that they are there by getting more data (more participants and items). And you can have a huge effect which you are not confident in, because of lots of variance in the measure.

      For our purposes here, effect size is roughly how far apart two means are (ignoring the variance): http://en.wikipedia.org/wiki/Effect_size
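
      (To make that concrete, a minimal sketch with invented ratings: the raw effect size is just the difference of the two means; Cohen's d, a standardized variant, divides by the pooled SD so that effects are comparable across rating scales.)

      import math
      import statistics

      # Invented acceptability ratings for two sentence types.
      cond_a = [6, 7, 6, 5, 7, 6]
      cond_b = [3, 4, 2, 3, 4, 3]

      raw = statistics.mean(cond_a) - statistics.mean(cond_b)  # raw effect size

      n1, n2 = len(cond_a), len(cond_b)
      s1, s2 = statistics.variance(cond_a), statistics.variance(cond_b)
      pooled_sd = math.sqrt(((n1 - 1) * s1 + (n2 - 1) * s2) / (n1 + n2 - 2))

      print(f"raw difference = {raw:.2f}, Cohen's d = {raw / pooled_sd:.2f}")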

      4. Regarding different methods for gathering data, I don’t think that any one method is better or worse overall. It depends on the question you are asking. For acceptability judgment data, I think that Likert scales (e.g., 1-5, 1-7) or magnitude estimation or forced choice are all about the same. (See, e.g., some of Jon’s work, or Weskott & Fanselow (2009), referenced in Gibson & Fedorenko, 2013.) It’s hard to say one is “better” than the others. Any of these will get you somewhere.

    3. @Ted. I'm not sure that you directly say "root cause", but I think that it's remarks like the following that people have taken to amount to that claim, since you do talk about cause and consequence:
      "allowing papers to be published [...] that don’t adhere to high methodological standards has the undesirable effect that the results presented in these papers will be ignored by researchers with stronger methodological standards because these results are based on an invalid methodology. If the standards are strengthened, then researchers with higher methodological standards are more likely to pay attention, and the field will progress faster. Relatedly, researchers from neighbouring fields such as psychology, cognitive neuroscience, and computer science are more likely to pay attention to the claims and the results if they are based on data gathered in a rigorous way. [...] Indeed, we would argue that the use of the traditional nonquantitative methodology has in fact created a crisis in the field of syntax. Evidence of this crisis is that researchers in other fields of cognitive science no longer follow many developments in syntactic research." [Gibson & Fedorenko 2013]

    4. ... but setting that aside, the part of Ted's comment that bears emphasizing is the first one. It's true that he and I have drawn different conclusions from our hands-on experience, and getting to the bottom of why that is would be valuable. Ted is right that our experiences are based mostly on distinct samples of phenomena. Our rating studies almost always focus on phenomena that we're preparing to test for other reasons, typically reading-time or ERP studies. And we wouldn't bother designing those studies if we weren't already confident of the phenomena. (Sometimes they're phenomena that linguists themselves have reservations about, e.g., parasitic gaps; but I wouldn't embark on the study if I wasn't confident.) Some of Ted's studies that he cites most often come from a starting point where he was skeptical of a claimed acceptability fact, and his intuitions were corroborated. Sometimes these involve cases where prosody, focus, or interpretation might matter. Meanwhile, in the Sprouse & Almeida studies, sentences whose status depends on meaning or prosody are systematically excluded, so those look at another set of phenomena again.

      Another dimension that might be relevant is how and why the materials are created. For our studies, we're generally preparing for an online study and so we go to great lengths to ensure the naturalness/plausibility of the sentences, to minimize the risk of irrelevant disruptions. That takes a lot of time, and it is undertaken by the lead researcher (typically a PhD student), sometimes with a single well-trained RA, and generally with ad nauseam critiquing of the materials by me. Perhaps overkill for an acceptability study, but valuable for our purposes. Materials used in other studies are prepared in different ways with different goals in mind. E.g., in the new study that Ted linked to somewhere in this thread, the materials creation was largely farmed out to 16 undergraduate students as a course assignment. They were asked to take a base sentence and then change lexical items to create new versions. Ted comments that he always learns something from doing the experiments. I would *partly* agree. My experience is that I almost always learn a good deal from the process of materials creation, but then I typically learn little more from the data collection, because the lessons were generally learned beforehand. It sounds like Ted is saying that he learns more at the data collection stage. At that stage, I feel like I mostly learn about ways that the task can be misunderstood, but I can see how we could have different experiences at that stage.

  10. Norbert, you said a lot in your blog. I feel it’s worth responding to at least one other comment of yours (beyond the stuff about quantitative methods, which I clearly disagree with you on).

    You say:
    “However, despite some changes in theory, GG revolutions have preserved most of the structures of the ancient regimes. … For the purposes of most of what goes on in cogneuro, … it really doesn’t matter whether you adopt the latest technology or the newest fangled concepts. You can do good work in all of these areas using GB models, LFG models, HPSG and GPSG models, Aspects models, and RG models. For many (most?) of the types of questions being posed in these domains all these models describe things in effectively the same way, make essentially the same distinctions and adopt more or less the same technology.”

    I think that the field is more complex than you describe. Whereas there may be a lot of similarities among the work done by people in different syntactic frameworks, this is not obvious to non-linguists. Indeed, it seems like there are intense debates among camps, such that it is difficult to see one unified theme coming from all of them. Perhaps the themes are there, but they are not well described in any recent summary review articles that I am aware of. Rather, I understand the field as having a lot of disagreements about what the basic building blocks for grammatical structure are.

    For example, suppose I were to take introductory linguistics at MIT. Then I would learn something like minimalist syntax, using a textbook like Adger or Haegeman. But suppose I were to take a similar course at Stanford or Berkeley. I would learn a very different syntactic framework (HPSG), using Sag, Wasow & Bender’s text. If I learned syntax at Princeton, then maybe I would learn Construction Grammar, using Goldberg’s books.

    Maybe what you mean is that there are lots of surface-level phenomena that have been revealed by linguists of all persuasions. E.g., many of the English observations can be found in Huddleston & Pullum (2002):

    http://www.amazon.com/The-Cambridge-Grammar-English-Language/dp/0521431468

    But beyond this, there may not be much agreement among linguists. One big difference among frameworks is whether there exists “underlying” syntactic structure. Among constructionist approaches that do not appeal to underlying structures, there may be a lot of agreement, but I don’t think that that's what you’re referring to.

    One other commenter suggested that this is a “public relations” (PR) problem that linguistics has. It is at least such a problem. If / when anyone writes some summary articles describing the basics, then maybe we can see if it’s just a PR problem, or something deeper.

    Replies
    1. @Ted: the agreement goes beyond the things that one finds in a descriptive grammar. Pretty much everybody agrees that there is (i) an encoding of thematic relations, roughly who did what to whom; (ii) an encoding of grammatical relations, e.g., subject/object, topic/comment; (iii) an encoding of scope relations for operators such as interrogatives or quantifiers. And pretty much everybody agrees that these different encodings mutually constrain in some fashion, i.e., there are constraints on the relations between these encodings. The 1970s/1980s fights were over how to capture those different encodings and how to capture the constraints on the relations between them. Beyond that, there are a bunch of notational differences, differences in taste about the level of explicitness, and differences in emphasis on what types of problem folks choose to focus on. These are often dressed up as if they are fundamentally different accounts of the same phenomena, but I think that's rarely the case. But I entirely sympathize with the worry that this isn't obvious to observers.

      Re: attempts to lay out some points of general consensus. Here is a recent attempt in that vein by David Adger. (It's the most downloaded article on LingBuzz of the past 6 months, incidentally.)
      http://ling.auf.net/lingbuzz/002243
      And around 10 years ago I tried something similar in the context of a cogsci handbook.
      http://ling.umd.edu/~colin/wordpress/wp-content/uploads/2014/08/phillips2003-syntax.pdf

    2. Yes, I do believe that there are many commonalities among the various "theories." When you plunge into the details of most of the different approaches, they not only recognize the same "effects" (i.e. G generalizations) but even analyze these in more or less the same ways. In fact, they tend to be quite intertranslatable, with the principles of one carrying over smoothly into those of the other. A famous case of this was RG and GB, with Burzio and Baker leading raids into RG world and bringing the generalizations stated in RG terms back into GB. I have always found the GB and H/GPSG treatments of long-distance dependencies to be effectively notational variants of one another. What I do with successive cyclic movement they do with slash categories. Where I have an island, they have an illicit slash. This does not mean that these theories are in all respects identical, but they exploit very similar techniques of analysis in many of the interesting cases. I should add that Ivan Sag and I agreed about this (perhaps surprisingly), despite our working in very different "frameworks." More exactly, we agreed on a whole bunch of common properties that we thought Gs had, though we would code them in what appear to be superficially distinct ways.

      The exception here may be construction grammar. There are some versions of this that are fully compatible with standard theory. But there have been some claims, e.g. by Goldberg, that this is a whole new understanding of things. I think she is wrong about this. But this is not the place to argue that. So I should put construction grammar to one side, perhaps. Maybe this is reason not to learn your syntax at Princeton.

      One last point: is the agreement only on surface patterns? No, not really. The agreements go deeper. Many of the analyses even agree on the kinds of principles involved, the relevant locality conditions, the relevant hierarchy conditions, etc. There is often some reshuffling between various components of the grammar, but when one looks at the details, even these differences amount to little. A witness to this was the debate about the psychological reality of traces that Colin mentioned above. The indisputable fact is that there is a dependency between antecedents and the predicates that thematically mark them/that they are arguments of. Are traces required to make this happen? No. Why? Because the very same relation can be coded in other frameworks without traces. In fact, I know of virtually no big effect that can ONLY be coded in one of the known frameworks. And believe me, we would have heard about it were one to exist. The descriptive apparatus of the various "theories" is actually too rich, each able to code what the others have discovered. This speaks to their being less different theories than different ways of coding the same basic concepts.

    3. Wow, thanks for the articles. I will read these. They look very informative.

  11. I have to say, Hagoort's essay is one frustrating read. Not in a vicious, ranting way --- it is indeed very polite and free of underhanded accusations, and I'm sure it is well-intended. But the points he makes have been discussed many times and shown to be misguided; you would think that at some point this would have finally sunk in.

    Here are the points I found particularly flummoxing:

    1) The human brain doesn't differ much from animal brains on an architectural level, so it cannot support language-like representations
    Let's apply this line of reasoning to the one computational device we understand really well: computers. An Intel Haswell does not differ much from a Pentium Pro (both are i686 architectures), so according to Hagoort the fact that the latter does not support complex features like hardware virtualization or C6 states should imply that the former does not, either. Except that it does.

    And even if the hardware is exactly the same, why would that matter? Computation isn't about hardware, it's about software. That's why true progress in computer science is tied to the study of algorithms and data structures. If you want to work with really large lists, you don't need faster hardware, you need to move from something like linear search to binary search, and you had better store that list as a prefix tree if the items in your list have significant structural overlap. That's what makes it possible to deal with harder problems, not throwing in a few more transistors.
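
    (A minimal sketch of that point in Python: same machine, same data, but the choice of algorithm changes the work by orders of magnitude.)

    from bisect import bisect_left

    def linear_search(xs, target):
        # O(n): inspect items one by one.
        for i, x in enumerate(xs):
            if x == target:
                return i
        return -1

    def binary_search(sorted_xs, target):
        # O(log n): halve the search space at each step.
        i = bisect_left(sorted_xs, target)
        return i if i < len(sorted_xs) and sorted_xs[i] == target else -1

    xs = list(range(1_000_000))
    assert linear_search(xs, 999_999) == binary_search(xs, 999_999)
    # ~10**6 comparisons vs. ~20: the speedup comes from the algorithm,
    # not from more transistors.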

    2) A general unawareness of Marr's levels
    This is basically just a follow-up to the previous point, but you would think that neuroscientists are aware of Marr's three levels of description and what they entail on an epistemic level. There is no need for representations to have a direct analog on the algorithmic or hardware level. In general, the mapping is one-to-many: a set can be instantiated as a list, a hash table, or a tree, for instance, and how that is translated into hardware also differs significantly between computers depending on their architecture (x86, ARM, RISC, ia64, and so on).

    Computer scientists completely abstract away from the actual hardware, and for good reasons:

    a) the hardware level is unnecessarily complex for the questions being studied,
    b) since hardware can differ a lot, important generalizations can only be formulated at higher levels of abstraction, and
    c) the hardware level is very uninformative --- you won't figure out how a program works by measuring the electric potentials in your computer (even tasks that are very close to the hardware, like reverse-engineering a driver, don't work that way).
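
    (The same point as a minimal sketch, with the obvious caveat that it is an analogy: one computational-level object, several algorithmic-level realizations.)

    # One computational-level object (a set of items) realized by
    # different algorithmic-level data structures: same queries, same
    # answers, very different machinery underneath.
    items = ["wh-island", "ecp", "binding"]

    as_hash = set(items)            # hash table
    as_list = sorted(items)         # ordered list
    as_dict = dict.fromkeys(items)  # key-only mapping

    for store in (as_hash, as_list, as_dict):
        assert "ecp" in store       # identical behaviour at this level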

    Replies
    1. [cont]
      3) Usage of corpora
      Useful corpora of a decent size exist only for a handful of languages. And you would need a giant corpus to study infrequent phenomena like the Person Case Constraint, where you can only use sentences with two clitics in a specific order and you have 512 possible combinations to check. I'm also sure that current corpora offer little insight on split ergativity, or resolved agreement in Bantu languages. So should linguists stop working on these phenomena because there are no corpora? If not, why is it okay to use methods that aren't corpus-based for, say, Bantu languages, but not the much better understood Indo-European languages like English, French, and German?

      4) The "me first" attitude
      The entire post is written like [facetious]a guide to how linguists can better serve their neuroscience masters[/facetious]. But what is the incentive for linguists to work on these things, beyond the institutional advantages of money and influence? What research problem that a single generative linguist might be working on can profit from collaboration with neuroscientists? The majority of linguists work on specific languages or phenomena, they are not pooling all their resources into ambitious big picture projects. Neuroscience is important for the latter, but it has much less to say about the former at its current state of development. And depending on how strictly you interpret Marr's levels, it may never have much to say about these specific issues.

      5) Linguists should provide language-specific information rather than top-heavy theory
      In other words, linguists should be cataloging facts like butterfly collections, rather than think about grammar. The one thing I tell a layman when they ask me about linguistics is that it is about language, not languages. I am sure that is the official position across all generative traditions, and it has been like this since the 50s. But it seems that 60 years later this position is still considered useless at best in other fields. Good to know where we stand.

  12. I think it's worth emphasizing the point that we shouldn't always assume that a divergence between the results obtained via (i) formal experiments with undergraduate subjects and (ii) informal experiments with linguist subjects indicates that the former are more reliable than the latter. Some entirely incontrovertible judgments are not obtainable using method (i). For example, no-one seriously doubts that there could be no verb KILL such that "John KILLed Mary" means "John and Mary both got killed", but you'd have a hard time getting that judgment out of linguistically naive subjects.

    Replies
    1. @Alex: "For example, no-one seriously doubts that there could be no verb KILL such that "John KILLed Mary" means "John and Mary both got killed", but you'd have a hard time getting that judgment out of linguistically naive subjects."

      This is not an acceptability judgment comparing one sentence type to another. This is a theoretical inference that one might make, given knowledge of how verbs work within and across languages. Proponents of quantitative research are not asking experimental participants to interpret evidence, only to provide the evidence.

    2. This comment has been removed by the author.

    3. @Ted. It is a judgment, not a theoretical inference. Whether or not we classify it an “acceptability judgment”, it is in any case the kind of judgment that linguists often make use of and collect informally.

      What you are pointing out is that there might be other (much more laborious) means of reaching the same conclusion using other data. But certainly English speakers have the judgment that there could be no such verb in English. (I personally have no knowledge of how verbs work within or across languages, but I still have the judgment.)

    4. @Alex. You may call it a judgment, as there is no formal theory behind it, but it is not the kind of thing that we are building our theories to try to describe, just like we are not interested in trying to describe the intuitions people used to have about which structure was the right one for a particular sentence.
      Note, furthermore, that it is a funny kind of intuition, because you will certainly agree that it can be wrong. (What if, heaven forbid, we find such a verb?) In this, it differs from acceptability judgments (and the other judgments we are trying to explain with our theories).
      While there may be some interesting theory governing judgments of this sort, I see no reason to believe that it has anything to do with the theories governing acceptability judgments.

    5. @Greg. It seems to me that it is the kind of thing that we are building our theories to describe. Surely we would not want a theory of argument structure to predict that such a verb should exist. In fact, as you probably know, there was quite a significant literature on "impossible verbs" in the 70s in relation to lexical decomposition hypotheses. The judgment at issue is really no different in kind from, e.g., the judgment that [sklplflt] is not a possible word in English.

      I agree of course that the intuition could turn out to be wrong, but that is just a reason to treat it with some appropriate degree of caution, not to ignore it altogether. (And in fact these sorts of intuitions regarding possible and impossible verbs seem to be fairly reliable.)

      >I see no reason to believe that it has anything to do with the theories governing acceptability judgments.

      I find this a very odd statement. Surely we would expect the judgment to be accounted for by some theory of argument structure (or the like) which would also account for lots of instances of what we would uncontroversially term acceptability judgments.

    6. @Alex. Your judgment that such a verb is impossible is not something any theory should account for. (I claim.) We agree, however, that a theory maybe should account for the apparent fact that such a verb is impossible.

    7. Yes, I see that, but you haven't really given any reason for dismissing judgments of that sort except that they might be wrong. From my point of view that concern applies to any acceptability judgment, since acceptability is never a 100% reliable indicator of grammaticality. Is the claim that we somehow know in advance that impossible verb judgments ought to be ignored? Or that we've discovered by trial and error that paying attention to them is not theoretically fruitful?

    8. I'm thinking I must be misunderstanding you somehow.

      We are trying to develop a theory which explains people's linguistic behaviour (ceteris paribus). Surprisingly, working with acceptability judgments seems to help us do this, despite the fact that we have a poor linking theory between competence theory and acceptability. (Clark and Lappin are doing interesting things in this regard.) A less surprising sort of judgment is semantic in nature, as part of the behaviour which seems at the core of linguistic behaviour is that expressions mean things (whatever that means).

      I am claiming that there is no reason to think that trying to model people's intuitions about whether languages exist which have lexicalized particular meanings is going to be helpful in this regard.

      Now, maybe this is not what you meant. Looking back, I see that you put this differently; you said that you have a judgement about whether X is a possible word of English, and likened this to judgments of phonotactic well-formedness. If it turned out that such judgments were robust like phonotactic judgments, it would be a reasonable thing to do to try and model them. And if it turned out that the same model that we used to model acceptability judgments were useful here, that would be great.

    9. Aha! Yes, I was only making the non-insane claim that the intuitions of an English speaker regarding possible and impossible English verbs might be worth paying attention to. (It seems we do not fundamentally disagree on that point.) I didn't make that explicit in my first post, however, so sorry for the confusion.

    10. This comment has been removed by the author.

  13. Norbert, I can see where you're coming from in this post, and I'm a regular user of both generative grammars and intuition data myself. But a brief comment on the rhetoric: I don't know if your intention is to galvanize linguists who already adopt the theoretical concepts and methods you're defending, or to persuade others who are skeptical of their value. If it's the former, ok. If the latter, then describing controlled experiments and statistical analysis as "obsessive" and "anal retentive" is extremely counter-productive. Even if you're right that Sprouse & Almeida showed once and for all that this is rarely necessary – a claim that could certainly be debated by thoughtful people on both sides – talking this way is a guaranteed way to ensure that your opinion will be dismissed out-of-hand by people who come from disciplines where the design, conduct, and statistical analysis of controlled experiments is considered one of the most basic aspects of the scientific method.

    Replies
    1. @Dan

      " Even if you're right that Sprouse & Almeida showed once and for all that this is rarely necessary – a claim that could certainly be debated by thoughtful people on both sides – talking this way is a guaranteed way to ensure that your opinion will be dismissed out-of-hand by people who come from disciplines where the design, conduct, and statistical analysis of controlled experiments is considered one of the most basic aspects of the scientific method."

      I doubt that it is the harsh rhetoric that will put them off, but you may be right. S&A's value lies in showing that a certain kind of global skepticism about linguistic data is misplaced and ill-founded. The stats-inclined will reject this on hygienic grounds. Psychologists are trained to understand data as the output of stat-massaged experiments. They will reflexively be drawn to the Hagoort point without thinking. The only way to get someone like this to think differently is to shock their sensibilities. Or that's what I think. So, the way I put things may make people angry, but it will likely get them to pay attention. Politeness and concessiveness are guaranteed to lead nowhere. So, who am I addressing? Well, given that I doubt many psycho types read FoL, my main audience is linguists and some psycho-ling types. But if some errant psychologist or neuro type does read this, then I hope that what I say will shock their sensibilities. Not because what I say is wrong, but because the way I say it will grab their attention and make them want to fight back. Once engaged, we have a chance of airing the issues. Until engaged, the predominant culture prevails, even if there is no intellectual justification for it. Hope this helps.

    2. "The only way to get someone like this into thinking differently is to shock their sensibilities."
      If it were me I'd ask the Dr. Phil question: How has this been working for you so far? [e.g. how many psychologists or neuro types have you shocked into paying attention to you?] If you do not like the status quo [and judging by your massive effort creating and maintaining this blog you don't] then maybe, just maybe, it might be worth your while trying some less off-putting approach. But, hey, no need to listen to a type-writing monkey :)

    3. @Norbert I thought that might be your strategy. Like Cristina, I fear that it may be having the opposite effect of what you intend: reinforcing the perception that linguists are methodologically backward. Even supposing that you're right about the 'correct' methodology (and supposing that that description makes sense), I think you're losing the rhetorical battle when you approach it this way.

    4. @Dan
      You may be right. However, it's not like the other, concessive strategy has made much headway. One way of thinking of my strategy is that I define a pole that more reasonable arguers can distance themselves from. So in my own immodest way I help the reasonable gain a hearing and make a case: "Look, I'm no Norbert, he goes too far, but…."

      Let me be a bit more serious for a moment. There is no reasonable version of the position that S&A address: to wit, the claim that wholesale skepticism about linguistics is warranted by flawed methods of data collection. This GLOBAL skepticism is dumb and cannot be argued against, only ridiculed. The reasonable position, moreover, is not contentious. Sure, there are more and less careful methods of data collection, and you use the more careful ones when they are useful to use. Ok, with that pablum out of the way, the argument proceeds case by case. This is a truism, not an insight.

      However, there is a push by certain quarters of the cogneuro world to think that their standards of investigation are the gold standard and anything not conforming to them is crap. I believe that this has roots in two related conceptions. First, it is based on the idea that the reason current work in cog-neuro is not more insightful is that we just don't have the right data. IMO, this is exactly backwards: what we don't have are the right questions, because we have such poor theories. Cog-neuro is data rich and conceptually poor. However, what these guys have learned how to do is run experiments. Many of these are pointless, IMO, but they are always elegantly crafted. This is what they want to export to linguistics. We can become just as intellectually barren as they are. CAVEAT: this does not apply to everything in cog-neuro, but it also does not apply to nothing, indeed, quite a bit. So, I think that the obsession with data collection (and yes, I think it is an obsession) is badly misplaced.

      Furthermore, the roots of the obsession lie in a very bad idea of how science works (in fact, there is no method, no standards, no inviolable methodological principles…). I happen to believe that this vision stems from a certain philosophical conception (Empiricism, as if I had to tell you) and that this is a very bad way to think of things. It is also mainly there for bullying purposes. I don't know about you, but often when I am told about the right way to collect data there is a kind of condescending sneer to the tone of the instruction. And I don't like it. But more importantly, it is just wrong-headed. However, the pious tone arises from what I consider to be a deep misunderstanding. And here is what I fear: being concessive and reasonable means buying into this general picture, and doing this already gives the game away.

      So, yes, I am shrill and yes, I am pugnacious, at least about certain issues. However, given that the other approach has not, from what I can see, been particularly successful, this is not counter-productive. Moreover, it is based on a set of different tastes as regards the scientific enterprise. Disputes about taste are the only ones really worth having, IMO. But they are seldom gentle. That's at least how I see things. But thx for the comment.

  14. It is unfortunate that most of the discussion here has focused on the reliability of acceptability judgments. We can't know for sure, but my hunch is that cognitive neuroscientists wouldn't suddenly express deep interest in minimalist syntax if all of the judgments in syntax papers had "p < 0.05" next to them. In my opinion the biggest gulf between theoretical linguists and cognitive scientists (or computer scientists) is the evaluation metric that's used to decide between competing representations. Some decisions seem to be based on aesthetic principles (e.g. dislike of traces, dislike of functional heads, preference for binary branching), rather than empirical arguments; it's often unclear whether two theories even make any different testable predictions for the kind of data the cognitive scientists or computer scientists care about. People would get interested if grammars with Larsonian VP-shells predicted reading times better than grammars without them, or improved parser accuracy, or accounted for some commonly accepted and quantifiable set of syntactic judgments. Even when some predictions can be wrested out of the theories and tested, it's unclear whether the results of those tests ever feed back into syntactic theory. John Hale's work and the papers that Colin mentioned are great examples of attempts to derive empirical predictions from representational theories, but they're the exception rather than the norm. As Tim pointed out, there are questions about the linking function between the linguists' representations and the empirical data; given the complexity of contemporary syntactic theories, the only realistic way to get scientists outside the field interested is if linguists did the work to try to solve these problems and showed them that the representations they care about are useful.

    Replies
    1. @Dan:
      I largely agree with your conclusions. Let me elaborate a touch.

      " it's often unclear whether two theories even make any different testable predictions for the kind of data the cognitive scientists or computer scientists care about."

      Correct. I've said this repeatedly. Take ANY of the standard GG models: over a very large range (in fact, over almost all phenomena of interest to cog-neuro types), any model will serve.

      "People would get interested if grammars with Larsonian VP-shells predicted reading times better than grammars without them, or improved parser accuracy, or accounted for some commonly accepted and quantifiable set of syntactic judgments."

      Right. The problem with getting linguistic ideas to give you use-of-grammar data is that the linking hypotheses are so weak. We used to have the DTC, but it was abandoned (perhaps very hastily, as discussions by Colin and Alec Marantz show: they argue that the arguments against the DTC were pretty bad and that it is really still the central assumption). But after that was disposed of, the conclusion was that linking grammatical knowledge to manifestations of the use of that knowledge is impossible. This strikes me as entirely too pessimistic. At least in child acquisition and processing there are some models exploiting grammatical knowledge of pretty much the GG variety that are making interesting headway (Colin and Jeff do this all the time in my own dept). Things are rougher in neuro, but even here some recent work is intriguing. I've discussed Dehaene's experiments on Merge, but there is also some new stuff by Poeppel that tries to find the right linking theories. What is confusing to me is why people think that it is the linguist's responsibility to provide the linking theory. Not that I wouldn't like to provide this were I able, but I have lots of evidence for ling theory independent of being able to provide a linking theory to OTHER kinds of data. I thought that this is what psycho types should be working on: how memory, attention, etc. interact with linguistic knowledge to produce the behavioral outputs we see in real time.

      "Even when some predictions can be wrested out of the theories and tested, it's unclear whether the results of those tests ever feed back into syntactic theory"

      Yes: the reason is the weak linking theories. They are always, IMO, less well motivated than the syntax they embed. This noted, I think that the stuff by Hunter, Pietroski, Lidz, and Halberda has had an impact on some thinking in semantics, and it may have more in the future.

      As a political matter, I agree that the best thing to do is to solve their problems, or show that there are interesting problems to solve that use Gs of the GG type. Again, I am sanguine here, as this is already being done in dribs and drabs. The problem is that psycho types don't join in because they really don't understand what a linking theory is, nor do they know much about language. Those that do, do good work. But this is a small number of people. Why don't they use GG insights? Partially because it takes work to know this stuff, partially because, IMO, they are in the thrall of a perverse phil of science and mind. How to change this? Well, I hope that doing good work that combines them will make a difference, but it is hard to appreciate this work if you know nothing and have nutty general views. And the latter I don't know how to change, though speaking nicely to them doesn't seem to work.

    2. I am less convinced than you are that people are avoiding certain representations for deep philosophical reasons; I don't think connectionism is getting as much traction as it did 25 years ago, if that's what you're referring to.

      I don't know if it's the responsibility of linguists or psychologists to come up with better linking functions between representations and behavioral data. There's a continuum between those two disciplines; experimental psychologists who work on language and four other higher cognition domains are probably not going to be in a position to do it, and the same goes for linguists who spend most of their time conducting fieldwork in the Amazon. People in adjacent points on this continuum need to be talking to each other, and the linking hypotheses will gradually emerge. There has been some progress in this area in the last decade, in particular with respect to surprisal in sentence processing. Most of the evaluation has been on reading times, though there are some neuro examples as well, e.g. from Asaf Bachrach's dissertation (2008) or Jon Brennan's work (http://www.ncbi.nlm.nih.gov/pubmed/20472279) in fMRI, or this recent ERP paper by Stefan Frank, who I believe works in Hagoort's group: http://www.stefanfrank.info/pubs/BL2015.pdf
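
      (For concreteness, a minimal sketch of the surprisal linking hypothesis with an invented toy bigram model; the papers above estimate these probabilities with PCFGs or neural language models. The hypothesis is that reading time at a word grows with its surprisal, -log2 P(word | context).)

      import math

      # Invented conditional probabilities; real models estimate these
      # from corpora.
      p_next = {
          ("the", "dog"): 0.20,
          ("the", "hypothesis"): 0.01,
      }

      def surprisal(prev_word, word):
          # Surprisal in bits: -log2 P(word | context).
          return -math.log2(p_next[(prev_word, word)])

      print(surprisal("the", "dog"))         # ~2.3 bits: little slowdown
      print(surprisal("the", "hypothesis"))  # ~6.6 bits: predicted slowdown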

      For this work to be possible, there need to be computational implementations of the grammars that are being evaluated. Perhaps theoretical syntax papers should include a computational implementation of a grammar that includes the paper's contribution; at any rate, I don't know if it's realistic to expect a psychologist to delve into the details of a syntactic theory, and potentially make decisions that were left unspecified in the theoretical paper, to be able to use its results. Sociologically, we will never reach this point unless this kind of work is valued, emphasized in linguists' training and used to inform the development of syntactic theory.

    3. Norbert wrote: What is confusing to me is why people think that it is linguist's responsibility to provide the [linking] theory. Not that I wouldn't like to provide this were I able, but I have lots of evidence for ling theory independent of being able to provide a linking theory to OTHER kinds of data. I thought that this is what psycho types should be working on: how does memory, attention etc interact with linguistic knowledge to produce the behavioral outputs we see in real time.

      I certainly agree that syntacticians do not have a "responsibility" to provide a linking theory that links their favourite theories to non-acceptability-judgement behaviour in ways that make accurate predictions. That is, those favourite theories are perfectly well-supported scientific theories purely by virtue of matching the acceptability-judgement behaviour. But I think it would be helpful if there were more discussion of the kinds of linking theories that could be combined with syntactic theories in order to make non-acceptability-judgement predictions. Note: I'm not saying empirically accurate predictions, just predictions at all. In caricature, I guess we could say roughly that it remains the job of the "psycho types" to work out which syntax+linking theory's predictions are empirically borne out, but a precursor to that is having some options on the table, and getting some options on the table seems like a very natural thing for syntacticians to be working on (possibly in collaboration with others). (It's fun and speculative and abstract and pre-empirical, in all the ways that people like me enjoy.)

      This task of getting some options on the table seems to generally run into a sort of a brick wall because there's a tendency for outsiders to latch onto linking theories along the lines of "derivational time equals real time". For better or for worse, I think the only way we're going to get around that is for some syntax-friendly people to take the lead in getting some alternatives on the table. This is a practical matter, and acknowledging this practical matter does not entail retreating from the position that syntactic theories are perfectly well-supported scientific theories purely by virtue of matching the acceptability-judgement behaviour. And accordingly, pairing your favourite theory with a linking hypothesis, and finding that the predictions of this conjunction are not confirmed by a certain kind of non-acceptability-judgement behaviour, does not invalidate the support that your favourite theory had from acceptability-judgement behaviour. (Perhaps part of the problem is that this last point is not widely appreciated -- there's a tendency to think of these tests with linking theories as "the only real tests" -- which makes syntacticians feel like working with linking theories is going out on an extremely shaky limb?)

    4. Just to clarify, in most of the many papers that used grammar-based surprisal as a predictor of reading times, the grammar was a PCFG that wouldn't correspond to any linguist's idea of natural language syntax - again, not because of any deep theoretical commitment, but just because most of them used a piece of software that didn't implement any of the fancier syntactic devices that modern theories have. It's not clear who's responsible, but my guess is that if a piece of software had existed that was as easy to plug in as the PCFG parser they used and that implemented VP-shells etc., some of those papers would have used it.
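
      To make the "grammar-based surprisal" idea concrete for readers who haven't seen it, here's a toy sketch. The little grammar and its probabilities are invented for illustration, and real implementations compute prefix probabilities incrementally with a chart parser rather than by enumerating derivations as this does:

          import math

          # Toy PCFG: nonterminal -> list of (right-hand side, probability).
          # The rules and probabilities are made up for illustration.
          PCFG = {
              "S":  [(("NP", "VP"), 1.0)],
              "NP": [(("the", "N"), 1.0)],
              "N":  [(("dog",), 0.5), (("cat",), 0.5)],
              "VP": [(("barked",), 0.6), (("chased", "NP"), 0.4)],
          }

          def derivations(symbols, prob, depth=8):
              """Yield (sentence, probability) for every complete derivation."""
              if depth == 0:
                  return
              for i, sym in enumerate(symbols):
                  if sym in PCFG:  # expand the leftmost nonterminal
                      for rhs, p in PCFG[sym]:
                          yield from derivations(symbols[:i] + rhs + symbols[i+1:],
                                                 prob * p, depth - 1)
                      return
              yield symbols, prob  # all terminals: a complete sentence

          def prefix_prob(prefix):
              """Total probability of all sentences beginning with these words."""
              return sum(p for sent, p in derivations(("S",), 1.0)
                         if sent[:len(prefix)] == tuple(prefix))

          def surprisals(sentence):
              """Surprisal of word i: -log2 P(w_i | w_1 .. w_{i-1})."""
              return [-math.log2(prefix_prob(sentence[:i]) / prefix_prob(sentence[:i-1]))
                      for i in range(1, len(sentence) + 1)]

          print(surprisals(["the", "dog", "chased", "the", "cat"]))
          # "chased" costs -log2(0.4) = 1.32 bits; the second "the" costs 0 bits.

      The point above then becomes: everything interesting hinges on what you put in the PCFG slot, and nothing in the surprisal machinery itself forbids a richer grammar.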

      And another small comment - the Brennan paper I mentioned did not use surprisal as the linking hypothesis but node count, which I believe is less well-supported than surprisal.
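
      Node count is easy to state concretely too. Here's a rough toy sketch of my own (not code from the Brennan paper): it reads bottom-up node counts off a labelled bracketing, i.e. for each word, how many constituents have that word as their right edge.

          import re

          def node_counts(bracketed):
              """For each word, count the constituents whose right edge it is."""
              tokens = re.findall(r"\(|\)|[^\s()]+", bracketed)
              words, prev = [], None
              for tok in tokens:
                  if tok == "(":
                      pass
                  elif tok == ")":
                      if words:
                          words[-1][1] += 1   # a constituent closes at the last word
                  elif prev == "(":
                      pass                    # a category label, not a word
                  else:
                      words.append([tok, 0])
                  prev = tok
              return words

          print(node_counts("(S (NP (D the) (N dog)) (VP (V barked)))"))
          # [['the', 1], ['dog', 2], ['barked', 3]]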

    5. I agree with Tim that the question here is not which kind of data is more important to account for - acceptability judgments are interesting to linguists and are worth explaining, if you're a linguist. The question we're debating is how to interface better with people in other fields who are interested in other types of data, such as reading times, speech recognition word error rates, etc.

    6. Surely we want to define ourselves based on our questions rather than based on our favorite kind of data? If we have "acceptability judgment people" and "eye movement folks" and "brain wave types", then we're in bad shape. We want to understand the representations and computations, and surely should just pursue whatever it takes to understand them (and at whatever grain of analysis proves fruitful). There's a widespread notion, peculiar to syntax, I think, that once you move beyond a specific level of analysis and a specific type of data you stop being a syntactician and become a "psycho type". That's unfortunate.

    7. I do think there are computations that rely on the output of the grammar but are not themselves part of what we consider to be the grammar -- for example, eye movement control in reading. These computations are not normally of interest to syntacticians (and possibly shouldn't be). People who study eye movements may still want to use the result of the syntacticians' work to predict where regressions will occur, though they don't work on the same representations and computations as the syntacticians.

      Even if you're interested in linguistic representations ("competence"), there's always the question of whether acceptability judgments and online measures tap into the same representations. I don't know if we understand the linking function between the grammar and people's acceptability judgments better or worse than we understand the linking function for reading times, but sociologically there definitely seem to be people who privilege one type of data over the other (I guess in Colin's terms this means that we're in bad shape). Showing that grammars derived from acceptability judgments do a good job of predicting reading times might help.

    8. Colin wrote: Surely we want to define ourselves based on our questions rather than based on our favorite kind of data?

      Yes I agree with that. My phrasing above was probably misleading; by "the job of the psycho types" all I meant was something like "the thing we hope to gain by deploying psycholinguistic methods". (I was carrying over Norbert's phrase, and I would suspect that he probably meant it in roughly the same way, but he should speak for himself.) My main point was that I suspect it would be good if people who are in the habit of constructing theories on the basis of acceptability-judgement data played more of a part in formulating candidate linking theories that could be used to connect things up to other kinds of data. So I certainly don't want to suggest that people should pick only one kind of data and focus only on that.

      Put differently: whether or not one is involved in collecting many different kinds of data (for whatever reason), one can still think about predictions concerning many different kinds of data.

    9. Just to add to the discussion of linking hypotheses: there's the contention sometimes made within type logical grammar that parsing difficulty is correlated with the number of unresolved axiom links in a proof net (after each word). See for example here: http://tinyurl.com/nahzo3n
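
      For those who haven't seen that metric, here is a deliberately crude sketch of the idea. The real measure is defined over proof nets and constrains how axioms may be linked (planarity); this toy version just matches polarized atomic formulas greedily, with a made-up three-word lexicon:

          # Each word's category is flattened into polarized atoms; e.g. a
          # transitive verb (np\s)/np contributes one s+ and two np-.
          LEXICON = {
              "John": [("np", "+")],
              "Mary": [("np", "+")],
              "saw":  [("np", "-"), ("s", "+"), ("np", "-")],
          }

          def unresolved_counts(sentence):
              """Atoms still awaiting an axiom link after each word."""
              open_atoms, profile = [], []
              for word in sentence:
                  for atom, pol in LEXICON[word]:
                      opposite = (atom, "-" if pol == "+" else "+")
                      if opposite in open_atoms:
                          open_atoms.remove(opposite)  # resolve an axiom link
                      else:
                          open_atoms.append((atom, pol))
                  profile.append((word, len(open_atoms)))
              return profile

          print(unresolved_counts(["John", "saw", "Mary"]))
          # [('John', 1), ('saw', 2), ('Mary', 1)]; the leftover s+ would be
          # linked to the goal formula s of the whole sentence.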

    10. @Colin:
      Tim took (put) the right words into my mouth. I think we are all looking at the same neuro-cog object. What syntacticians and phonologists and semanticists aim to do is provide a recursive characterization of the range of possible structures. I believe that doing this will help in explaining how we navigate this domain of the possible in getting to our actual Gs and the structure of the actual sentence being uttered/produced. I even believe that as a good first hypothesis we should assume a large measure of transparency between the recursive definition the formalists supply and the algorithms deployed in real time. I personally think that the Gs used in real time are effectively the same as the Gs used to characterize the I-language. However, as you know, this is an extra assumption. It may well be that the grammar used in real time is a covering grammar of G, or that G plus a bunch of non-G processes are used (think Bever), or… These issues require ancillary assumptions beyond those required to triangulate on adequate recursive specifications, and as of now the evidence for these ancillary assumptions is, IMO, much weaker than the evidence for the syntactic/phono/sem hypotheses. I very much hope that this will one day change. But right now, that's where we are.

      As you also know, I think that real progress has been made in some "conjunctive" areas (i.e. Ling + ?), e.g. processing and acquisition. But even here, the progress is at the level of specific proposals for specific phenomena. We have few overarching principles to guide the work. More like Aspects-style work than GB or later research. In other areas (e.g. neuro, production) things are far less well developed.

      What does this imply? That right now we should be tenacious in holding onto the linguistics we have, for its accounts are well grounded empirically and theory-rich, and there is a plausible route from here to what we want in the conjunctive areas. As I see it, the main problem is that people outside the linguistics "core" seem ready to throw all this out without seriously having tried to combine it with linking principles. When this has been tried (e.g. by you, Jeff, Lina, Alec, and your students) it has generated interesting and plausible stories. We need more of that, and less dumping of what we have good evidence for in favor of future riches we can't even begin to describe.

  15. This comment has been removed by the author.

  16. I wanted to raise a related but slightly different issue. Hagoort says that representations in the rest of cognitive science involve 'high dimensional geometric manifolds' rather than propositional representations. Norbert responded that the 'linguaform' nature of the syntactic representations is crucial. But I think these two views are compatible. What is a syntactic representation? As an output of the computational system it is, in fact, something that is not a million miles from a 'high dimensional geometric manifold'. Take a structure for a wh-adjunct question like 'how did he dance?'. A Merge-style analysis of this has the initial (pair-)Merge position of 'how' in a different dimension from the VP, with reentrant structures (essentially curves in the structure) linking the Merge position of the adjunct with C, C with T, TP with vP, etc. There's possibly extra dimensionality, perhaps temporal, given by the cyclic nature of the object. What the representation is, as a configuration of basic units, is distinct from how it is interpreted, which is where, I think, Hagoort goes wrong. Clearly the structure is interpreted as a proposition-like thing, but that's because it is interpreted as (input to) instructions to a language-external system used for thinking, memory, planning etc. (and everyone, I believe, thinks we need propositionality for those). Similarly, it can be interpreted as (an input to) instructions to wave articulatory organs around. Hagoort is confusing the propositional meaning of syntactic structures with their form. Not that it'll probably help our case to say 'hey, we have high dimensional geometric manifolds computable too'!

    Replies
    1. @David, in mainstream linguistics the syntactic representations are discrete combinatorial objects. I think these are radically different from the sorts of "vector space" representations that Hagoort is referring to. The term "dimension" has a specific technical meaning that can't be applied to trees. It is possible to map trees into vector spaces using various techniques but they have a different topology -- we can't take the average of two trees, whereas the high dimensional manifolds are smooth.

      One can consider trees where the labels aren't discrete categories but are elements of a vector space, as many people have considered over the years, most recently Socher/Manning/Ng, but that is something quite different, and I would have thought heretical to Chomskyans...
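
      For concreteness, here is a bare-bones sketch of that Socher/Manning/Ng-style setup: the tree remains a discrete combinatorial object, but its node labels are vectors composed bottom-up as p = tanh(W[c1; c2] + b). The dimensions and random parameters below are placeholders, not a trained model:

          import numpy as np

          d = 4                                  # toy embedding dimension
          rng = np.random.default_rng(0)
          W = 0.1 * rng.normal(size=(d, 2 * d))  # composition matrix (untrained)
          b = np.zeros(d)
          vocab = {w: rng.normal(size=d) for w in ["the", "dog", "barked"]}

          def compose(tree):
              """tree is a word string or a (left, right) pair of subtrees."""
              if isinstance(tree, str):
                  return vocab[tree]
              left, right = tree
              # p = tanh(W [c1; c2] + b): a vector label for the parent node
              return np.tanh(W @ np.concatenate([compose(left), compose(right)]) + b)

          # ((the dog) barked): the branching is still discrete; only the
          # node labels live in the vector space.
          print(compose((("the", "dog"), "barked")))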

  17. @Alex, I guess the point I was trying to make is that the propositional nature of these objects isn't an inherent part of the object, it's rather an interpretation of the object. Hagoort's objection really seems to be that 'linguaform' objects are propositional while cogsci should be using more map-like objects. But, as I pointed out, syntactic representations are not propositional qua configurations, only (possibly) qua interpretations. You're right about the 'averaging trees vs spaces' issue, though - there doesn't seem to be any reason to take syntactic representations to be non-discrete.

    ReplyDelete
  18. So there certainly are people out there who reject the idea that we should have discrete combinatorial representations of syntactic structure at all. The idea is that one can make do with, in essence, a point in a high dimensional space that corresponds to the activations of an array of abstract "neural" units. I don't buy that story (yet), because I don't see how you can do what you need syntactic structure to do using such a representation. But models based on these techniques can do quite interesting things (speech processing, machine translation, etc.), they can be learned automatically, and in the non-Gallistel part of neuroscience they are seen as more compatible with what we know of computational neuroscience. I didn't read the target article closely enough to know whether Hagoort is that radical, but with the increased success of deep learning in the last 10 years, these views are certainly gaining traction.
