Posts Tagged ‘uniform information density

21
Oct
11

More papers relevant to questions about information density

And while I am at it, let me post three more papers that are interesting for anyone interested in uniform information density and, more generally, theories of communicatively efficient language production (though most of you may already know these papers):

Lots of food for thought.
21
Oct
11

UID and text generation

Ah, just when I thought it couldn’t get any better: Uniform Information Density has been applied to text generation ;) . Have a look at this paper (thanks, Raja, for forwarding it):

According to Raja (the first author), more on this issue is in progress (e.g. an extension beyond complementizers) and future updates on this work  will be posted on Michael White’s lab at Ohio State.
29
Mar
11

Celebrating Tom Wasow: A Festschrift

At this year’s CUNY Sentence Processing Conference, Emily Bender and Jennifer Arnold presented a Festschrift celebrating Thomas Wasow. Here’s what the publisher’s site (CSLI) says (picture taken from the publisher’s website, which is hopefully ok; see the book for copyrights):

This book is a collection of papers on language processing, usage, and grammar, written in honor of Tom Wasow to commemorate his career on the occasion of his 65th birthday. Tom is a professor of linguistics and philosophy. But more accurately, he is a renaissance academic, having done work that connects with many different disciplines, including formal linguistics, sociolinguistics, historical linguistics, psycholinguistics, computational linguistics, and philosophy. Appropriately, this book reflects the diversity of Tom’s research and interests, including topics from multiple branches of linguistics and human information processing. These papers are written with minimal background assumed, so they can be used as teaching materials for beginning scholars. As such, this volume is a tribute to what is perhaps Tom’s most lasting contribution to the field—the mentorship and inspiration he provided to his students and collaborators, many of whom have contributed to this volume.

The book contains introductory and overview articles on a variety of topics in cognitive science from Emily M. Bender, Dan Flickinger, Stephan Oepen, Ash Asudeh, Peter Sells, Amy Perfors, James Paul Gee, John R. Rickford, T. Florian Jaeger, Jennifer E. Arnold, Harry J. Tily, Neal Snider, John A. Hawkins, and Susanne Riehemann.

Continue reading ‘Celebrating Tom Wasow: A Festschrift’

25
Dec
10

HLP Lab at the LSA and congratulations to Judith Degen and Masha Fedzechkina

Congratulations to Judith Degen and Masha Fedzechkina for having their two abstracts be among only twelve selected to be “media-worthy” by LSA reviewers:

  • Degen, J. and Jaeger, T. F. 2011.  Speakers sacrifice some (of the) precision in conveyed meaning to accommodate robust communication. Talk to be presented at the 2011 Meeting of the LSA.
    • Session: Pragmatics II  31
    • Room: Le Batea
    • Time: Friday 2pm

The process of encoding an intended meaning into a linguistic utterance is well-known to be affected by production pressures. We present corpus data suggesting that the choice between even two seemingly non-meaning-equivalent forms as in (1a) and (1b) can be affected by speakers’ preference to distribute information uniformly across the linguistic signal (Uniform Information Density (UID), Jaeger 2006). This suggests that even when two forms do not encode the same (but a similar enough) message, speakers may sacrifice precision in meaning for increased processing efficiency.

(1a) Alex ate some chard.
(1b) Alex ate some of the chard

  • Fedzechkina, M., Jaeger. T. F. , and Newport, E. 2011. Word order and case marking in language acquisition and processing. Poster to be presented at the 2011 Meeting of the LSA.
    • Session: Language Acquisition/Psycholinguistics/Syntax
    • Room: Grand Ballroom Foyer
    • Time: 9:00 – 10:30 AM.

To understand a sentence, comprehenders must identify its actor and patient. In principle, these relationships can be signaled using a single cue, but most languages employ several redundant cues, including word order and case marking. In artificial language learning experiments we investigate word order and case as cues in processing and learning. In languages without case marking, learners regularize word order; but when case marking is present, it is favored and limits word order regularization. Case-marking comes with a disadvantage: it is more complex to acquire. But the present results suggest that this may be outweighed by clarity for processing.

Continue reading ‘HLP Lab at the LSA and congratulations to Judith Degen and Masha Fedzechkina’

25
Mar
10

Psycholinguistics in the field 2, CUNY Poster

And here is one more poster on Yucatec, following Lindsay’s example. This is work by Lis Norcliffe, who just graduated from Stanford and join the MPI in Nijmegen. Her thesis work is on the (possibly resumptive) morphology discussed in this poster and the experiments were part of that thesis, too. You’ll find effects of definiteness and dependency length, which we investigated since they (in our view) provide evidence that this morphological reduction alternation is affected by both a preference for uniform information density and a preference for dependency minimization. Feedback welcome.

13
Dec
09

Mech Turk and Written Recall

Ah, we’ve just got back the first result of two studies that used a written recall paradigm via Mechanical Turk to test a couple of predictions of Uniform Information Density. You can see an example template of a written recall procedure here (JavaScript required). Each study took about 1 day for the equivalent of 20 participants (balanced across 4 lists) at $.02 per trial plus some boni (see below).

The next step is to implement a spoken recall paradigm. If anyone out there has already done that, let me know.

We also tested progressive payment as a way to elicit more balanced data sets. Whereas normal MechTurk data sets exhibit Zipf distributions with regard to the trials per participant, a simple progressive scheme ($.20 for at least 20 trials, $.50 for at least 40 trials, etc.) worked quite well to drastically increase the percentage of data that comes from participants who’ve done the entire experiment.

Furthermore, HLP lab manager Andrew Watts has written a little script that makes sure that each item gets only seen in one condition by each participant and that conditions are counterbalanced across participants (worker IDs). We’re still working on some details, but once it’s ready for prime time, we’ll share it here.




Blog Stats

  • 117,920 hits

 

June 2012
M T W T F S S
« May    
 123
45678910
11121314151617
18192021222324
252627282930  

Categories

RSS Language Log


Follow

Get every new post delivered to your Inbox.