I recently parsed the British National Corpus (BNC) using the latest version of the parser by Charniak’s group @ Brown. In running the results through ‘tgrep2 -p’ (i.e., building a corpus file), I ran into some troubles that I thought I’d put up here in case they save someone a bit of grief.
Blog Stats
- 117,920 hits
Categories
Top Posts
Tags
animacy
centering
coding
cogsci
collinearity
cool talks
cross-linguistic
CUNY
data analysis
Degen
Fedzechkina
field work
funding
Kleinschmidt
linear mixed models
lmer
LSA
Maya
Mechanical Turk
Mexico
mixed logit model
mixed models
multilevel logit model
multilevel models
Norcliffe
NSF
psycholinguistics
R
random effects
R code
regression
self-paced reading
sentence production
simulation
syntactic corpora
tgrep2
travel/motion
typology
uniform information density
valladolid
variation
video description
visualization
word order
Yucatec
HLP lab blog contributors:
Centers/Labs
Links
People
CyberLingBlog
- Announcing Glottolog/Langdoc, a knowledge base of 175k references for (mostly) underdescribed languages March 26, 2012 Sebastian Nordhoff
- NSF announces Building Community and Capacity for Data-Intensive Research in the SBE Sciences February 29, 2012 D Terence Langendoen
- eWAVE -- the electronic World Atlas of Varieties of English November 24, 2011 Bernd Kortmann
- SSWL (Syntactic Structures of the World's Languages) November 10, 2011 Chris Collins
- Liberman on Open Access and the three-legged stool November 10, 2011 Emily M. Bender
Language Log
- Coolly rational in a second language June 1, 2012 Julie Sedivy
- War of the 'iptivists May 31, 2012 Mark Liberman
- Big Data in the humanities and social sciences May 31, 2012 Mark Liberman
- The trouble with making linguistic claims May 31, 2012 Eric Baković
- Why "Hopefully"? May 30, 2012 Geoff Nunberg
Sociolinguistic Cognition Blog
- Workshop: Categories and Gradience: Neural Systems for Speech Communication @ Cambridge, UK
- CFP : 4th APRU Symposium on Brain and Mind Research in the Asia Pacific @ Tokyo, Japan
- CFP: NWAV 41 @ Bloomington
- CFP: Seventh International Workshop on Language Production @ NYU
- Workshop: Empirical Methods in Cognitive Linguistics 6 @ Case Western
The Lousy Linguist
- Sherlock or Watson: Advice for linguists June 1, 2012
- New Film About Native American Languages May 27, 2012
- fun and honorable May 27, 2012