I recently parsed the British National Corpus (BNC) using the latest version of the parser by Charniak’s group @ Brown. In running the results through ‘tgrep2 -p’ (i.e., building a corpus file), I ran into some troubles that I thought I’d put up here in case they save someone a bit of grief.
Blog Stats
- 158,381 hits
Categories
Top Posts
- Diagnosing collinearity in mixed models from lme4
- Nagelkerke and CoxSnell Pseudo R2 for Mixed Logit Models
- R code for Jaeger, Graff, Croft and Pontillo (2011): Mixed effect models for genetic and areal dependencies in linguistic typology: Commentary on Atkinson
- Using pyjamas to program external Mechanical Turk experiments
- Information on applying for a waiver of the J1-visa Foreign Residence Requirement
Tags
animacy
centering
coding
cogsci
collinearity
cool talks
cross-linguistic
CUNY
data analysis
Degen
eye-tracking
Fedzechkina
field work
funding
javascript
Kleinschmidt
language acquisition
linear mixed models
lmer
LSA
Maya
Mechanical Turk
mixed logit model
mixed models
multilevel logit model
multilevel models
Norcliffe
NSF
psycholinguistics
Qian
R
random effects
R code
regression
self-paced reading
sentence production
simulation
syntactic corpora
tgrep2
travel/motion
typology
uniform information density
visualization
word order
Yucatec
HLP lab blog contributors:
Centers/Labs
Links
People
CyberLingBlog
- Free Science Blog February 20, 2013 Emily M. Bender
- Crowdsourcing WALS using Linked Data September 3, 2012 Sebastian Nordhoff
- Interview: New blog for experimental statistics in corpus linguistics June 20, 2012 Emily M. Bender
- NSF/OCI Data Infrastructure Building Blocks (DIBBs) solicitation June 15, 2012 D Terence Langendoen
- Announcing Glottolog/Langdoc, a knowledge base of 175k references for (mostly) underdescribed languages March 26, 2012 Sebastian Nordhoff
Language Log
- Racist Park May 17, 2013 Victor Mair
- Misnegation of the week May 17, 2013 Mark Liberman
- "Significance", in 1885 and today May 17, 2013 Mark Liberman
- Innocent face May 17, 2013 Geoffrey K. Pullum
- Shanghainese May 16, 2013 Victor Mair
Sociolinguistic Cognition Blog
- CFP: Cognitive Modeling and Computational Linguistics @ Sofia, Bulgaria
- CFP: Production of Referring Expressions @ Berlin, Germany
- SALSA XXI Conference
- CFP: Linguistic Variability and How the Mind/Brain Accommodates It @ Ann Arbor, MI
- CFP: Variation and Contact in Languaging: Ecological and Complex Approaches @ Barcelona, Spain
The Lousy Linguist
- Book Reviews May 19, 2013
- heard tell 'bout them linguistic constructions yonder May 16, 2013
- Pullum’s NLP Lament: More Sleight of Hand Than Fact May 13, 2013