OSU Mini-institute: Corpus-based Computational Linguistics (Chris Brew and Mike White)
Slides
First day slides
Treebank overview
Corpus design
Bird, Loper and Klein's NLTK tagging slides
Third day slides
Fourth day slides
Fifth day slides
Fifth day wrapup
Activities
Using NLTK (transcript)
Searching the Penn Treebank with Tregex
Searching the PTB Answer Sheet
Searching the CCGbank with Tregex
Searching the CCGbank Answer Sheet
Doing Surgery on the Penn Treebank with Tsurgeon
Doing Surgery on the PTB Answer Files
Miscellany
Jean Veronis on Google Counts
Adam Kilgarriff on attempts to do science with Google counts: "Googleology is bad science", a squib from Computational Linguistics (2007, v33, 1)
Chris Brew
Last modified: Tue Jul 15 13:18:16 EDT 2008