Kirk Baker Homepage
Kirk Baker, PhD
kbaker@lin g.osu.edu
The Ohio State University, 2008
Department of Linguistics

LinkedIn: http://www.linkedin.com/in/bakerkirk

Specialization
Statistical Language Processing; Text Mining; Computational Lexical Semantics; Machine Learning; Scalable NLP Systems

Current Employment
I work for Collexis, Inc., a Columbia, South Carolina-based company specializing in vertical search and knowledge discovery software. I currently work full-time on a text classification project called the Research, Condition, and Disease Categorization Project (RCDC) at the National Institutes of Health.


Data

English-Korean Transliteration List [Please cite as: Baker, Kirk. 2008. English-Korean Transliteration List (v0.1). Electronic document. http://purl.oclc.org/net/kbaker/data]

Papers

  1. Multilingual Animacy Classification by Sparse Logistic Regression
    Kirk Baker and Chris Brew
    Accepted. Ohio State Working Papers in Linguistics.
  2. Lettered Words: Using Roman Letters to Create Words in Chinese
    Helena Riha and Kirk Baker
    In Variation and Change in Morphology, Rainer, Franz, Wolfgang U. Dressler, Dieter Kastovsky and Hans Christian Luschützky (eds.) John Benjamins. 2010.
  3. An Interactive Automatic Document Classification Prototype
    Kirk Baker, Archna Bhandari and Rao Thotakura
    Proceedings of the Third Workshop on Human-Computer Interaction and Information Retrieval. Washington, D.C. October 23, 2009.
  4. Statistical Identification of English Loanwords in Korean Using Automatically Generated Training Data
  5. Singular Value Decomposition Tutorial
    Kirk Baker
    Electronic document. 2005.
  6. Production and perception of glottalized vowels in Coatzospan Mixtec
    Chip Gerfen and Kirk Baker
    Journal of Phonetics. 33(3):311-334. July 2005.
  7. Constraining user response via multimodal dialog interface
    Kirk Baker, Ashley Mckenzie, Alan Biermann and Gert Webelhuth
    International Journal of Speech Technology. 7(4):251-258. October 2004.
  8. Prosodic structure and perception of Korean domain-initial coronal stops
    Kirk Baker
    International Journal of Korean Linguistics. 11:119-132. 2002.
  9. Semantic and Dialogic Annotation for Automated Multilingual Customer Service
    Hilda Hardy, Kirk Baker, Hélène Maynard, Laurence Devillers, Sophie Rosset and Tomek Strzalkowski
    In ISCA Eurospeech, Geneva, 2003.

  10. Multi-layer Dialogue Annotation for Automated Multilingual Customer Service
    Hilda Hardy, Kirk Baker, Laurence Devillers, Lori Lamel, Sophie Rosset, Tomek Strzalkowski, Cristi Ursu and Nick Webb
    In Proceedings of the ISLE Workshop on Dialogue Tagging for Multi-Modal Human Computer Interaction, Edinburgh, 2002.

  11. Segmental and Prosodically-Governed Delateralization
    Soo Jung Kim and Kirk Baker
    In Harvard Studies in Korean Linguistics VIII, S. Kuno et al. eds. Cambridge, MA: Harvard University, 152–166, 1999.

Dissertation: Multilingual Distributional Lexical Similarity