|
roses are #FF0000 |
Chris Brew |
|
Linguistics Graduate Program Rolling Calendar
My research group at The Ohio State University is focused on robust language technology.
Current Members
Kirk Baker (Computational Lexical Semantics)
Ilana Bromberg (Arabic language technology)
Jianguo Li (Computational lexical semantics)
Jihyun Park (Computational lexical semantics)
Dennis Mehay (Machine Translation evaluation, inter alia)
DJ Hovermale
Emily Jamison
We also collaborate with Detmar
Meurers, Donna
Byron Mike
White and Eric
Fosler-Lussier.
Anton Rytting CASL
Anna Feldman Montclair State University
Nathan Vaillette Hampshire College
Sabine Schulte im Walde Stuttgart
Martin Jansche Google,New York
Paul C. Davis Motorola Research
Mirella Lapata Faculty, Informatics, Edinburgh
Scott McDonald Research Staff,Informatics,Edinburgh
David Tugwell , ITRI, Brighton
Kyuchul Yoon, English Division, Kyungnam University
Jirka Hana (computational morphology and portable language technology. Advised by Carl Pollard)
Crystal Nakatsu (dialogue. Advised by Mike White since 2006, previously by me)
Na'im Tyson (Computational Phonology and Speech Science)
Chris is the local arrangements chair for ACL 2008, in Columbus.
Anton has filed his thesis, graduated, and has a postdoctoral
research position at CASL
Anna has filed her thesis and moved to a tenure-track job at Montclair State University.
Chris was the chair for NLP of HLT-EMNLP 2005, which is one of the largest and most prestigious conferences in language technology, with about 400 participants and 130 tightly-refereed research presentations.
Jihyun had a paper on her psycholinguistic work accepted for both the main session and the student research workshop of COLING/ACL 2006
Jianguo had a paper on lexical acquisition from sparse data accepted for both the main session and the student research workshop of COLING/ACL 2006
Brew,Chris and Dragomir Radev 2005
Chris Brew 2006 Language Processing, statistical methods In Encyclopedia of Language and Linguistics, 2nd Ed. Edited by Keith Brown Elsevier: Oxford ISBN 0-08-044299-4
Brew, Chris 1991. Systemic classification and its efficiency. Computational Linguistics, 17(4):375–408.
Hockenmaier, Julia and Chris Brew. 1998a. Error-Driven Segmentation of Chinese. An International Journal of the Chinese and Oriental Languages Information Processing Society, 8(1):69–84, June 1998.
McKelvie, David, Chris Brew, and Henry Thompson. 1998. Using SGML as a basis for Data-Intensive NLP. Computers and the Humanities, 31(5):367–388.
Oberlander, Jon and Chris Brew. 2000. Stochastic Natural Language Generation. Philosophical Transactions of the Royal Society, 358(1769):1373–1388, April.
Lapata, Mirella and Chris Brew 2004. Verb Class Disambiguation using Informative Priors Computational Linguistics : (3)0:1, 45-73
Yoon, Kyuchul and Chris Brew 2005 A linguistically motivated approach to grapheme-to-phoneme conversion for Korean. Computer Speech and Language. (SCI Expanded Journal List, ISSN:0885-2308) Accepted 4 March 2005. Available online 13 June 2005 at http://www.ScienceDirect.com
Anna Feldman, Jiri Hana, and Chris Brew 2006 Experiments in morphological annotation transfer Proceedings of Computational Linguistics and Intelligent Text Processing, CICLing, A. Gelbukh (editor), Lecture Notes in Computer Science, Springer-Verlag, 2006.
Brew, Chris 1990. Systemic Grammar and Partial Descriptions. In Proceedings of the 13th International Conference on Computational Linguistics, volume 1, pages 36–41, Helsinki, Finland.
Brew, Chris 1992. Letting the Cat out of the Bag: generation for shake-and-bake MT. In Proceedings of the 14th International Conference on Computational Linguistics, pages 610–616, Nantes, France.
Brew, Chris 1995. Stochastic HPSG. In Proceedings of the 7th Conference of the European Chapter of the Association for Computational Linguistics, pages 83–89, Dublin, Ireland, March 28–31. University College.
Brew, Chris 1999. An extensible visualization tool to aid treebank exploration. In Hans Uszkoreit, editor, Linguistically Interpreted Corpora, Bergen, July. European Chapter of the Association for Computational Linguistics' Post-conference workshop.
Brew, Chris and David McKelvie. 1996. Word-pair extraction for lexicography. In Kemal Oflazer and Harold Somers, editors, Proceedings of the Second International Conference on New Methods in Language Processing, pages 45–55, Ankara, September. Bilkent University.
Brew. Chris and Markus Dickinson and W. Detmar Meurers 2005 “Language and Computers”: Creating an Introduction for a General Undergraduate Audience Proceedings of the Workshop on “Effective Tools and Methodologies for Teaching Natural Language Processing And Computational Linguistics” 43rd Annual Meeting of the Association for Computational Linguistics (ACL-05) Ann Arbor, MI, USA
Brew, Chris and Sabine Schulte im Walde. 2002. Spectral Clustering for German Verbs. In Hajic and Matsumoto Proceedings of the 7th Conference on Empirical Methods in Natural Language Processing, Philadelphia, PA., pages 117–124.
Brew, Chris and Henry S. Thompson. 1994. Automatic evaluation of computer generated text: a progress report on the Texteval project. In Clifford Weinstein, editor, Proceedings of the Workshop on Human Language Technology, pages 108–113. ARPA/ISTO, March.
Davis, Paul C. and Chris Brew. 2002. Stone Soup Translation. In Proceedings of the 9th Conference on Theoretical and Methodological Issues in Machine Translation (TMI-2002), pages 31–41, Keihanna, Japan.
Feldman, Anna, Jiri Hana, and Chris Brew 2006 A cross-language approach to rapid creation of new morpho-syntactically annotatedresources. In Proceedings of the fifth International conference on Language Resources and Evaluation (LREC 2006) . Genoa, Italy
Feldman, Anna, Jiri Hana, and Chris Brew. 2005 Buy one, get one free or what to do when your linguistic resources are limited”. To appear in Proceedings of the third international seminar on Computer Treatment of Slavic and East-European Languages (Slovko 2005). Bratislava, Slovakia.
Grover, Claire, Chris Brew, Marc Moens, and Suresh Manandhar 1994 Priority union and generalisation in discourse grammar. In Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics, pages 17–24, Las Cruces, New Mexico.
Hana, Jiri, Anna Feldman and Chris Brew 2004 A Resource-light Approach to Russian Morphology: Tagging Russian using Czech resources In 2004 Conference on Empirical Methods in Natural Language Processing (EMNLP 2004), Barcelona Spain, 25–26 July 2004
Jiri Hana, Anna Feldman, Luiz Amaral, and Chris Brew. 2006 Tagging Portuguese with a Spanish Tagger Using Cognates. In Proceedings of the Workshop on Cross-language Knowledge Induction hosted in conjunction with the 11th Conference of the European Chapter of the Association for Computational Linguistics (EACL-2006). Trento, Italy.
Hockenmaier, Julia and Chris Brew. 1998b. Error-driven segmentation of Chinese. In Jin Guo, Kim Teng Lua, and Jie Xu, editors, 12th Pacific Conference on Language and Information, pages 218–229, Singapore, February. Chinese and Oriental Languages Processing Society.
Ide, Nancy and Chris Brew. 2000. Requirements Tools and Architectures for Annotated Corpora. In Proceedings of Data Architectures and Software Support for Large Corpora, pages 1–5, Paris, France
Lapata, Maria and Chris Brew. 1999. Using subcategorization to resolve verb class ambiguity. In Pascale Fung and Joe Zhou, editors, Joint SIGDAT Conference on Empirical Methods in NLP and Very Large Corpora, pages 397–404, College Park, MD
Li, Jianguo and Chris Brew. 2006 Parsing and Subcategorization Data In Proceedings of the COLING/ACL 2006,515–522 Sydney, Australia
Li, Jianguo and Chris Brew. 2005 Automatic extraction of subcategorisation frames from spoken corpora In Katrin Erk,Alissa Melinger and Sabine Schulte im Walde Interdisciplinary Workshop on the Identification and Representation of Verb Features and Verb Classes Saarland University, Germany (8 pages)
Li, Jianguo, Chris Brew and Eric Fosler-Lussier 2005 Robust Extraction of Subcategorization Data from Spoken Language In Rob Malouf International Workshop on Parsing Technologies Vancouver, Canada (1 page)
Li, Jianguo and Chris Brew. 2006 Parsing and Subcategorization Data. In Proceedings of COLING/ACL 2006 . Sydney, Australia. (8 pages)
McKelvie, David , Chris Brew, and Henry Thompson 1997. Using SGML as a basis for data-intensive NLP. In Proceedings 5th Conference on Applied Natural Language Processing, Stuttgart, Germany (7 pages)
McDonald, Scott and Chris Brew. 2001. A rational analysis of semantic processing by the left cerebral hemisphere. In First Workshop on Cognitively Plausible Models of Semantic Processing, (5 pages), Edinburgh, July.
McDonald,Scott and Chris Brew “A distributional model of semantic context effects in lexical processing”. 42nd Annual Meeting of the Association for Computational Linguistics ,Barcelona, Spain July 21-26, 2004 (7 pages)
McDonald, Scott, Richard Shillcock, and Chris Brew. 2001. Low-level predictive inference in reading: Using distributional statistics to predict eye movements. In 7th Annual Conference on Architectures and Mechanisms for Language Processing, (1 page).
Park, Jihyun and Chris Brew 2006 A Finite-State Model of Human Sentence Processing, Proceedings of Coling/ACL2006, Sydney, Australia (8 pages)
Park, Jihyun and Chris Brew 2006 A Finite-State Model of Human Sentence Processing, Proceedings of AMLAP2006, Nijmegen, the Netherlands (poster)
Penoff, Brad and Chris Brew. 2001. TREX-Q: A query language based on XML schema. In Proceedings of Institute for Research in Cognitive Science Workshop on Linguistic Databases, pages 200–209, Philadelphia,PA. (6 pages)
Poesio, Massimo, Sabine Schulte im Walde, and Chris Brew. 1998. Lexical clustering and definite description interpretation. In J.Choi and N.Green, editors, AAAI Spring Symposium on Learning for Discourse Interpretation, pages 82–89, Stanford, March. American Association for Artificial Intelligence.
Schulte im Walde, Sabine and Chris Brew. 2002. Inducing German Semantic Verb Classes from Purely Syntactic Subcategorisation Information. In ACL Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, Philadelphia., pages 223–230.
Wong, Wai Yi Peggy, Chris Brew, Shui-Duen Chan, and Mary Beckman. 2002. Using the Segmentation Corpus to define an inventory of concatenative units for Cantonese speech synthesis”. In Proceedings of the First SIGHAN Workshop on Chinese Language Processing in conjunction with the 19th International Conference on Computational Linguistics 2002. (Papers presented at the First SIGHAN Workshop on Chinese Language Processing), pages 119–123. Taipei, Taiwan.
Yoon, Kyuchul, Chris Brew, and Mary Beckman. 2002. Letter-to-sound-rules for Korean. In IEEE Workshop on Speech Synthesis, (4 pages), Santa Monica, CA, 11–13 September.
Brew C. (2003) Review of Huddleston and Pullum's Comprehensive Grammar of the English Language Computational Linguistics 29 (1): 144-147
Brew C. (2001) Review of Harry Bunt and Anton Nijholt's Advances in Probabilistic and other Parsing Technologies Computational Linguistics 27 (3):459-461
Brew C. (2001) Review of Graeme Kennedy's An Introduction to Corpus Linguistics Machine Translation 15: 259-266, 2001.
Grover, C., Brew, C., Manandhar, S., Moens, M., and Schöter, A. (1995). Priority union and generalization in discourse grammar. In A. Schöter and C. Vogel, editors, Edinburgh Working Papers in Cognitive Science, Vol. 10: Nonclassical feature system, pages 157–196. University of Edinburgh.
Brew, Chris 1993. Adding preferences to CUF. In Jochen Dörre, editor, Computational Aspects of Constraint-Based Linguistics Description, Vol. 1. ILLC/Department of Philosophy, University of Amsterdam, August. DYANA-2 Deliverable R1.2.A.
Brew, Chris 1994. Comments on Eisele: types and clauses: two styles of probabilistic processing in CUF. In Jochen Dörre, editor, Computational Aspects of Constraint-Based Linguistic Description, Vol. II. ILLC/Department of Philosophy, University of Amsterdam, pages 23–28. DYANA-2 Deliverable R1.2.B.
Brew, Chris (1993). Adding preferences to CUF. In J. Dörre, editor, Computational Aspects of Constraint-Based Linguistics Description, Vol. 1. ILLC/Department of Philosophy, University of Amsterdam. DYANA-2 Deliverable R1.2.A.
Bard, E. G., Cooper, L., Kowtko, J. C., and Brew, C. (1991). Psycholinguistic studies on the incremental recognition of speech: A revised and extended introduction to the messy and the sticky. DYANA Report R1.3.B, University of Edinburgh.
Brew, C. and Isard, S. D. (1991). Modelling the intonation contours of informal speech. DYANA Report R1.4.B, University of Edinburgh.
Brew, Chris (1991a). Bottom-up phonology: Modelling prosodic categories. In S. Bird, editor, Integrating Phonology, pages 35–55. Centre for Cognitive Science, University of Edinburgh. DYANA Report R1.2.G.
Brew, Chris (1991c). Systemic classification and its efficiency. Research Paper HCRC/RP-18, University of Edinburgh.
Brew, C. and Isard, S. D. (1990a). Principles for the automatic labelling of intonation contours. DYANA Report R1.4.A, University of Edinburgh.
Brew, C. and Isard, S. D. (1990b). Principles of intonation contour labelling. In Edinburgh Working Papers in Linguistics. Department of Linguistics, Edinburgh.
Learning lexical resources from sparsely labeled data (Chris Brew, Jianguo Li, Kirk Baker, Dennis Mehay, DJ Hovermale, Dominic Espinosa)
One strand of our research is funded by NSF with a $500k CAREER grant to work on Hybrid methods for acquisition and tuning of lexical info
.Broad coverage dictionaries and ontologies for natural language processing (NLP) are difficult and costly to create and maintain by hand. It is therefore desirable to learn them from distributional information, such as can be obtained from unlabeled or sparsely labeled text corpora. Many linguistic and psycholinguistic theories are distributional, but emphasize local neighborhood structure more than do previous NLP approaches. Successful visualization techniques such as keyword-in-context also rely on the preservation of neighborhood structure. A similar emphasis is present in emerging techniques for data reduction, such as LLE and min-cut algorithms. While the immediate goal of the project is to gain a better understanding of lexical tuning and acquisition, the resulting dictionaries, ontologies and mapping techniques have the potential to help information professionals (such as librarians, translators, patent examiners and paralegal researchers) to navigate through corpora, to understand the significance of the data that they see, and to incorporate insights derived from the data into their working practice.
WWW: http://www.ling.ohio-state.edu/~cbrew
Email: cbrew@acmsalami.org -- remove the cold meat for real address.
Address: The Ohio State University,Department of Linguistics,
1712 Neil Avenue,Oxley Hall, Columbus OH 43210-1298, USA
Telephone: (614) 292 5420 Fax: (614) 292-8833