Corpora and Corpus Annotation Tools on the WWW

collected by Markus Dickinson and Detmar Meurers (OSU), February 2002

funding for this project provided by OSU College of Humanities Seed Grant

Internal Documentation and Installed Corpora

You can find reference documentation for tools installed at OSU here

You can find a list of our installed corpora here


GUIDES TO CORPUS BUILDING


TOKENIZATION / SEGMENTATION TOOLS


TAGGERS


MORPHOLOGICAL ANALYZERS


PARSERS/CHUNKERS


TEXT ANALYSIS


VARIOUS TOOLS (ANNOTATE, SEARCH, TRANSCRIBE)


XML TOOLS


CORPORA


SYNTACTICALLY-ANNOTATED CORPORA


ONLINE CORPORA


META-SEARCHES AND OTHER ONLINE RESOURCES


note

Our system (found under: /home/corpora) corresponds to the 2-letter language codes (ISO 639) found at The XML Cover Pages


Questions or comments? Contact Markus Dickinson