Niels Ott

Computational Linguist
Projects & Tools

Projects I am/have been Involved In

  • Bedeutungsvergleich im Kontext: Komponenten einer flachen semantischen Analyse: project A4 of the collaborative research center 833 (the hand feeding me).
  • WERTi: Working With English Real Texts: An Intelligent Workbook for English (Java-based re-implementation).
  • Information Retrieval for Language Learning: A Search Engine Prototype – actually the prototype implementation for an entire research project resulting from my MA thesis.
  • Web as Corpus Toolkit: a toolkit and framework written in Perl that can be used for corpus generation both from web pages and files on disk.
  • GraleJ: the successor of Grale, which is the successor of Grisu.

Software Tools hosted Here

  • BananaSplit: a dictionary-based compound splitter for German.
  • ClusterLib: a Java library for hierarchical bottom-up clustering.
  • UIMA Utilities Package: a library that aims to make the lives of UIMA users easier by providing a number of convenience classes.
  • GenericLevenshtein: a versatile Java library implementing Minimum Edit Distance.
  • Phantom Readability Library: a library for computing traditional readability measures, including a demo GUI.
Posted by Niels Ott • 2009-10-26