Centre for Language and Speech Technology


Tilburg Memory-Based Tagger

Mbt is a memory-based tagger-generator and tagger in one. The tagger-generator part generates a sequence tagger from a training set of tagged sequences; the tagger part tags new sequences. Mbt can, for instance, be used to generate part-of-speech taggers or chunkers for natural language processing. It has also been used for named-entity recognition, information extraction in domain-specific texts, and disfluency chunking in transcribed speech.
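
To make the idea of memory-based tagging concrete, the following is a minimal toy sketch in Python. It is not Mbt's implementation or interface: the feature window (previous tag, focus word, next word), the overlap similarity, and the function names train and tag are all assumptions chosen for illustration. "Generating" a tagger here just means storing every training instance in memory; tagging a new sequence means looking up the most similar stored instances and taking a majority vote.

```python
from collections import Counter

def features(words, i, prev_tag):
    """Hypothetical context window: previous tag, focus word, next word."""
    nxt = words[i + 1] if i + 1 < len(words) else "</s>"
    return (prev_tag, words[i], nxt)

def train(tagged_sentences):
    """'Generate' a tagger by storing every training instance in memory."""
    memory = []
    for sent in tagged_sentences:
        words = [w for w, _ in sent]
        prev = "<s>"
        for i, (_, gold) in enumerate(sent):
            memory.append((features(words, i, prev), gold))
            prev = gold
    return memory

def overlap(a, b):
    """Similarity = number of feature positions with identical values."""
    return sum(x == y for x, y in zip(a, b))

def tag(memory, words):
    """Tag left to right; each decision feeds the next word's features."""
    tags, prev = [], "<s>"
    for i in range(len(words)):
        f = features(words, i, prev)
        best = max(overlap(f, g) for g, _ in memory)
        # Majority vote among all stored instances at the best overlap.
        votes = Counter(t for g, t in memory if overlap(f, g) == best)
        prev = votes.most_common(1)[0][0]
        tags.append(prev)
    return tags

# Toy training data, invented for this example.
TRAIN = [
    [("the", "DET"), ("dog", "NOUN"), ("barks", "VERB")],
    [("a", "DET"), ("cat", "NOUN"), ("sleeps", "VERB")],
    [("the", "DET"), ("cat", "NOUN"), ("barks", "VERB")],
]

memory = train(TRAIN)
print(tag(memory, ["a", "dog", "sleeps"]))  # ['DET', 'NOUN', 'VERB']
```

The unseen sentence "a dog sleeps" is tagged correctly even though it never occurs in the training data, because each word's context partially matches stored instances. A real tagger would use richer features, weighted similarity, and separate handling of unknown words.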

Mbt is used by Frog for Dutch tagging.


Download & Installation

Mbt is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation.

To download and install Mbt:

  1. First check whether your distribution's package manager includes up-to-date packages. There are packages for Debian, Ubuntu and Arch Linux.
  2. If not, we strongly recommend you use our LaMachine software distribution, which includes Mbt and all necessary dependencies, and runs on Linux, BSD and Mac OS X. It can also run as a virtual machine under any host OS.
  3. Alternatively, you can always download, compile and install Mbt manually, as shown next.

Manual installation

To compile Mbt manually, consult the included INSTALL document. You will need current versions of the following dependencies of our software:

You will also need the following third-party dependencies:



Mbt is used in:

The development and improvement of Mbt also relies on your bug reports, suggestions, and comments. Use the GitHub issue tracker or mail lamasoftware (at) science.ru.nl.