Centre for Language and Speech Technology

MBT

“Tilburg Memory-Based Tagger„

MBT is a memory-based tagger-generator and tagger in one. The tagger-generator part can generate a sequence tagger on the basis of a training set of tagged sequences; the tagger part can tag new sequences. MBT can, for instance, be used to generate part-of-speech taggers or chunkers for natural language processing. It has also been used for named-entity recognition, information extraction in domain-specific texts, and disfluency chunking in transcribed speech.

Mbt is used by Frog for Dutch tagging.

Features

Download & Installation

Mbt is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation.

To download and install Mbt:

  1. First check if there are up-to-date packages included in your distribution's package manager. There are packages for Alpine Linux, Arch Linux (AUR), macOS (homebrew), Debian and derivates like Ubuntu.
  2. If not, we recommend you use our docker container via docker pull proycon/mbt. It includes mbt and all necessary dependencies.
  3. Alternatively, you can always download, compile and install mbt manually, as shown next.

Manual installation

To compile Mbt manually consult the included INSTALL document, you will need current versions of the following dependencies of our software:

As well as the following 3rd party dependencies:

Documentation

Links

Mbt is used in:

The development and improvement of Mbt also relies on your bug reports, suggestions, and comments. Use the github issue tracker or mail lamasoftware (at) science.ru.nl

badge