Welcome to NLTK-Trainer’s documentation!¶
NLTK-Trainer is a set of Python command line scripts for natural language processing. With these scripts, you can do the following things without writing a single line of code:
- train NLTK based models
- evaluate pickled models against a corpus
- analyze a corpus
These scripts are Python 2 & 3 compatible and work with NLTK 2.0.4 and higher.
Download¶
The scripts can be downloaded from nltk-trainer on github.
Documentation¶
Books¶
Python 3 Text Processing with NLTK 3 Cookbook contains many examples for training NLTK models with & without NLTK-Trainer.
- Chapter 4 covers part-of-speech tagging and train_tagger.py.
- Chapter 5 shows how to train phrase chunkers and use train_chunker.py.
- Chapter 7 demonstrates classifier training and train_classifier.py.
Articles¶
Demos and APIs¶
Nearly all the models that power the text-processing.com NLTK demos and NLP APIs have been trained using NLTK-Trainer.