A simple markup annotator, originally designed for producing training data for supervised information extraction systems. The main class for doing this is Annotator.

This package currently also contains a couple of tokenizers, which should probably live somewhere else.

@since 1.2