proycon/pynlpl
A Python library for natural language processing including n-gram extraction, language modelling, and parsers for common NLP file formats.

PyNLPl is a Python library for natural language processing. It provides modules for basic tasks such as n-gram extraction and frequency lists, as well as more complex data types and algorithms for language modelling. It ships parsers for common NLP file formats, including FoLiA, Giza, Moses, ARPA, and Timbl, with especially extensive support for the FoLiA XML format for linguistic annotation. It also offers evaluation classes and clients for interfacing with NLP servers.
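
To make the "basic tasks" concrete, here is a minimal sketch of n-gram extraction and frequency counting. It deliberately uses only the Python standard library and is not PyNLPl's own API; the helper names `ngrams` and `frequency_list` are hypothetical and chosen for illustration only.

```python
from collections import Counter


def ngrams(tokens, n):
    """Yield every run of n consecutive tokens as a tuple.

    Hypothetical helper for illustration; not part of PyNLPl.
    """
    if n < 1:
        raise ValueError("n must be at least 1")
    # A sequence of length L contains L - n + 1 n-grams (none if L < n).
    for i in range(len(tokens) - n + 1):
        yield tuple(tokens[i:i + n])


def frequency_list(tokens, n):
    """Count how often each n-gram occurs in the token sequence."""
    return Counter(ngrams(tokens, n))


# Whitespace tokenisation is an assumption made to keep the example short.
tokens = "to be or not to be".split()
bigram_counts = frequency_list(tokens, 2)

for gram, count in bigram_counts.most_common():
    print(" ".join(gram), count)
```

The library's own modules cover the same ground with richer data types; consult its documentation for the actual class and function names.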