Version 3.5
* add support for Python 3.8
* drop support for Python 2
* create NLTK's own Tokenizer class distinct from the Treebank reference tokeniser
* update Vader sentiment analyser
* fix JSON serialization of some PoS taggers
* minor improvements in grammar.CFG, Vader, pl196x corpus reader, StringTokenizer
* change implementation of <= and >= for FreqDist so they are partial orders
* make FreqDist iterable
* correctly handle Penn Treebank trees with an unlabeled branching top node
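
The FreqDist entries above (partial-order comparisons and iteration) can be
illustrated with a minimal stdlib-only sketch. TinyFreqDist is a hypothetical
stand-in built on collections.Counter, not NLTK's actual class, and the
comparison rule shown (count-wise inclusion) is an assumption about what
"partial order" means here:

```python
from collections import Counter


class TinyFreqDist(Counter):
    """Hypothetical stand-in for a frequency distribution (not NLTK's class)."""

    def __le__(self, other):
        # Assumed rule: self <= other when every sample in self occurs
        # at least as often in other (missing samples count as 0).
        if not isinstance(other, TinyFreqDist):
            return NotImplemented
        return all(count <= other[sample] for sample, count in self.items())

    def __ge__(self, other):
        if not isinstance(other, TinyFreqDist):
            return NotImplemented
        return other <= self


a = TinyFreqDist("aab")    # a: 2, b: 1
b = TinyFreqDist("aabbc")  # a: 2, b: 2, c: 1
c = TinyFreqDist("ccd")    # c: 2, d: 1

a_le_b = a <= b  # a is contained in b
b_le_c = b <= c  # b has samples that c lacks
c_le_b = c <= b  # c has samples that b lacks, so b and c are incomparable

# Iterating over the distribution yields its samples.
samples = sorted(a)
```

Under this rule two distributions can be incomparable (neither b <= c nor
c <= b holds), which is what distinguishes a partial order from a total one.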
pkglint -r --network --only "migrate"
As a side-effect of migrating the homepages, pkglint also fixed a few
indentations in unrelated lines. These and the new homepages have been
checked manually.
Based on wip version packaged by leot, Hiramatsu Yoshifumi,
Kamel Ibn Aziz Derouiche, and myself.
NLTK - the Natural Language Toolkit - is a suite of open source
Python modules, data and documentation for research and development
in natural language processing. NLTK contains code supporting dozens
of NLP tasks, along with 30 popular corpora and extensive documentation
including a 360-page online book.