--- - branch: MAIN date: Thu Dec 15 23:15:24 UTC 2022 files: - new: '1.9' old: '1.8' path: pkgsrc/textproc/py-nltk/Makefile pathrev: pkgsrc/textproc/py-nltk/Makefile@1.9 type: modified - new: '1.5' old: '1.4' path: pkgsrc/textproc/py-nltk/PLIST pathrev: pkgsrc/textproc/py-nltk/PLIST@1.5 type: modified - new: '1.7' old: '1.6' path: pkgsrc/textproc/py-nltk/distinfo pathrev: pkgsrc/textproc/py-nltk/distinfo@1.7 type: modified id: 20221215T231524Z.19a29d0f89d3c0bb4c014c8fcc454b3a15114682 log: | py-nltk: updated to 3.8 Version 3.8 2022-12-12 * Refactor dispersion plot * Provide type hints for LazyCorpusLoader variables * Throw warning when LanguageModel is initialized with incorrect vocabulary * Fix WordNet's all_synsets() function * Resolve TreebankWordDetokenizer inconsistency with end-of-string contractions * Support both iso639-3 codes and BCP-47 language tags * Avoid DeprecationWarning in Regexp tokenizer * Fix many doctests, add doctests to CI * Fix bool field not being read in VerbNet * Greatly improve time efficiency of SyllableTokenizer when tokenizing numbers * Fix encodings of Polish udhr corpus reader * Allow TweetTokenizer to tokenize emoji flag sequences * Prevent LazyModule from increasing the size of nltk.__dict__ * Fix CoreNLPServer non-default port issue * Add "acion" suffix to the Spanish SnowballStemmer * Allow loading WordNet without OMW * Use input() in nltk.chat.chatbot() for Jupyter support * Fix edit_distance_align() in distance.py * Tackle performance and accuracy regression of sentence tokenizer since NLTK 3.6.6 * Add the Iota operator to semantic logic * Resolve critical errors in WordNet app * Resolve critical error in CHILDES Corpus * Make WordNet information_content() accept adjective satellites * Add "strict=True" parameter to CoreNLP * Resolve issue with WordNet's synset_from_sense_key * Handle WordNet synsets that were lost in mapping * Resolve TypeError in Boxer * Add function to retrieve WordNet synonyms * Warn about nonexistent OMW offsets instead of raising an error * Fix missing ic argument in res, jcn and lin similarity functions of WordNet * Add support for the extended OMW * Fix LC cutoff policy of text tiling * Optimize ConditionalFreqDist.__add__ performance * Add Markdown corpus reader module: pkgsrc subject: 'CVS commit: pkgsrc/textproc/py-nltk' unixtime: '1671146124' user: adam