Received: by mail.netbsd.org (Postfix, from userid 605) id 9928C84D66; Fri, 16 Nov 2018 13:02:23 +0000 (UTC) Received: from localhost (localhost [127.0.0.1]) by mail.netbsd.org (Postfix) with ESMTP id 8BEE184D39 for ; Fri, 16 Nov 2018 13:02:22 +0000 (UTC) X-Virus-Scanned: amavisd-new at netbsd.org Received: from mail.netbsd.org ([127.0.0.1]) by localhost (mail.netbsd.org [127.0.0.1]) (amavisd-new, port 10025) with ESMTP id TlbMoo4cfJ6P for ; Fri, 16 Nov 2018 13:02:20 +0000 (UTC) Received: from cvs.NetBSD.org (ivanova.NetBSD.org [IPv6:2001:470:a085:999:28c:faff:fe03:5984]) by mail.netbsd.org (Postfix) with ESMTP id B04A984D26 for ; Fri, 16 Nov 2018 13:02:20 +0000 (UTC) Received: by cvs.NetBSD.org (Postfix, from userid 500) id 9CB94FB1F; Fri, 16 Nov 2018 13:02:20 +0000 (UTC) Content-Transfer-Encoding: 7bit Content-Type: multipart/mixed; boundary="_----------=_1542373340148450" MIME-Version: 1.0 Date: Fri, 16 Nov 2018 13:02:20 +0000 From: "Benny Siegert" Subject: CVS commit: pkgsrc/textproc/hunspell To: pkgsrc-changes@NetBSD.org Reply-To: bsiegert@netbsd.org X-Mailer: log_accum Message-Id: <20181116130220.9CB94FB1F@cvs.NetBSD.org> Sender: pkgsrc-changes-owner@NetBSD.org List-Id: pkgsrc-changes.NetBSD.org Precedence: bulk List-Unsubscribe: This is a multi-part message in MIME format. --_----------=_1542373340148450 Content-Disposition: inline Content-Transfer-Encoding: 8bit Content-Type: text/plain; charset="UTF-8" Module Name: pkgsrc Committed By: bsiegert Date: Fri Nov 16 13:02:20 UTC 2018 Modified Files: pkgsrc/textproc/hunspell: Makefile PLIST buildlink3.mk distinfo pkgsrc/textproc/hunspell/patches: patch-src_tools_Makefile.am Log Message: Update hunspell to 1.7.0. Bump ABI_DEPENDS in bl3.mk. New features and bug fixes by Laszlo Nemeth, supported by FSF.hu Foundation: • No annoying suggestion times any more, especially in languages with compound word handling and complex morphology. By adding balanced multi-level time limits, now the guaranteed suggestion time is there within half a second, not seconds (nor dozen of seconds or more in extreme cases) for longer misspellings, too. • add SPELLML support for run-time dictionary extension with optional affixation of user words. See new "Grammar By" feature of language-specific user dictionaries of LibreOffice 6.0: News: https://wiki.documentfoundation.org/ReleaseNotes/6.0#.E2.80.9CGrammar_By.E2.80.9D_spell_checking Screencast with English example: https://www.youtube.com/watch?v=EsS3gaBTfOo Screencast with German example: https://www.youtube.com/watch?v=aYVFDqCUb6I • Improved, highly customizable suggestions on level of dictionary words: Pronunciations and typical misspellings defined by optional "ph:" fields of the dictionary words are used not only in n-gram suggestions, but as elements of the REP replacement list getting the highest priority in normal suggestions, also giving the best suggestions for short words, too. More information: see "ph:" in man 5 hunspell. • Handling multiple word suggestions is much more easier. Like in a traditional spelling dictionary, for example, to get the correct suggestion "a lot" for the typical misspelling "alot" at the first place, now it's enough to put the following line to the dic(tionary) file: a lot • Limit compound overgeneration by dictionary based word pairs: Now it's possible to filter bad compound words by listing the correct word pairs with space in the dictionary, as in a traditional spelling dictionary. • clean-up suggestion: □ no n-gram and compound word suggestions, if "good" suggestion exists, ie. uppercase, REP, ph: or dictionary word pair suggestions □ word pairs are always suggested, if they exist in the dic file □ word pairs have top priority in suggestions, and these are the only suggestions if there is no other good suggestion. □ also dictionary word pairs separated by dash instead of space are handled specially in two-word suggestion (depending from the language) • limit bad suggestions by improved n-gram suggestion rules: don't suggest capitalized dictionary words for lower case misspellings in n-gram suggestions, except □ PHONE usage, or □ in the case of German, where not only proper nouns are capitalized, or □ the capitalized word has special pronunciation and don't suggest if the difference of lengths of misspellings and suggestions is 5 or more characters. • Extend dotless i and dotted I rules to Crimean Tatar language Allow dotted I in dictionary, and disable bad capitalization of i. • BREAK: extended recursive word breaking algorithm to handle words or words with suffixes when they already contain word break characters, for example, "e-mail" is a dictionary word with a word break character, and it wasn't accepted before in compounds in some languages. • FORBIDDENWORD precedes BREAK: Now it's possible to forbid compound forms recognized by BREAK word breaking by adding the bad compounds to the dictionary with FORBIDDENWORD flags. • lower limit for "doubletwochars" suggestion algorithm: one of the typical misspellings recognized by Hunspell suggestion mechanism is the syllable duplication. Along the old pattern ABABA -> ABA, for example nutrITITIon -> nutrITIon, now also the simpler ABAB -> AB pattern is recognized in non-starting position, for example, regretTETEd -> regretTEd. • lower limit for longswapchar and movechar: recognized only max. 4-character distances to avoid slow and bad suggestions. • fix compound handling for new Hungarian orthography reform • Allow suggestion search for prefix + two suffixes: Remove artificial performance limit to get correct suggestions for relatively simple misspellings in Hungarian, etc., when the word form contains prefix and both derivative and inflectional suffixes, too: lefikszálása -> lefixálása Improvements for command-line Hunspell: • Remove false alarms during checking OpenDocument (ODF) documents by ignoring elements. (LibreOffice creates a lot of elements also within words during text reediting, resulted often huge amount of broken words before this fix.) • List filenames during filtering multiple files in command-line: Examples: $ hunspell -l *.odt a.odt: mispelling b.odt: egzample $ hunspell -l -G *.odt a.odt: good b.odt: words • Dictionary search by option -D doesn't wait for the standard input (fixed by Siva Mahadevan) Other improvements: • makealias dictionary compression: add option --minimize-diff to reuse free positions of alias lists to create minimal and readable diffs for alias compressed dictionaries stored in revision control systems, as dictionaries of LibreOffice. • Brazilian-Portuguese translation by Rafael Fontenelle • Catalan translation by robert dot buj at gmail • Minor bug fixes by several contributors, see git log To generate a diff of this commit: cvs rdiff -u -r1.29 -r1.30 pkgsrc/textproc/hunspell/Makefile cvs rdiff -u -r1.8 -r1.9 pkgsrc/textproc/hunspell/PLIST cvs rdiff -u -r1.5 -r1.6 pkgsrc/textproc/hunspell/buildlink3.mk cvs rdiff -u -r1.11 -r1.12 pkgsrc/textproc/hunspell/distinfo cvs rdiff -u -r1.1 -r1.2 \ pkgsrc/textproc/hunspell/patches/patch-src_tools_Makefile.am Please note that diffs are not public domain; they are subject to the copyright notices on the relevant files. --_----------=_1542373340148450 Content-Disposition: inline Content-Length: 5757 Content-Transfer-Encoding: binary Content-Type: text/x-diff; charset=us-ascii Modified files: Index: pkgsrc/textproc/hunspell/Makefile diff -u pkgsrc/textproc/hunspell/Makefile:1.29 pkgsrc/textproc/hunspell/Makefile:1.30 --- pkgsrc/textproc/hunspell/Makefile:1.29 Fri Oct 26 07:43:05 2018 +++ pkgsrc/textproc/hunspell/Makefile Fri Nov 16 13:02:20 2018 @@ -1,6 +1,6 @@ -# $NetBSD: Makefile,v 1.29 2018/10/26 07:43:05 leot Exp $ +# $NetBSD: Makefile,v 1.30 2018/11/16 13:02:20 bsiegert Exp $ -DISTNAME= hunspell-1.6.2 +DISTNAME= hunspell-1.7.0 CATEGORIES= textproc MASTER_SITES= ${MASTER_SITE_GITHUB:=hunspell/} GITHUB_TAG= v${PKGVERSION_NOREV} Index: pkgsrc/textproc/hunspell/PLIST diff -u pkgsrc/textproc/hunspell/PLIST:1.8 pkgsrc/textproc/hunspell/PLIST:1.9 --- pkgsrc/textproc/hunspell/PLIST:1.8 Tue Oct 23 11:45:34 2018 +++ pkgsrc/textproc/hunspell/PLIST Fri Nov 16 13:02:20 2018 @@ -1,4 +1,4 @@ -@comment $NetBSD: PLIST,v 1.8 2018/10/23 11:45:34 bsiegert Exp $ +@comment $NetBSD: PLIST,v 1.9 2018/11/16 13:02:20 bsiegert Exp $ bin/affixcompress bin/chmorph bin/hunspell @@ -12,13 +12,11 @@ bin/makealias bin/wordforms bin/wordlist2hunspell include/hunspell/atypes.hxx -include/hunspell/csutil.hxx -include/hunspell/htypes.hxx include/hunspell/hunspell.h include/hunspell/hunspell.hxx include/hunspell/hunvisapi.h include/hunspell/w_char.hxx -lib/libhunspell-1.6.la +lib/libhunspell-1.7.la lib/pkgconfig/hunspell.pc man/hu/man1/hunspell.1 man/man1/hunspell.1 Index: pkgsrc/textproc/hunspell/buildlink3.mk diff -u pkgsrc/textproc/hunspell/buildlink3.mk:1.5 pkgsrc/textproc/hunspell/buildlink3.mk:1.6 --- pkgsrc/textproc/hunspell/buildlink3.mk:1.5 Tue Oct 23 11:45:34 2018 +++ pkgsrc/textproc/hunspell/buildlink3.mk Fri Nov 16 13:02:20 2018 @@ -1,4 +1,4 @@ -# $NetBSD: buildlink3.mk,v 1.5 2018/10/23 11:45:34 bsiegert Exp $ +# $NetBSD: buildlink3.mk,v 1.6 2018/11/16 13:02:20 bsiegert Exp $ BUILDLINK_TREE+= hunspell @@ -6,7 +6,7 @@ BUILDLINK_TREE+= hunspell HUNSPELL_BUILDLINK3_MK:= BUILDLINK_API_DEPENDS.hunspell+= hunspell>=1.2.2 -BUILDLINK_ABI_DEPENDS.hunspell+= hunspell>=1.6.2 +BUILDLINK_ABI_DEPENDS.hunspell+= hunspell>=1.7.0 BUILDLINK_PKGSRCDIR.hunspell?= ../../textproc/hunspell .endif # HUNSPELL_BUILDLINK3_MK Index: pkgsrc/textproc/hunspell/distinfo diff -u pkgsrc/textproc/hunspell/distinfo:1.11 pkgsrc/textproc/hunspell/distinfo:1.12 --- pkgsrc/textproc/hunspell/distinfo:1.11 Tue Oct 23 11:45:34 2018 +++ pkgsrc/textproc/hunspell/distinfo Fri Nov 16 13:02:20 2018 @@ -1,10 +1,10 @@ -$NetBSD: distinfo,v 1.11 2018/10/23 11:45:34 bsiegert Exp $ +$NetBSD: distinfo,v 1.12 2018/11/16 13:02:20 bsiegert Exp $ -SHA1 (hunspell-1.6.2.tar.gz) = 6572d1459deafd358b886d4482c179b715bc92e8 -RMD160 (hunspell-1.6.2.tar.gz) = f966f805bfbd4b69bb10e573cc271314a0411ef6 -SHA512 (hunspell-1.6.2.tar.gz) = a23127f1271da95ac06a1fb2f57b659485e959567b61da05b2bb350684003a0fb7e882b5e524c465fd890f79f513ed03174f38611989a1c09081147c47d6da11 -Size (hunspell-1.6.2.tar.gz) = 721165 bytes +SHA1 (hunspell-1.7.0.tar.gz) = e42ea8342a191b9cd7da57d0d6ad4ae1566c5dcc +RMD160 (hunspell-1.7.0.tar.gz) = 52c7dbf21f460a0b61ea7d0378ef314773887fde +SHA512 (hunspell-1.7.0.tar.gz) = 8149b2e8b703a0610c9ca5160c2dfad3cf3b85b16b3f0f5cfcb7ebb802473b2d499e8e2d0a637a97a37a24d62424e82d3880809210d3f043fa17a4970d47c903 +Size (hunspell-1.7.0.tar.gz) = 482156 bytes SHA1 (patch-aa) = 8c6102ddb2e449b6f1abc23f679e0f6f38bfd0b5 SHA1 (patch-ab) = ee127b1d8f55ceefa807c2fa440885b4fa5d029c SHA1 (patch-ac) = c25cdfe80452cb4ca9850354c9fa8581c787c086 -SHA1 (patch-src_tools_Makefile.am) = 5a5bdd02f8acc64f0bd55c4d3ec11c2481bef851 +SHA1 (patch-src_tools_Makefile.am) = e5f67855c48e04fe12deb90904c9c27e2441a8cf Index: pkgsrc/textproc/hunspell/patches/patch-src_tools_Makefile.am diff -u pkgsrc/textproc/hunspell/patches/patch-src_tools_Makefile.am:1.1 pkgsrc/textproc/hunspell/patches/patch-src_tools_Makefile.am:1.2 --- pkgsrc/textproc/hunspell/patches/patch-src_tools_Makefile.am:1.1 Tue Oct 23 11:45:34 2018 +++ pkgsrc/textproc/hunspell/patches/patch-src_tools_Makefile.am Fri Nov 16 13:02:20 2018 @@ -1,6 +1,6 @@ -$NetBSD: patch-src_tools_Makefile.am,v 1.1 2018/10/23 11:45:34 bsiegert Exp $ +$NetBSD: patch-src_tools_Makefile.am,v 1.2 2018/11/16 13:02:20 bsiegert Exp $ ---- src/tools/Makefile.am.orig 2017-09-03 11:38:44.000000000 +0000 +--- src/tools/Makefile.am.orig 2018-11-12 20:38:56.000000000 +0000 +++ src/tools/Makefile.am @@ -1,4 +1,4 @@ -bin_PROGRAMS=analyze chmorph hunspell munch unmunch hzip hunzip @@ -10,7 +10,7 @@ $NetBSD: patch-src_tools_Makefile.am,v 1 @@ -6,8 +6,8 @@ hzip_SOURCES=hzip.cxx hunzip_SOURCES=hunzip.cxx - hunzip_LDADD = ../hunspell/libhunspell-1.6.la + hunzip_LDADD = ../hunspell/libhunspell-1.7.la -munch_SOURCES=munch.cxx munch.h -unmunch_SOURCES=unmunch.cxx unmunch.h @@ -18,15 +18,15 @@ $NetBSD: patch-src_tools_Makefile.am,v 1 +hunspell_unmunch_SOURCES=unmunch.cxx unmunch.h example_SOURCES=example.cxx - example_LDADD = ../hunspell/libhunspell-1.6.la -@@ -16,8 +16,8 @@ hunspell_SOURCES=hunspell.cxx + example_LDADD = ../hunspell/libhunspell-1.7.la +@@ -17,8 +17,8 @@ nodist_hunspell_SOURCES= ../../config.h hunspell_LDADD = @LIBINTL@ @LIBICONV@ ../parsers/libparsers.a \ - ../hunspell/libhunspell-1.6.la @CURSESLIB@ @READLINELIB@ + ../hunspell/libhunspell-1.7.la @CURSESLIB@ @READLINELIB@ -analyze_SOURCES=analyze.cxx --analyze_LDADD = ../hunspell/libhunspell-1.6.la +-analyze_LDADD = ../hunspell/libhunspell-1.7.la +hunspell_analyze_SOURCES=analyze.cxx -+hunspell_analyze_LDADD = ../hunspell/libhunspell-1.6.la ++hunspell_analyze_LDADD = ../hunspell/libhunspell-1.7.la - bulkcheck_SOURCES=bulkcheck.cxx - bulkcheck_LDADD = ../hunspell/libhunspell-1.6.la + chmorph_SOURCES=chmorph.cxx + chmorph_LDADD = ../parsers/libparsers.a ../hunspell/libhunspell-1.7.la --_----------=_1542373340148450--