Received: by mail.netbsd.org (Postfix, from userid 605)
    id 9AE3984DF8; Sun, 28 Oct 2018 03:44:08 +0000 (UTC)
Received: from localhost (localhost [127.0.0.1])
    by mail.netbsd.org (Postfix) with ESMTP id 9E6AA84DE6
    for ; Sun, 28 Oct 2018 03:44:07 +0000 (UTC)
X-Virus-Scanned: amavisd-new at netbsd.org
Received: from mail.netbsd.org ([IPv6:::1])
    by localhost (mail.netbsd.org [IPv6:::1]) (amavisd-new, port 10025)
    with ESMTP id EMKg6KzRoutR
    for ; Sun, 28 Oct 2018 03:44:06 +0000 (UTC)
Received: from cvs.NetBSD.org (ivanova.netbsd.org [199.233.217.197])
    by mail.netbsd.org (Postfix) with ESMTP id 7140B84C8D
    for ; Sun, 28 Oct 2018 03:44:06 +0000 (UTC)
Received: by cvs.NetBSD.org (Postfix, from userid 500)
    id 6D088FBEE; Sun, 28 Oct 2018 03:44:06 +0000 (UTC)
Content-Transfer-Encoding: 7bit
Content-Type: multipart/mixed; boundary="_----------=_1540698246179090"
MIME-Version: 1.0
Date: Sun, 28 Oct 2018 03:44:06 +0000
From: "Amitai Schleier"
Subject: CVS commit: pkgsrc/textproc/xapian-omega
To: pkgsrc-changes@NetBSD.org
Reply-To: schmonz@netbsd.org
X-Mailer: log_accum
Message-Id: <20181028034406.6D088FBEE@cvs.NetBSD.org>
Sender: pkgsrc-changes-owner@NetBSD.org
List-Id: pkgsrc-changes.NetBSD.org
Precedence: bulk
List-Unsubscribe:

This is a multi-part message in MIME format.

--_----------=_1540698246179090
Content-Disposition: inline
Content-Transfer-Encoding: 8bit
Content-Type: text/plain; charset="US-ASCII"

Module Name:    pkgsrc
Committed By:   schmonz
Date:           Sun Oct 28 03:44:06 UTC 2018

Modified Files:
    pkgsrc/textproc/xapian-omega: distinfo

Log Message:
Update to 1.4.8. From the changelog:

indexers:

* omindex:

  + Improve date handling in .eml files. We now handle a "Date:" header
    without the day of the week, which is allowed by RFC822 and RFC2822
    (though seems rare in practice). If the date can't be parsed, we now
    just omit the date information rather than failing to process the
    file.

  + Add support for indexing Apple iWork documents (Keynote (.key),
    Numbers (.numbers) and Pages (.pages)) using libetonyek. Currently
    only the file variants are handled since omindex doesn't currently
    support indexing a directory as a document.

  + Index Visio files using vsd2xhtml.

  + Extend --filter to support filters which produce SVG as output.

  + Handle SVG embedded in XML with svg: namespace prefix.

  + Add --read-filters option to read a list of filters from a file,
    each line of which is a rule as passed to --filter. Based on a patch
    from Gaurav Arora.

  + Add new --mime-type-match option which allows specifying a MIME
    Content-Type for a given shell filename pattern (with the special
    Content-Type values "ignore" and "skip" supported, as for
    --mime-type).

  + Adjust --mime-type to allow ':' in the extension. A valid MIME
    Content-Type can't contain a colon, so if the argument to
    --mime-type contains more than one colon it makes more sense to
    split at the *last* colon (we used to split at the first), as an
    extension could conceivably contain a colon. Mostly this change is
    for consistency with the new --mime-type-match option, where the
    leafname pattern could reasonably contain a colon.

  + Remove failed entries for ignored files. If a file is mapped to
    pseudo-mimetype "ignore" then remove any existing failure record for
    it so that we don't potentially end up with a lot of cruft failure
    records for files we are no longer trying to index.

  + If a file fails to index due to failing to allocate enough memory we
    now try to flag it as failed to index so it will be skipped by
    default on future runs. This should help to avoid indexing getting
    stuck on problematic files.

  + Add a "pages" field with the number of pages in the document where
    we know how to determine this (currently only for PDF files for
    which pdfinfo reports this information).

  + Handle initially empty database exactly the same way as when
    --overwrite is specified.
    This probably has no user-visible consequences, but it's cleaner for
    the handling to be exactly the same.

* scriptindex:

  + Improve scriptindex diagnostic messages. All diagnostics are now
    labelled as "error", "warning" or "note" as appropriate, and we now
    consistently report "FILE:LINE:" (and also "COLUMN:" in most cases)
    to make it clearer where the problem lies.

  + Add new "split" action which splits the text on a specified
    delimiter and executes the following actions for each piece. Based
    on a patch by Gaurav Arora.

  + Missing whitespace after the closing " on an action argument is now
    flagged as an error. Previously scriptindex would attempt to parse
    the following characters as the next action.

  + Support C-like escapes for quoted parameter values. Notably this
    means it is now possible to include `"` in quoted parameter values.

omega:

  + Value-based date range filters can now be specified via CGI
    parameters START.N, END.N and/or SPAN.N where N is a value slot
    number, allowing multiple concurrent filters on different slots to
    be specified.

  + Support YYYY and YYYYMM limits in term-based date ranges. Previously
    value-based date ranges supported these as limits, but term-based
    date ranges gave an error.

  + Add stem_strategy option and deprecate existing stem_all option in
    favour of this new more versatile option.

  + Support "natural" $sort option via new flag "#" which sorts embedded
    natural numbers in numerical order.

  + Support numeric $sort option via new flag "n", similar to GNU
    sort -n.

  + Rewrite field parsing to be more efficient, and store fields in an
    unordered_map for faster lookup.


To generate a diff of this commit:
cvs rdiff -u -r1.24 -r1.25 pkgsrc/textproc/xapian-omega/distinfo

Please note that diffs are not public domain; they are subject to the
copyright notices on the relevant files.
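
Two of the behaviours described in the changelog above are easy to
misread in prose: the --mime-type argument is now split at the *last*
colon, and the "#" flag for $sort orders embedded numbers numerically.
The Python sketch below illustrates both ideas only. It is not Omega's
implementation; the function names split_mime_rule and natural_key are
hypothetical, and the sample extension "a:b" is made up for
illustration.

```python
import re


def split_mime_rule(rule):
    """Split an 'EXT:TYPE' rule at its last colon (illustrative only).

    A MIME Content-Type cannot contain a colon, so everything before
    the final colon is treated as the extension or pattern.
    """
    ext, sep, mime = rule.rpartition(":")
    if not sep:
        raise ValueError("rule must contain a colon: %r" % rule)
    return ext, mime


def natural_key(text):
    """Sort key that compares embedded digit runs as integers.

    re.split with a capturing group returns text pieces at even
    indices and digit runs at odd indices, so keys from different
    strings always compare str with str and int with int.
    """
    parts = re.split(r"(\d+)", text)
    return [int(p) if i % 2 else p for i, p in enumerate(parts)]


if __name__ == "__main__":
    print(split_mime_rule("a:b:text/plain"))
    print(sorted(["doc10", "doc2", "doc1"], key=natural_key))
```

A plain string sort would place "doc10" before "doc2"; the key above
places it last, which is the ordering the changelog calls "natural".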

--_----------=_1540698246179090
Content-Disposition: inline
Content-Length: 1315
Content-Transfer-Encoding: binary
Content-Type: text/x-diff; charset=us-ascii

Modified files:

Index: pkgsrc/textproc/xapian-omega/distinfo
diff -u pkgsrc/textproc/xapian-omega/distinfo:1.24 pkgsrc/textproc/xapian-omega/distinfo:1.25
--- pkgsrc/textproc/xapian-omega/distinfo:1.24	Sun Aug 26 13:26:12 2018
+++ pkgsrc/textproc/xapian-omega/distinfo	Sun Oct 28 03:44:06 2018
@@ -1,8 +1,8 @@
-$NetBSD: distinfo,v 1.24 2018/08/26 13:26:12 schmonz Exp $
+$NetBSD: distinfo,v 1.25 2018/10/28 03:44:06 schmonz Exp $
 
-SHA1 (xapian-omega-1.4.7.tar.xz) = 12da93cbd19657922756b845bf523adc8ae4e923
-RMD160 (xapian-omega-1.4.7.tar.xz) = 36b65f362365949f37694ba744ae9dc0a833fc8f
-SHA512 (xapian-omega-1.4.7.tar.xz) = e7e2c7dfa3e76ccdde103b34aa1b0732033e9f218346a81f1b0b865c6e2d17c356b96a4522a3a2273fb55eb2fbd2dd57e835368771f46c9fb458c0c11329893b
-Size (xapian-omega-1.4.7.tar.xz) = 510644 bytes
+SHA1 (xapian-omega-1.4.8.tar.xz) = d6b355622697eaed303e238d590fca1e05055fb3
+RMD160 (xapian-omega-1.4.8.tar.xz) = 1e9d85b5118a395569e53071da222d847f527545
+SHA512 (xapian-omega-1.4.8.tar.xz) = 5d27c03891ff941b23c899b1a293ed9be0129de08ff46156380638408360ea6e4aa2dfb4fe0ffb13726807849802fe8d19f6dc817a52ec5bf50c2bf94ed94c2f
+Size (xapian-omega-1.4.8.tar.xz) = 521352 bytes
 SHA1 (patch-Makefile.in) = eea809d9403820e86478d939a2dad4c67358ebf5
 SHA1 (patch-omega.conf) = f1e941dac326de7005cad0a5a4a2848e401855c4

--_----------=_1540698246179090--