Received: from mail.netbsd.org (mail.netbsd.org [149.20.53.66]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (Client CN "mail.netbsd.org", Issuer "Postmaster NetBSD.org" (verified OK)) by mollari.NetBSD.org (Postfix) with ESMTPS id E1B79A650D for ; Thu, 2 Oct 2014 16:06:07 +0000 (UTC) Received: by mail.netbsd.org (Postfix, from userid 605) id 3215114A141; Thu, 2 Oct 2014 16:06:07 +0000 (UTC) Received: from localhost (localhost [127.0.0.1]) by mail.netbsd.org (Postfix) with ESMTP id 4F3F414A13D for ; Thu, 2 Oct 2014 16:06:03 +0000 (UTC) X-Virus-Scanned: amavisd-new at NetBSD.org Received: from mail.netbsd.org ([127.0.0.1]) by localhost (mail.NetBSD.org [127.0.0.1]) (amavisd-new, port 10025) with ESMTP id lR9cCBvX8bPh for ; Thu, 2 Oct 2014 16:06:02 +0000 (UTC) Received: from cvs.netbsd.org (cvs.NetBSD.org [IPv6:2001:4f8:3:7:2e0:81ff:fe30:95bd]) by mail.netbsd.org (Postfix) with ESMTP id 7C6E314A13A for ; Thu, 2 Oct 2014 16:06:02 +0000 (UTC) Received: by cvs.netbsd.org (Postfix, from userid 500) id 6CD1E98; Thu, 2 Oct 2014 16:06:02 +0000 (UTC) Content-Disposition: inline Content-Transfer-Encoding: 8bit Content-Type: text/plain; charset="US-ASCII" MIME-Version: 1.0 Date: Thu, 2 Oct 2014 16:06:02 +0000 From: "Adam Ciarcinski" Subject: CVS commit: pkgsrc/graphics/tesseract To: pkgsrc-changes@NetBSD.org Reply-To: adam@netbsd.org X-Mailer: log_accum Message-Id: <20141002160602.6CD1E98@cvs.netbsd.org> Sender: pkgsrc-changes-owner@NetBSD.org List-Id: pkgsrc-changes.NetBSD.org Precedence: bulk Module Name: pkgsrc Committed By: adam Date: Thu Oct 2 16:06:02 UTC 2014 Modified Files: pkgsrc/graphics/tesseract: Makefile PLIST distinfo Removed Files: pkgsrc/graphics/tesseract/files: tesseract.sh Log Message: Changes 3.02.02: * Moved ResultIterator/PageIterator to ccmain. * Added Right-to-left/Bidi capability in the output iterators for Hebrew/Arabic. * Added paragraph detection in layout analysis/post OCR. * Fixed inconsistent xheight during training and over-chopping. * Added simultaneous multi-language capability. * Refactored top-level word recognition module. * Added experimental equation detector. * Improved handling of resolution from input images. * Blamer module added for error analysis. * Cleaned up externally used namespace by removing includes from baseapi.h. * Removed dead memory mangagement code. * Tidied up constraints on control parameters. * Added support for ShapeTable in classifier and training. * Refactored class pruner. * Fixed training leaks and randomness. * Major improvements to layout analysis for better image detection, diacritic detection, better textline finding, better tabstop finding. * Improved line detection and removal. * Added fixed pitch chopper for CJK. * Added UNICHARSET to WERD_CHOICE to make mult-language handling easier. * Fixed problems with internally scaled images. * Added page and bbox to string in tr files to identify source of training data better. * Fixes to Hindi Shiroreka splitter. * Added word bigram correction. * Reduced stack memory consumption and eliminated some ugly typedefs. * Added new uniform classifier API. * Added new training error counter. * Fixed endian bug in dawg reader. * Many other fixes, including the way in which the chopper finds chops and messes with the outline while it does so. To generate a diff of this commit: cvs rdiff -u -r1.13 -r1.14 pkgsrc/graphics/tesseract/Makefile cvs rdiff -u -r1.6 -r1.7 pkgsrc/graphics/tesseract/PLIST cvs rdiff -u -r1.10 -r1.11 pkgsrc/graphics/tesseract/distinfo cvs rdiff -u -r1.1.1.1 -r0 pkgsrc/graphics/tesseract/files/tesseract.sh Please note that diffs are not public domain; they are subject to the copyright notices on the relevant files.