Received: by mail.netbsd.org (Postfix, from userid 605)
	id 0EF6984EFC; Tue, 13 Feb 2018 21:00:05 +0000 (UTC)
Received: from localhost (localhost [127.0.0.1])
	by mail.netbsd.org (Postfix) with ESMTP id 3589484DBB
	for ; Tue, 13 Feb 2018 21:00:04 +0000 (UTC)
X-Virus-Scanned: amavisd-new at netbsd.org
Received: from mail.netbsd.org ([127.0.0.1])
	by localhost (mail.netbsd.org [127.0.0.1]) (amavisd-new, port 10025)
	with ESMTP id Ehf7fi9-Z723
	for ; Tue, 13 Feb 2018 21:00:03 +0000 (UTC)
Received: from cvs.NetBSD.org (ivanova.netbsd.org [199.233.217.197])
	by mail.netbsd.org (Postfix) with ESMTP id 8913384CFC
	for ; Tue, 13 Feb 2018 21:00:03 +0000 (UTC)
Received: by cvs.NetBSD.org (Postfix, from userid 500)
	id 823C0FB40; Tue, 13 Feb 2018 21:00:03 +0000 (UTC)
Content-Transfer-Encoding: 7bit
Content-Type: multipart/mixed; boundary="_----------=_151855560368200"
MIME-Version: 1.0
Date: Tue, 13 Feb 2018 21:00:03 +0000
From: "Joerg Sonnenberger"
Subject: CVS commit: pkgsrc/textproc/py-html-sanitizer
To: pkgsrc-changes@NetBSD.org
Reply-To: joerg@netbsd.org
X-Mailer: log_accum
Message-Id: <20180213210003.823C0FB40@cvs.NetBSD.org>
Sender: pkgsrc-changes-owner@NetBSD.org
List-Id: pkgsrc-changes.NetBSD.org
Precedence: bulk
List-Unsubscribe:

This is a multi-part message in MIME format.

--_----------=_151855560368200
Content-Disposition: inline
Content-Transfer-Encoding: 8bit
Content-Type: text/plain; charset="US-ASCII"

Module Name: pkgsrc
Committed By: joerg
Date: Tue Feb 13 21:00:03 UTC 2018

Added Files:
	pkgsrc/textproc/py-html-sanitizer: DESCR Makefile PLIST distinfo

Log Message:
Add py-html-sanitizer-1.3.0:

html-sanitizer is a whitelist-based and very opinionated HTML sanitizer
that can be used both for untrusted and trusted sources. It attempts to
clean up the mess made by various rich text editors and or copy-pasting
to make styling of webpages simpler and more consistent. It builds on the
excellent HTML cleaner in lxml to make the result both valid and safe.

It goes further than pure tag filtering by transforming the HTML
fragments to normalize formatting and drop redundant or pointless tags.


To generate a diff of this commit:
cvs rdiff -u -r0 -r1.1 pkgsrc/textproc/py-html-sanitizer/DESCR \
    pkgsrc/textproc/py-html-sanitizer/Makefile \
    pkgsrc/textproc/py-html-sanitizer/PLIST \
    pkgsrc/textproc/py-html-sanitizer/distinfo

Please note that diffs are not public domain; they are subject to the
copyright notices on the relevant files.


--_----------=_151855560368200
Content-Disposition: inline
Content-Length: 3410
Content-Transfer-Encoding: binary
Content-Type: text/x-diff; charset=us-ascii

Added files:

Index: pkgsrc/textproc/py-html-sanitizer/DESCR
diff -u /dev/null pkgsrc/textproc/py-html-sanitizer/DESCR:1.1
--- /dev/null Tue Feb 13 21:00:03 2018
+++ pkgsrc/textproc/py-html-sanitizer/DESCR Tue Feb 13 21:00:03 2018
@@ -0,0 +1,8 @@
+html-sanitizer is a whitelist-based and very opinionated HTML sanitizer
+that can be used both for untrusted and trusted sources. It attempts to
+clean up the mess made by various rich text editors and or copy-pasting
+to make styling of webpages simpler and more consistent. It builds on the
+excellent HTML cleaner in lxml to make the result both valid and safe.
+
+It goes further than pure tag filtering by transforming the HTML
+fragments to normalize formatting and drop redundant or pointless tags.
Index: pkgsrc/textproc/py-html-sanitizer/Makefile
diff -u /dev/null pkgsrc/textproc/py-html-sanitizer/Makefile:1.1
--- /dev/null Tue Feb 13 21:00:03 2018
+++ pkgsrc/textproc/py-html-sanitizer/Makefile Tue Feb 13 21:00:03 2018
@@ -0,0 +1,19 @@
+# $NetBSD: Makefile,v 1.1 2018/02/13 21:00:03 joerg Exp $
+
+DISTNAME= html-sanitizer-1.3.0
+PKGNAME= ${PYPKGPREFIX}-${DISTNAME}
+CATEGORIES= textproc converters python
+MASTER_SITES= ${MASTER_SITE_PYPI:=h/html-sanitizer/}
+
+MAINTAINER= joerg@NetBSD.org
+HOMEPAGE= https://github.com/matthiask/html-sanitizer/
+COMMENT= White-list based HTML sanitizer
+LICENSE= modified-bsd
+
+USE_LANGUAGES= # none
+
+DEPENDS+= ${PYPKGPREFIX}-lxml>3:../../textproc/py-lxml
+DEPENDS+= ${PYPKGPREFIX}-beautifulsoup4-[0-9]*:../../www/py-beautifulsoup4
+
+.include "../../lang/python/egg.mk"
+.include "../../mk/bsd.pkg.mk"
Index: pkgsrc/textproc/py-html-sanitizer/PLIST
diff -u /dev/null pkgsrc/textproc/py-html-sanitizer/PLIST:1.1
--- /dev/null Tue Feb 13 21:00:03 2018
+++ pkgsrc/textproc/py-html-sanitizer/PLIST Tue Feb 13 21:00:03 2018
@@ -0,0 +1,21 @@
+@comment $NetBSD: PLIST,v 1.1 2018/02/13 21:00:03 joerg Exp $
+${PYSITELIB}/${EGG_INFODIR}/PKG-INFO
+${PYSITELIB}/${EGG_INFODIR}/SOURCES.txt
+${PYSITELIB}/${EGG_INFODIR}/dependency_links.txt
+${PYSITELIB}/${EGG_INFODIR}/requires.txt
+${PYSITELIB}/${EGG_INFODIR}/top_level.txt
+${PYSITELIB}/html_sanitizer/__init__.py
+${PYSITELIB}/html_sanitizer/__init__.pyc
+${PYSITELIB}/html_sanitizer/__init__.pyo
+${PYSITELIB}/html_sanitizer/__main__.py
+${PYSITELIB}/html_sanitizer/__main__.pyc
+${PYSITELIB}/html_sanitizer/__main__.pyo
+${PYSITELIB}/html_sanitizer/django.py
+${PYSITELIB}/html_sanitizer/django.pyc
+${PYSITELIB}/html_sanitizer/django.pyo
+${PYSITELIB}/html_sanitizer/sanitizer.py
+${PYSITELIB}/html_sanitizer/sanitizer.pyc
+${PYSITELIB}/html_sanitizer/sanitizer.pyo
+${PYSITELIB}/html_sanitizer/tests.py
+${PYSITELIB}/html_sanitizer/tests.pyc
+${PYSITELIB}/html_sanitizer/tests.pyo
Index: pkgsrc/textproc/py-html-sanitizer/distinfo
diff -u /dev/null pkgsrc/textproc/py-html-sanitizer/distinfo:1.1
--- /dev/null Tue Feb 13 21:00:03 2018
+++ pkgsrc/textproc/py-html-sanitizer/distinfo Tue Feb 13 21:00:03 2018
@@ -0,0 +1,6 @@
+$NetBSD: distinfo,v 1.1 2018/02/13 21:00:03 joerg Exp $
+
+SHA1 (html-sanitizer-1.3.0.tar.gz) = 67ffae88b696cfb623d6778d4ae6bd6ac792d250
+RMD160 (html-sanitizer-1.3.0.tar.gz) = 17dd857a28bea1a1af2540e5c7267810d07dcd51
+SHA512 (html-sanitizer-1.3.0.tar.gz) = cc231f2257300962cda3dea29dc37943bafb72a07b157cc3e47222594e4b01d6d3a5e0688696277d6617a4f160987f5d13220dca1a6c5528d1b472ccdd392483
+Size (html-sanitizer-1.3.0.tar.gz) = 10953 bytes

--_----------=_151855560368200--
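
The distinfo file added in this commit records a size and several digests for the distfile. As an illustration only (this is not how pkgsrc itself performs the check), the sketch below parses lines in that format and compares a downloaded tarball against the Size, SHA1, and SHA512 entries. The function names parse_distinfo and verify are hypothetical, and RMD160 is deliberately skipped because, as far as I know, RIPEMD-160 is not guaranteed to be available in every Python hashlib build.

```python
import hashlib
import re

# Matches lines like "SHA512 (name.tar.gz) = abc..." and
# "Size (name.tar.gz) = 10953 bytes"; the "$NetBSD$" tag line does not match.
_LINE = re.compile(r"^(\w+) \((.+)\) = (\S+)(?: bytes)?$")


def parse_distinfo(text):
    """Return {filename: {algorithm: value}} for each checksum line (hypothetical helper)."""
    entries = {}
    for line in text.splitlines():
        match = _LINE.match(line.strip())
        if match:
            algo, name, value = match.groups()
            entries.setdefault(name, {})[algo] = value
    return entries


def verify(data, expected):
    """Compare raw file bytes with the Size/SHA1/SHA512 entries that are present.

    Returns {algorithm: bool}. RMD160 is intentionally not checked here
    (assumption: it may be missing from the local hashlib).
    """
    results = {}
    if "Size" in expected:
        results["Size"] = len(data) == int(expected["Size"])
    for algo in ("SHA1", "SHA512"):
        if algo in expected:
            digest = hashlib.new(algo.lower(), data).hexdigest()
            results[algo] = digest == expected[algo]
    return results
```

A possible use, assuming the tarball has already been fetched into the current directory: read distinfo as text, call parse_distinfo on it, then pass the bytes of html-sanitizer-1.3.0.tar.gz together with the entry for that filename to verify and confirm that every value in the result is True.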