From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from lists.gentoo.org (pigeon.gentoo.org [208.92.234.80]) by finch.gentoo.org (Postfix) with ESMTP id 509B91397E6 for ; Mon, 17 Aug 2015 22:29:09 +0000 (UTC) Received: from pigeon.gentoo.org (localhost [127.0.0.1]) by pigeon.gentoo.org (Postfix) with SMTP id 5ADA31419B; Mon, 17 Aug 2015 22:29:06 +0000 (UTC) Received: from smtp.gentoo.org (smtp.gentoo.org [140.211.166.183]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by pigeon.gentoo.org (Postfix) with ESMTPS id EA68D1419B for ; Mon, 17 Aug 2015 22:29:05 +0000 (UTC) Received: from oystercatcher.gentoo.org (unknown [IPv6:2a01:4f8:202:4333:225:90ff:fed9:fc84]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by smtp.gentoo.org (Postfix) with ESMTPS id 88C3E34098F for ; Mon, 17 Aug 2015 22:29:04 +0000 (UTC) Received: from localhost.localdomain (localhost [127.0.0.1]) by oystercatcher.gentoo.org (Postfix) with ESMTP id 805AC116 for ; Mon, 17 Aug 2015 22:29:00 +0000 (UTC) From: "James Le Cuirot" To: gentoo-commits@lists.gentoo.org Content-Transfer-Encoding: 8bit Content-type: text/plain; charset=UTF-8 Reply-To: gentoo-dev@lists.gentoo.org, "James Le Cuirot" Message-ID: <1439850519.6357034e4caf72d9b29c3666a94a31f57bbddb54.chewi@gentoo> Subject: [gentoo-commits] repo/gentoo:master commit in: dev-java/htmlcleaner/ X-VCS-Repository: repo/gentoo X-VCS-Files: dev-java/htmlcleaner/Manifest dev-java/htmlcleaner/htmlcleaner-2.13.ebuild dev-java/htmlcleaner/metadata.xml X-VCS-Directories: dev-java/htmlcleaner/ X-VCS-Committer: chewi X-VCS-Committer-Name: James Le Cuirot X-VCS-Revision: 6357034e4caf72d9b29c3666a94a31f57bbddb54 X-VCS-Branch: master Date: Mon, 17 Aug 2015 22:29:00 +0000 (UTC) Precedence: bulk List-Post: List-Help: List-Unsubscribe: List-Subscribe: List-Id: Gentoo Linux mail X-BeenThere: gentoo-commits@lists.gentoo.org X-Archives-Salt: 726760b9-4119-44c5-8fae-427541244a57 X-Archives-Hash: 7caa943c7b9bcefd6e404b84660024d4 commit: 6357034e4caf72d9b29c3666a94a31f57bbddb54 Author: James Le Cuirot gentoo org> AuthorDate: Mon Aug 17 22:28:39 2015 +0000 Commit: James Le Cuirot gentoo org> CommitDate: Mon Aug 17 22:28:39 2015 +0000 URL: https://gitweb.gentoo.org/repo/gentoo.git/commit/?id=6357034e dev-java/htmlcleaner: Imported and bumped from java-overlay Closes bug #369977. Package-Manager: portage-2.2.20.1 dev-java/htmlcleaner/Manifest | 1 + dev-java/htmlcleaner/htmlcleaner-2.13.ebuild | 64 ++++++++++++++++++++++++++++ dev-java/htmlcleaner/metadata.xml | 6 +++ 3 files changed, 71 insertions(+) diff --git a/dev-java/htmlcleaner/Manifest b/dev-java/htmlcleaner/Manifest new file mode 100644 index 0000000..972d136 --- /dev/null +++ b/dev-java/htmlcleaner/Manifest @@ -0,0 +1 @@ +DIST htmlcleaner-2.13-src.zip 346998 SHA256 e51daa50eb65b5a92467bf794a6f82b667fe7ca1d1d58ba046ae234e45d9c3a0 SHA512 bf9cc014688ebb20b0bafba82fc710f25c1db8c173dd3e9f742c31c64f127c3c89b29b151e64a3368b04d278733cce8828831f297834ca9b872a60951e0a2d47 WHIRLPOOL 2b2792607e2c4f5f3dc8d2b7d7035c3f762ac1355724811b8a743ad2a0cd0f6fecf5392be8c58632b697a8529b55f60721dc63128c647aad7faa0e191c9056ce diff --git a/dev-java/htmlcleaner/htmlcleaner-2.13.ebuild b/dev-java/htmlcleaner/htmlcleaner-2.13.ebuild new file mode 100644 index 0000000..fa953f2 --- /dev/null +++ b/dev-java/htmlcleaner/htmlcleaner-2.13.ebuild @@ -0,0 +1,64 @@ +# Copyright 1999-2015 Gentoo Foundation +# Distributed under the terms of the GNU General Public License v2 +# $Id$ + +EAPI=5 + +JAVA_PKG_IUSE="doc source" + +inherit java-pkg-2 java-pkg-simple + +DESCRIPTION="HTML parser written in Java that can be used as a tool, library or Ant task" +HOMEPAGE="http://htmlcleaner.sourceforge.net/" +SRC_URI="mirror://sourceforge/htmlcleaner/files/${P}-src.zip" + +LICENSE="BSD" +SLOT="0" +KEYWORDS="~amd64 ~x86" +IUSE="test" + +CDEPEND="dev-java/jdom:2" + +DEPEND="${CDEPEND} + >=virtual/jdk-1.5 + app-arch/unzip + dev-java/ant-core:0 + test? ( dev-java/junit:4 )" + +RDEPEND="${CDEPEND} + >=virtual/jre-1.5" + +JAVA_SRC_DIR="src/main/java" +JAVA_GENTOO_CLASSPATH="jdom-2" + +src_prepare() { + # Don't require default.xml to be in the current directory. + sed -i "s:\"default\.xml\":\"${JAVA_PKG_SHAREPATH}/default.xml\":g" \ + src/main/java/org/htmlcleaner/ConfigFileTagProvider.java || die +} + +src_configure() { + JAVA_CLASSPATH_EXTRA=$(java-pkg_getjars --build-only ant-core) +} + +src_install() { + java-pkg-simple_src_install + java-pkg_register-ant-task + java-pkg_dolauncher ${PN} --main org.${PN}.CommandLine + + insinto "${JAVA_PKG_SHAREPATH}" + newins example.xml default.xml +} + +src_test() { + local DIR="src/test/java" + local CP="${DIR}:${PN}.jar:$(java-pkg_getjars junit-4,${JAVA_GENTOO_CLASSPATH})" + + local TESTS=$(find "${DIR}" -name "*Test.java" ! -name "Abstract*") + TESTS="${TESTS//src\/test\/java\/}" + TESTS="${TESTS//.java}" + TESTS="${TESTS//\//.}" + + ejavac -classpath "${CP}" -d "${DIR}" $(find "${DIR}" -name "*.java") + ejunit4 -classpath "${CP}" ${TESTS} +} diff --git a/dev-java/htmlcleaner/metadata.xml b/dev-java/htmlcleaner/metadata.xml new file mode 100644 index 0000000..580cafc --- /dev/null +++ b/dev-java/htmlcleaner/metadata.xml @@ -0,0 +1,6 @@ + + + + java + HtmlCleaner is an open source HTML parser written in Java. HTML found on the Web is usually dirty, ill-formed and unsuitable for further processing. For any serious consumption of such documents, it is necessary to first clean up the mess and bring some order to the tags, attributes and ordinary text. For any given HTML document, HtmlCleaner reorders individual elements and produces well-formed XML. By default, it follows similar rules that the most of web browsers use in order to create the Document Object Model. However, you can provide custom tag and rule sets for tag filtering and balancing. +