From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from lists.gentoo.org (pigeon.gentoo.org [208.92.234.80]) by finch.gentoo.org (Postfix) with ESMTP id B5416138CD1 for ; Tue, 26 May 2015 13:57:23 +0000 (UTC) Received: from pigeon.gentoo.org (localhost [127.0.0.1]) by pigeon.gentoo.org (Postfix) with SMTP id 58278E08DC; Tue, 26 May 2015 13:57:22 +0000 (UTC) Received: from smtp.gentoo.org (smtp.gentoo.org [140.211.166.183]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by pigeon.gentoo.org (Postfix) with ESMTPS id 0548CE08DC for ; Tue, 26 May 2015 13:57:21 +0000 (UTC) Received: from oystercatcher.gentoo.org (oystercatcher.gentoo.org [148.251.78.52]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by smtp.gentoo.org (Postfix) with ESMTPS id 6F1E9340C21 for ; Tue, 26 May 2015 13:57:20 +0000 (UTC) Received: from localhost.localdomain (localhost [127.0.0.1]) by oystercatcher.gentoo.org (Postfix) with ESMTP id 2287B9C8 for ; Tue, 26 May 2015 13:57:19 +0000 (UTC) From: "Martin Mokrejs" To: gentoo-commits@lists.gentoo.org Content-Transfer-Encoding: 8bit Content-type: text/plain; charset=UTF-8 Reply-To: gentoo-dev@lists.gentoo.org, "Martin Mokrejs" Message-ID: <1432648555.0322af051993bb9cfbcf22a9e544bc01b18a54a0.mmokrejs@gentoo> Subject: [gentoo-commits] proj/sci:master commit in: sci-biology/blue/ X-VCS-Repository: proj/sci X-VCS-Files: sci-biology/blue/ChangeLog sci-biology/blue/blue-1.1.2.ebuild sci-biology/blue/blue-1.1.3.ebuild X-VCS-Directories: sci-biology/blue/ X-VCS-Committer: mmokrejs X-VCS-Committer-Name: Martin Mokrejs X-VCS-Revision: 0322af051993bb9cfbcf22a9e544bc01b18a54a0 X-VCS-Branch: master Date: Tue, 26 May 2015 13:57:19 +0000 (UTC) Precedence: bulk List-Post: List-Help: List-Unsubscribe: List-Subscribe: List-Id: Gentoo Linux mail X-BeenThere: gentoo-commits@lists.gentoo.org X-Archives-Salt: 1bdb32a1-ac7e-4ec9-b2eb-896d70b719de X-Archives-Hash: 6ad862e1f2b43be58818a33f252778fa commit: 0322af051993bb9cfbcf22a9e544bc01b18a54a0 Author: Martin Mokrejš fold natur cuni cz> AuthorDate: Tue May 26 13:55:55 2015 +0000 Commit: Martin Mokrejs fold natur cuni cz> CommitDate: Tue May 26 13:55:55 2015 +0000 URL: https://gitweb.gentoo.org/proj/sci.git/commit/?id=0322af05 sci-biology/blue: version bump, updated HOMEPAGE and SRC_URI Package-Manager: portage-2.2.18 sci-biology/blue/ChangeLog | 6 ++++++ sci-biology/blue/blue-1.1.2.ebuild | 12 ++++++++---- sci-biology/blue/blue-1.1.3.ebuild | 38 ++++++++++++++++++++++++++++++++++++++ 3 files changed, 52 insertions(+), 4 deletions(-) diff --git a/sci-biology/blue/ChangeLog b/sci-biology/blue/ChangeLog index cbcd718..97f6844 100644 --- a/sci-biology/blue/ChangeLog +++ b/sci-biology/blue/ChangeLog @@ -2,6 +2,12 @@ # Copyright 1999-2015 Gentoo Foundation; Distributed under the GPL v2 # $Header: $ +*blue-1.1.3 (26 May 2015) + + 26 May 2015; Martin Mokrejs +blue-1.1.3.ebuild, + blue-1.1.2.ebuild: + sci-biology/blue: version bump, updated HOMEPAGE and SRC_URI + 17 Mar 2015; Martin Mokrejs blue-1.1.2.ebuild: improved comments based on email from upstream diff --git a/sci-biology/blue/blue-1.1.2.ebuild b/sci-biology/blue/blue-1.1.2.ebuild index 5b9ed96..945a35d 100644 --- a/sci-biology/blue/blue-1.1.2.ebuild +++ b/sci-biology/blue/blue-1.1.2.ebuild @@ -5,11 +5,11 @@ EAPI=5 DESCRIPTION="Error corrector for Illummina and Roche/454 able to also fix insertions and deletions" -HOMEPAGE="http://www.csiro.au/Outcomes/ICT-and-Services/Software/Blue.aspx" +HOMEPAGE="http://www.bioinformatics.csiro.au/blue" SRC_URI=" - http://www.csiro.au/~/media/CSIROau/Images/Bioinformatics/Blue_software/Version_1-1-2/Linux112.ashx -> ${P}.tar.gz - http://www.csiro.au/~/media/CSIROau/Images/Bioinformatics/Blue_software/Version_1-1-2/Correcting_reads_with_Blue.ashx -> Correcting_reads_with_Blue.pdf" - + http://www.bioinformatics.csiro.au/public/files/Linux-${PV}.tar.gz -> ${P}.tar.gz + http://www.bioinformatics.csiro.au/public/files/Correcting_reads_with_Blue.pdf -> ${P}-Correcting_reads_with_Blue.pdf + http://www.bioinformatics.csiro.au/public/files/Changes_for_Blue_1.1.3.pdf -> ${P}-Changes_for_Blue_${PV}.pdf" LICENSE="GPL-2+" SLOT="0" KEYWORDS="" @@ -29,6 +29,10 @@ src_compile(){ src_install(){ dobin Tessel.exe GenerateMerPairs.exe Blue.exe # one could install also the *.exe.so files for speedup into /usr/bin/ + + # The -hp option sets a flag that is checked when Blue is scanning along a read trying to find errors that could be corrected. There are a number of tests done at every base position, all based on depth of coverage. These tests will pick up random indel errors, but indels are so common at the end of homopolymer runs in 454 and IonTorrent data that multiple hp run lengths all look to be OK. For example, if our genome had AAAAAA then with Illumina data this is what we'd see almost all the time, with very rare indels at the end of the hp run resulting in runs of 5 or 7 As. With 454-like data, we'd probably get 5 As as frequently as 6 As so depth of coverage would say that neither of them are errors. The -hp flag looks out for the end of hp runs and forces an attempt at correction at that point. If the read wasn't in error, then no correction will be made. + + # In general Blue will correct Ns - if a correct replacement can be found. The only time it doesn't do this is if there are too many consecutive Ns - as the process of finding likely replacements is combinatoric and the cost goes up exponentially with the number of consecutive Ns. In these cases, the read is abandoned and passed through uncorrected. Can you give an example of a read that you would like to see corrected differently? } # it's author uses 'mono-sgen /usr/bin/Blue.exe' to use the alternative garbage collector diff --git a/sci-biology/blue/blue-1.1.3.ebuild b/sci-biology/blue/blue-1.1.3.ebuild new file mode 100644 index 0000000..945a35d --- /dev/null +++ b/sci-biology/blue/blue-1.1.3.ebuild @@ -0,0 +1,38 @@ +# Copyright 1999-2015 Gentoo Foundation +# Distributed under the terms of the GNU General Public License v2 +# $Header: $ + +EAPI=5 + +DESCRIPTION="Error corrector for Illummina and Roche/454 able to also fix insertions and deletions" +HOMEPAGE="http://www.bioinformatics.csiro.au/blue" +SRC_URI=" + http://www.bioinformatics.csiro.au/public/files/Linux-${PV}.tar.gz -> ${P}.tar.gz + http://www.bioinformatics.csiro.au/public/files/Correcting_reads_with_Blue.pdf -> ${P}-Correcting_reads_with_Blue.pdf + http://www.bioinformatics.csiro.au/public/files/Changes_for_Blue_1.1.3.pdf -> ${P}-Changes_for_Blue_${PV}.pdf" +LICENSE="GPL-2+" +SLOT="0" +KEYWORDS="" +IUSE="" + +DEPEND="dev-lang/mono" +RDEPEND="${DEPEND}" + +S="${WORKDIR}"/Linux + +src_compile(){ + Blue/compile.sh || die + Tessel/compile.sh || die + GenerateMerPairs/compile.sh || die +} + +src_install(){ + dobin Tessel.exe GenerateMerPairs.exe Blue.exe + # one could install also the *.exe.so files for speedup into /usr/bin/ + + # The -hp option sets a flag that is checked when Blue is scanning along a read trying to find errors that could be corrected. There are a number of tests done at every base position, all based on depth of coverage. These tests will pick up random indel errors, but indels are so common at the end of homopolymer runs in 454 and IonTorrent data that multiple hp run lengths all look to be OK. For example, if our genome had AAAAAA then with Illumina data this is what we'd see almost all the time, with very rare indels at the end of the hp run resulting in runs of 5 or 7 As. With 454-like data, we'd probably get 5 As as frequently as 6 As so depth of coverage would say that neither of them are errors. The -hp flag looks out for the end of hp runs and forces an attempt at correction at that point. If the read wasn't in error, then no correction will be made. + + # In general Blue will correct Ns - if a correct replacement can be found. The only time it doesn't do this is if there are too many consecutive Ns - as the process of finding likely replacements is combinatoric and the cost goes up exponentially with the number of consecutive Ns. In these cases, the read is abandoned and passed through uncorrected. Can you give an example of a read that you would like to see corrected differently? +} + +# it's author uses 'mono-sgen /usr/bin/Blue.exe' to use the alternative garbage collector