bio

AlphaFold

https://deepmind.com/research/case-studies/alphafold

Package description

AlphaFold can predict protein structures with atomic accuracy even where no similar structure is known

Use latest version

module load AlphaFold

Use specific version

module load AlphaFold/2.2.2-foss-2021b-CUDA-11.4.1

AlphaLink

https://github.com/lhatsk/AlphaLink

Package description

AlphaLink predicts protein structures using deep learning given a sequence and a set of experimental contacts. It extends OpenFold with crosslinking MS data or other experimental distance restraint by explicitly incorporating them in the OpenFold architecture.

Use latest version

module load AlphaLink

Use specific version

module load AlphaLink/1.0-foss-2021a-CUDA-11.3.1

module load AlphaLink/1.0.122023-foss-2021a-CUDA-11.3.1

BCFtools

https://www.htslib.org/

Package description

Samtools is a suite of programs for interacting with high-throughput sequencing data. BCFtools - Reading/writing BCF2/VCF/gVCF files and calling/filtering/summarising SNP and short indel sequence variants

Use latest version

module load BCFtools

Use specific version

module load BCFtools/1.11-GCC-10.2.0

module load BCFtools/1.14-GCC-11.2.0

module load BCFtools/1.17-GCC-12.2.0

BCL

http://www.meilerlab.org/index.php/bclcommons/show/b_apps_id/1

Package description

The Bio Chemical Library (BCL) is a software package that provides unique tools for biological research, such as protein structure determination from sparse experimental data. The BCL contains the widely used secondary structure prediction program JUFO, a folding algorithm that assembles secondary structure elements, and loop construction tools that complete protein backbones.

Use latest version

module load BCL

Use specific version

module load BCL/4.0.0-GCCcore-10.2.0

module load BCL/4.2.0-GCCcore-11.2.0

module load BCL/4.3.0-GCCcore-11.2.0-Python-2.7.18

BEDOPS

http://bedops.readthedocs.io/en/latest/index.html

Package description

BEDOPS is an open-source command-line toolkit that performs highly efficient and scalable Boolean and other set operations, statistical calculations, archiving, conversion and other management of genomic data of arbitrary scale. Tasks can be easily split by chromosome for distributing whole-genome analyses across a computational cluster.

Use latest version

module load BEDOPS

Use specific version

module load BEDOPS/2.4.41-foss-2021b

BEDTools

https://bedtools.readthedocs.io/

Package description

BEDTools: a powerful toolset for genome arithmetic. The BEDTools utilities allow one to address common genomics tasks such as finding feature overlaps and computing coverage. The utilities are largely based on four widely-used file formats: BED, GFF/GTF, VCF, and SAM/BAM.

Use latest version

module load BEDTools

Use specific version

module load BEDTools/2.30.0-GCC-10.2.0

module load BEDTools/2.30.0-GCC-11.2.0

module load BEDTools/2.30.0-GCC-11.3.0

BLAST

http://blast.ncbi.nlm.nih.gov/

Package description

Basic Local Alignment Search Tool, or BLAST, is an algorithm for comparing primary biological sequence information, such as the amino-acid sequences of different proteins or the nucleotides of DNA sequences.

Use latest version

module load BLAST

Use specific version

module load BLAST/2.2.26-Linux_x86_64

BLAST+

https://blast.ncbi.nlm.nih.gov/

Package description

Basic Local Alignment Search Tool, or BLAST, is an algorithm for comparing primary biological sequence information, such as the amino-acid sequences of different proteins or the nucleotides of DNA sequences.

Use latest version

module load BLAST+

Use specific version

module load BLAST+/2.11.0-gompi-2020b

module load BLAST+/2.12.0-gompi-2021b

BWA

http://bio-bwa.sourceforge.net/

Package description

Burrows-Wheeler Aligner (BWA) is an efficient program that aligns relatively short nucleotide sequences against a long reference sequence such as the human genome.

Use latest version

module load BWA

Use specific version

module load BWA/0.7.17-GCCcore-11.2.0

BamTools

https://github.com/pezmaster31/bamtools

Package description

BamTools provides both a programmer's API and an end-user's toolkit for handling BAM files.

Use latest version

module load BamTools

Use specific version

module load BamTools/2.5.1-GCC-10.2.0

module load BamTools/2.5.2-GCC-11.2.0

module load BamTools/2.5.2-GCC-11.3.0

Beagle

https://faculty.washington.edu/browning/beagle/beagle.html

Package description

Beagle is a software package for phasing genotypes and for imputing ungenotyped markers.

Use latest version

module load Beagle

Use specific version

module load Beagle/5.4.22Jul22.46e-Java-11

Beast

https://beast2.org

Package description

BEAST is a cross-platform program for Bayesian MCMC analysis of molecular sequences. It is entirely orientated towards rooted, time-measured phylogenies inferred using strict or relaxed molecular clock models. It can be used as a method of reconstructing phylogenies but is also a framework for testing evolutionary hypotheses without conditioning on a single tree topology. BEAST uses MCMC to average over tree space, so that each tree is weighted proportional to its posterior probability.

Use latest version

module load Beast

Use specific version

module load Beast/1.10.4-GCC-10.2.0

module load Beast/2.7.4-GCC-12.2.0

Bio-DB-HTS

https://metacpan.org/release/Bio-DB-HTS

Package description

Read files using HTSlib including BAM/CRAM, Tabix and BCF database files

Use latest version

module load Bio-DB-HTS

Use specific version

module load Bio-DB-HTS/3.01-GCC-11.2.0

BioPerl

https://bioperl.org/

Package description

Bioperl is the product of a community effort to produce Perl code which is useful in biology. Examples include Sequence objects, Alignment objects and database searching objects.

Use latest version

module load BioPerl

Use specific version

module load BioPerl/1.7.8-GCCcore-11.2.0

Biopython

https://www.biopython.org

Package description

Biopython is a set of freely available tools for biological computation written in Python by an international team of developers. It is a distributed collaborative effort to develop Python libraries and applications which address the needs of current and future work in bioinformatics.

Use latest version

module load Biopython

Use specific version

module load Biopython/1.75-fosscuda-2019b-Python-3.7.4

module load Biopython/1.78-foss-2020b

module load Biopython/1.79-foss-2021a

module load Biopython/1.79-foss-2021b

module load Biopython/1.79-foss-2022a

Bowtie

http://bowtie-bio.sourceforge.net/index.shtml

Package description

Bowtie is an ultrafast, memory-efficient short read aligner. It aligns short DNA sequences (reads) to the human genome.

Use latest version

module load Bowtie

Use specific version

module load Bowtie/1.3.1-GCC-11.3.0

Bowtie2

https://bowtie-bio.sourceforge.net/bowtie2/index.shtml

Package description

Bowtie 2 is an ultrafast and memory-efficient tool for aligning sequencing reads to long reference sequences. It is particularly good at aligning reads of about 50 up to 100s or 1,000s of characters, and particularly good at aligning to relatively long (e.g. mammalian) genomes. Bowtie 2 indexes the genome with an FM Index to keep its memory footprint small: for the human genome, its memory footprint is typically around 3.2 GB. Bowtie 2 supports gapped, local, and paired-end alignment modes.

Use latest version

module load Bowtie2

Use specific version

module load Bowtie2/2.4.5-GCC-11.3.0

CNVkit

https://github.com/etal/cnvkit

Package description

A command-line toolkit and Python library for detecting copy number variants and alterations genome-wide from high-throughput sequencing.

Use latest version

module load CNVkit

Use specific version

module load CNVkit/0.9.8-foss-2020b-R-4.0.3

module load CNVkit/0.9.10-foss-2022a-R-4.2.1

Clustal-Omega

http://www.clustal.org/omega/

Package description

Clustal Omega is a multiple sequence alignment program for proteins. It produces biologically meaningful multiple sequence alignments of divergent sequences. Evolutionary relationships can be seen via viewing Cladograms or Phylograms

Use latest version

module load Clustal-Omega

Use specific version

module load Clustal-Omega/1.2.4-GCC-11.2.0

ClustalW2

https://www.ebi.ac.uk/Tools/msa/clustalw2/

Package description

ClustalW2 is a general purpose multiple sequence alignment program for DNA or proteins.

Use latest version

module load ClustalW2

Use specific version

module load ClustalW2/2.1-GCC-11.2.0

Crumble

https://github.com/jkbonfield/crumble

Package description

Exploration of controlled loss of quality values for compressing CRAM files

Use latest version

module load Crumble

Use specific version

module load Crumble/0.8.3-GCC-11.2.0

DROP

https://gagneurlab-drop.readthedocs.io

Package description

Pipeline to find aberrant events in RNA-Seq data, useful for diagnosis of rare disorders

Use latest version

module load DROP

Use specific version

module load DROP/1.1.1-foss-2021b-R-4.1.2

module load DROP/1.3.0-foss-2021b-R-4.1.2

ExpansionHunter

https://github.com/Illumina/ExpansionHunter

Package description

There are a number of regions in the human genome consisting of repetitions of short unit sequence (commonly a trimer). Such repeat regions can expand to a size much larger than the read length and thereby cause a disease. Fragile X Syndrome, ALS, and Huntington's Disease are well known examples. Expansion Hunter aims to estimate sizes of such repeats by performing a targeted search through a BAM/CRAM file for reads that span, flank, and are fully contained in each repeat.

Use latest version

module load ExpansionHunter

Use specific version

module load ExpansionHunter/5.0.0-GCCcore-10.3.0

FastQC

https://www.bioinformatics.babraham.ac.uk/projects/fastqc/

Package description

FastQC is a quality control application for high throughput sequence data. It reads in sequence data in a variety of formats and can either provide an interactive application to review the results of several different QC checks, or create an HTML based report which can be integrated into a pipeline.

Use latest version

module load FastQC

Use specific version

module load FastQC/0.11.9-Java-11

GATK

https://www.broadinstitute.org/gatk/

Package description

The Genome Analysis Toolkit or GATK is a software package developed at the Broad Institute to analyse next-generation resequencing data. The toolkit offers a wide variety of tools, with a primary focus on variant discovery and genotyping as well as strong emphasis on data quality assurance. Its robust architecture, powerful processing engine and high-performance computing features make it capable of taking on projects of any size.

Use latest version

module load GATK

Use specific version

module load GATK/4.2.0.0-GCCcore-10.2.0-Java-11

module load GATK/4.2.3.0-GCCcore-11.2.0-Java-11

GCTA

https://yanglab.westlake.edu.cn/software/gcta/

Package description

GCTA (Genome-wide Complex Trait Analysis) is a software package, which was initially developed to estimate the proportion of phenotypic variance explained by all genome-wide SNPs for a complex trait but has been extensively extended for many other analyses of data from genome-wide association studies (GWASs).

Use latest version

module load GCTA

Use specific version

module load GCTA/1.94.0beta-foss-2021b

GRIDSS

https://github.com/PapenfussLab/gridss

Package description

GRIDSS is a module software suite containing tools useful for the detection of genomic rearrangements. GRIDSS includes a genome-wide break-end assembler, as well as a structural variation caller for Illumina sequencing data. GRIDSS calls variants based on alignment-guided positional de Bruijn graph genome-wide break-end assembly, split read, and read pair evidence.

Use latest version

module load GRIDSS

Use specific version

module load GRIDSS/2.13.1-foss-2021b-Java-11

GROMACS

https://www.gromacs.org

Package description

GROMACS is a versatile package to perform molecular dynamics, i.e. simulate the Newtonian equations of motion for systems with hundreds to millions of particles. This is a GPU enabled build, containing both MPI and threadMPI builds. It also contains the gmxapi extension for the single precision MPI build next to PLUMED.

Use latest version

module load GROMACS

Use specific version

module load GROMACS/2021-fosscuda-2020b-PLUMED-2.7.1

module load GROMACS/2021.5-foss-2021b

module load GROMACS/2022.5-foss-2022a-PLUMED-2.8.3

module load GROMACS/2023-foss-2022a-CUDA-11.7.0-PLUMED-2.9.0

GView

https://github.com/phac-nml/gview-wiki/wiki

Package description

GView is a Java package used to display and navigate bacterial genomes.

Use latest version

module load GView

Use specific version

module load GView/1.7-Java-1.8

HH-suite

https://github.com/soedinglab/hh-suite

Package description

The HH-suite is an open-source software package for sensitive protein sequence searching based on the pairwise alignment of hidden Markov models (HMMs).

Use latest version

module load HH-suite

Use specific version

module load HH-suite/3.2.0-fosscuda-2019b

module load HH-suite/3.3.0-gompi-2021a

module load HH-suite/3.3.0-gompi-2021b

module load HH-suite/3.3.0-gompi-2022a

HISAT2

https://daehwankimlab.github.io/hisat2

Package description

HISAT2 is a fast and sensitive alignment program for mapping next-generation sequencing reads (both DNA and RNA) against the general human population (as well as against a single reference genome).

Use latest version

module load HISAT2

Use specific version

module load HISAT2/2.2.1-gompi-2021b

module load HISAT2/2.2.1-gompi-2022a

HMMER

http://hmmer.org/

Package description

HMMER is used for searching sequence databases for homologs of protein sequences, and for making protein sequence alignments. It implements methods using probabilistic models called profile hidden Markov models (profile HMMs). Compared to BLAST, FASTA, and other sequence alignment and database search tools based on older scoring methodology, HMMER aims to be significantly more accurate and more able to detect remote homologs because of the strength of its underlying mathematical models. In the past, this strength came at significant computational expense, but in the new HMMER3 project, HMMER is now essentially as fast as BLAST.

Use latest version

module load HMMER

Use specific version

module load HMMER/3.3.2-gompi-2021a

module load HMMER/3.3.2-gompi-2021b

module load HMMER/3.3.2-gompi-2022a

HTSlib

https://www.htslib.org/

Package description

A C library for reading/writing high-throughput sequencing data. This package includes the utilities bgzip and tabix

Use latest version

module load HTSlib

Use specific version

module load HTSlib/1.11-GCC-10.2.0

module load HTSlib/1.14-GCC-11.2.0

module load HTSlib/1.15.1-GCC-11.3.0

module load HTSlib/1.17-GCC-12.2.0

HiCExplorer

http://hicexplorer.readthedocs.org/

Package description

HiCexplorer addresses the common tasks of Hi-C analysis from processing to visualization.

Use latest version

module load HiCExplorer

Use specific version

module load HiCExplorer/3.7.2-foss-2022a

HiCMatrix

https://github.com/deeptools/HiCMatrix

Package description

This library implements the central class of HiCExplorer to manage Hi-C interaction matrices.

Use latest version

module load HiCMatrix

Use specific version

module load HiCMatrix/17-foss-2022a

IQ-Tree

http://www.iqtree.org

Package description

Efficient and versatile phylogenomic software by maximum likelihood

Use latest version

module load IQ-Tree

Use specific version

module load IQ-Tree/2.1.3-GCC-11.2.0

Infernal

http://eddylab.org/infernal/

Package description

Infernal ("INFERence of RNA ALignment") is for searching DNA sequence databases for RNA structure and sequence similarities.

Use latest version

module load Infernal

Use specific version

module load Infernal/1.1.4-foss-2022b

Iris

https://github.com/mkirsche/Iris

Package description

A module for improving the insertion sequences of structural variant calls

Use latest version

module load Iris

Use specific version

module load Iris/1.0.5-GCC-11.2.0-Java-15

Jasmine

https://github.com/mkirsche/Jasmine

Package description

SV Merging Across Samples

Use latest version

module load Jasmine

Use specific version

module load Jasmine/1.1.4-GCC-11.2.0-Java-15

Kalign

https://github.com/TimoLassmann/kalign

Package description

Kalign is a fast multiple sequence alignment program for biological sequences.

Use latest version

module load Kalign

Use specific version

module load Kalign/3.3.1-GCCcore-10.3.0

module load Kalign/3.3.2-GCCcore-11.2.0

module load Kalign/3.3.5-GCCcore-11.3.0

LUMPY

https://github.com/arq5x/lumpy-sv

Package description

A probabilistic framework for structural variant discovery.

Use latest version

module load LUMPY

Use specific version

module load LUMPY/0.3.1-foss-2020b

LocalColabFold

https://github.com/YoshitakaMo/localcolabfold

Package description

LocalColabFold is an installer script designed to make ColabFold functionality available on users' local machines.

Use latest version

module load LocalColabFold

Use specific version

module load LocalColabFold/1.5.5-CUDA-12.2.0

MACSE

https://bioweb.supagro.inra.fr/macse/index.php

Package description

MACSE aligns coding NT sequences with respect to their AA translation while allowing NT sequences to contain multiple frameshifts and/or stop codons. MACSE is hence the first automatic solution to align protein-coding gene datasets containing non-functional sequences (pseudogenes) without disrupting the underlying codon structure.

Use latest version

module load MACSE

Use specific version

module load MACSE/2.06-Java-15

MAFFT

https://mafft.cbrc.jp/alignment/software/

Package description

MAFFT is a multiple sequence alignment program for unix-like operating systems. It offers a range of multiple alignment methods, L-INS-i (accurate; for alignment of <~200 sequences), FFT-NS-2 (fast; for alignment of <~30,000 sequences), etc.

Use latest version

module load MAFFT

Use specific version

module load MAFFT/7.490-GCC-11.2.0-with-extensions

module load MAFFT/7.490-gompi-2021b-with-extensions

MM-align

https://zhanggroup.org/MM-align

Package description

MM-align is an algorithm for structurally aligning a pair of multiple-chain protein complexes. The multiple chains in each complex are first joined, in every possible order, and then simultaneously aligned with cross-chain alignments prevented. An optimal alignment between two complexes, as well as the overall TM-score, will be reported.

Use latest version

module load MM-align

Use specific version

module load MM-align/20210816

MMseqs2

https://mmseqs.com

Package description

MMseqs2: ultra fast and sensitive search and clustering suite

Use latest version

module load MMseqs2

Use specific version

module load MMseqs2/14-7e284-gompi-2022a

MUSCLE

https://drive5.com/muscle/

Package description

MUSCLE is one of the best-performing multiple alignment programs according to published benchmark tests, with accuracy and speed that are consistently better than CLUSTALW. MUSCLE can align hundreds of sequences in seconds. Most users learn everything they need to know about MUSCLE in a few minutes-only a handful of command-line options are needed to perform common alignment tasks.

Use latest version

module load MUSCLE

Use specific version

module load MUSCLE/5.1-GCCcore-11.2.0

MaxAlign

https://services.healthtech.dtu.dk/service.php?MaxAlign-1.1

Package description

In phylogenetic analyses, for instance, gapped columns are often discarded entirely from the alignment. MaxAlign is a program that optimizes the alignment prior to such analyses.

Use latest version

module load MaxAlign

Use specific version

module load MaxAlign/1.1-GCCcore-11.2.0

MethylDackel

https://github.com/dpryan79/MethylDackel

Package description

A (mostly) universal methylation extractor for BS-seq experiments.

Use latest version

module load MethylDackel

Use specific version

module load MethylDackel/0.6.1-GCC-11.2.0

MrBayes

https://nbisweden.github.io/MrBayes/

Package description

MrBayes is a program for Bayesian inference and model choice across a wide range of phylogenetic and evolutionary models.

Use latest version

module load MrBayes

Use specific version

module load MrBayes/3.2.7-gompi-2020b

module load MrBayes/3.2.7-gompi-2022b

MultiQC

https://multiqc.info

Package description

Aggregate results from bioinformatics analyses across many samples into a single report. MultiQC searches a given directory for analysis logs and compiles a HTML report. It's a general use tool, perfect for summarising the output from numerous bioinformatics tools.

Use latest version

module load MultiQC

Use specific version

module load MultiQC/1.11-foss-2021b

NGS

https://github.com/ncbi/ngs

Package description

NGS is a new, domain-specific API for accessing reads, alignments and pileups produced from Next Generation Sequencing.

Use latest version

module load NGS

Use specific version

module load NGS/2.11.2-GCCcore-11.2.0

OpenFold

https://github.com/aqlaboratory/openfold

Package description

A faithful PyTorch reproduction of DeepMind's AlphaFold 2

Use latest version

module load OpenFold

Use specific version

module load OpenFold/1.0.1-foss-2021a-CUDA-11.3.1

OpenMM

https://openmm.org

Package description

OpenMM is a toolkit for molecular simulation.

Use latest version

module load OpenMM

Use specific version

module load OpenMM/7.5.1-foss-2021a-CUDA-11.3.1-DeepMind-patch

module load OpenMM/8.0.0-foss-2022a-CUDA-12.2.0

PAML

http://abacus.gene.ucl.ac.uk/software/paml.html

Package description

PAML is a package of programs for phylogenetic analyses of DNA or protein sequences using maximum likelihood.

Use latest version

module load PAML

Use specific version

module load PAML/4.9j-GCCcore-11.2.0

PLINK

https://www.cog-genomics.org/plink/2.0/

Package description

PLINK is a free, open-source whole genome association analysis toolset, designed to perform a range of basic, large-scale analyses in a computationally efficient manner.

Use latest version

module load PLINK

Use specific version

module load PLINK/2.00a3.7-foss-2022a

PRANK

http://wasabiapp.org/software/prank/

Package description

PRANK is a probabilistic multiple alignment program for DNA, codon and amino-acid sequences. PRANK is based on a novel algorithm that treats insertions correctly and avoids over-estimation of the number of deletion events.

Use latest version

module load PRANK

Use specific version

module load PRANK/170427-GCC-11.2.0

PREQUAL

https://github.com/simonwhelan/prequal

Package description

A program to identify and mask regions with non-homologous adjacent characters in FASTA files.

Use latest version

module load PREQUAL

Use specific version

module load PREQUAL/1.02-GCCcore-11.2.0

PSIPRED

http://bioinf.cs.ucl.ac.uk

Package description

Accurate protein secondary structure prediction

Use latest version

module load PSIPRED

Use specific version

module load PSIPRED/4.02-GCC-8.3.0

module load PSIPRED/4.02-GCC-10.2.0

module load PSIPRED/4.02-GCC-12.2.0

Pindel

http://gmt.genome.wustl.edu/packages/pindel/

Package description

Pindel can detect breakpoints of large deletions, medium sized insertions, inversions, tandem duplications and other structural variants at single-based resolution from next-gen sequence data. It uses a pattern growth approach to identify the breakpoints of these variants from paired-end short reads.

Use latest version

module load Pindel

Use specific version

module load Pindel/0.2.5b9-20170508-GCC-11.2.0

PyRosetta

https://www.pyrosetta.org/

Package description

PyRosetta is an interactive Python-based interface to the powerful Rosetta molecular modeling suite. It enables users to design their own custom molecular modeling algorithms using Rosetta sampling methods and energy functions.

Use latest version

module load PyRosetta

Use specific version

module load PyRosetta/4.release-292-GCCcore-8.3.0-Python-3.7.4

Pysam

https://github.com/pysam-developers/pysam

Package description

Pysam is a python module for reading and manipulating Samfiles. It's a lightweight wrapper of the samtools C-API. Pysam also includes an interface for tabix.

Use latest version

module load Pysam

Use specific version

module load Pysam/0.16.0.1-GCC-10.2.0-Python-2.7.18

module load Pysam/0.16.0.1-GCC-10.2.0

module load Pysam/0.18.0-GCC-11.2.0

module load Pysam/0.19.1-GCC-11.3.0

module load Pysam/0.20.0-GCC-11.3.0

Qualimap

http://qualimap.bioinfo.cipf.es/

Package description

Qualimap 2 is a platform-independent application written in Java and R that provides both a Graphical User Inteface (GUI) and a command-line interface to facilitate the quality control of alignment sequencing data and its derivatives like feature counts.

Use latest version

module load Qualimap

Use specific version

module load Qualimap/2.2.1-foss-2021b-R-4.1.2

R-bundle-Bioconductor

https://bioconductor.org

Package description

Bioconductor provides tools for the analysis and coprehension of high-throughput genomic data.

Use latest version

module load R-bundle-Bioconductor

Use specific version

module load R-bundle-Bioconductor/3.12-foss-2020b-R-4.0.3

module load R-bundle-Bioconductor/3.14-foss-2021b-R-4.1.2

module load R-bundle-Bioconductor/3.15-foss-2022a-R-4.2.1

module load R-bundle-Bioconductor/3.16-foss-2022b-R-4.2.2

RAxML

https://github.com/stamatak/standard-RAxML

Package description

RAxML search algorithm for maximum likelihood based inference of phylogenetic trees.

Use latest version

module load RAxML

Use specific version

module load RAxML/8.2.12-gompi-2021b-hybrid-avx2

RFdiffusion

https://github.com/RosettaCommons/RFdiffusion

Package description

RFdiffusion is an open source method for structure generation, with or without conditional information (a motif, target etc). It can perform a whole range of protein design challenges as we have outlined in the RFdiffusion paper.

Use latest version

module load RFdiffusion

Use specific version

module load RFdiffusion/1.1.0-foss-2022a

RSEM

https://deweylab.github.io/RSEM/

Package description

RNA-Seq by Expectation-Maximization

Use latest version

module load RSEM

Use specific version

module load RSEM/1.3.3-foss-2022a

Racon

https://github.com/lbcb-sci/racon

Package description

Ultrafast consensus module for raw de novo genome assembly of long uncorrected reads.

Use latest version

module load Racon

Use specific version

module load Racon/1.5.0-GCCcore-11.2.0

RegTools

https://regtools.readthedocs.org

Package description

RegTools is a set of tools that integrate DNA-seq and RNA-seq data to help interpret mutations in a regulatory and splicing context.

Use latest version

module load RegTools

Use specific version

module load RegTools/1.0.0-foss-2022b

RoseTTAFold2NA

https://github.com/uw-ipd/RoseTTAFold2NA

Package description

RoseTTAFold2 protein/nucleic acid complex prediction

Use latest version

module load RoseTTAFold2NA

Use specific version

module load RoseTTAFold2NA/0.2

Rosetta

https://www.rosettacommons.org

Package description

Rosetta is the premier software suite for modeling macromolecular structures. As a flexible, multi-purpose application, it includes tools for structure prediction, design, and remodeling of proteins and nucleic acids.

Use latest version

module load Rosetta

Use specific version

module load Rosetta/3.13-gompi-2020b

module load Rosetta/3.13-gompi-2022b

SAMtools

https://www.htslib.org/

Package description

SAM Tools provide various utilities for manipulating alignments in the SAM format, including sorting, merging, indexing and generating alignments in a per-position format.

Use latest version

module load SAMtools

Use specific version

module load SAMtools/1.11-GCC-10.2.0

module load SAMtools/1.12-GCC-10.2.0

module load SAMtools/1.14-GCC-11.2.0

SRA-Toolkit

https://github.com/ncbi/sra-tools

Package description

The SRA Toolkit, and the source-code SRA System Development Kit (SDK), will allow you to programmatically access data housed within SRA and convert it from the SRA format

Use latest version

module load SRA-Toolkit

Use specific version

module load SRA-Toolkit/3.0.3-gompi-2022a

STAR

https://github.com/alexdobin/STAR

Package description

STAR aligns RNA-seq reads to a reference genome using uncompressed suffix arrays.

Use latest version

module load STAR

Use specific version

module load STAR/2.7.9a-GCC-11.2.0

module load STAR/2.7.10b-GCC-11.3.0

SURVIVOR

https://github.com/fritzsedlazeck/SURVIVOR

Package description

Toolset for SV simulation, comparison and filtering

Use latest version

module load SURVIVOR

Use specific version

module load SURVIVOR/1.0.7-19-ged1ca51-GCC-11.2.0

SVTyper

https://github.com/hall-lab/svtyper

Package description

SVTyper performs breakpoint genotyping of structural variants (SVs) using whole genome sequencing data.

Use latest version

module load SVTyper

Use specific version

module load SVTyper/0.7.1-foss-2020b-Python-2.7.18

Salmon

https://github.com/COMBINE-lab/salmon

Package description

Salmon is a wicked-fast program to produce a highly-accurate, transcript-level quantification estimates from RNA-seq data.

Use latest version

module load Salmon

Use specific version

module load Salmon/1.7.0-gompi-2021b

module load Salmon/1.9.0-GCC-11.3.0

Sambamba

https://lomereiter.github.io/sambamba/

Package description

Sambamba is a high performance modern robust and fast tool (and library), written in the D programming language, for working with SAM and BAM files. Current functionality is an important subset of samtools functionality, including view, index, sort, markdup, and depth.

Use latest version

module load Sambamba

Use specific version

module load Sambamba/0.8.0-GCC-10.2.0

Subread

https://subread.sourceforge.net/

Package description

High performance read alignment, quantification and mutation discovery

Use latest version

module load Subread

Use specific version

module load Subread/2.0.4-GCC-11.3.0

T-COFFEE

https://tcoffee.org/

Package description

A collection of tools for Computing, Evaluating and Manipulating Multiple Alignments of DNA, RNA, Protein Sequences and Structures.

Use latest version

module load T-COFFEE

Use specific version

module load T-COFFEE/13.45.0.4846264-gompi-2021b

TranslatorX

http://translatorx.co.uk/

Package description

Nucleotide sequence alignment and alignment cleaning based on amino acid information.

Use latest version

module load TranslatorX

Use specific version

module load TranslatorX/1.1-GCCcore-11.2.0

Trim_Galore

https://www.bioinformatics.babraham.ac.uk/projects/trim_galore/

Package description

Trim Galore! is a wrapper script to automate quality and adapter trimming as well as quality control, with some added functionality to remove biased methylation positions for RRBS sequence files (for directional, non-directional (or paired-end) sequencing).

Use latest version

module load Trim_Galore

Use specific version

module load Trim_Galore/0.6.7-GCCcore-11.2.0

module load Trim_Galore/0.6.10-GCCcore-11.3.0

VEP

https://www.ensembl.org/info/docs/tools/vep

Package description

Variant Effect Predictor (VEP) determines the effect of your variants (SNPs, insertions, deletions, CNVs or structural variants) on genes, transcripts, and protein sequence, as well as regulatory regions. Includes EnsEMBL-XS, which provides pre-compiled replacements for frequently used routines in VEP.

Use latest version

module load VEP

Use specific version

module load VEP/103.1-GCC-11.2.0

bx-python

https://github.com/bxlab/bx-python

Package description

The bx-python project is a Python library and associated set of scripts to allow for rapid implementation of genome scale analyses.

Use latest version

module load bx-python

Use specific version

module load bx-python/0.9.0-foss-2022a

cooler

https://open2c.github.io/cooler

Package description

Cooler is a support library for a storage format, also called cooler, used to store genomic interaction data of any size, such as Hi-C contact matrices.

Use latest version

module load cooler

Use specific version

module load cooler/0.9.1-foss-2022a

cutadapt

https://opensource.scilifelab.se/projects/cutadapt/

Package description

Cutadapt finds and removes adapter sequences, primers, poly-A tails and other types of unwanted sequence from your high-throughput sequencing reads.

Use latest version

module load cutadapt

Use specific version

module load cutadapt/3.5-GCCcore-11.2.0

module load cutadapt/4.2-GCCcore-11.3.0

e3nn

https://e3nn.org/

Package description

Euclidean neural networks (e3nn) is a python library based on pytorch to create equivariant neural networks for the group O(3).

Use latest version

module load e3nn

Use specific version

module load e3nn/0.3.3-foss-2022a

gsort

https://github.com/brentp/gsort

Package description

gsort is a tool to sort genomic files according to a genomefile

Use latest version

module load gsort

Use specific version

module load gsort/0.1.4

kallisto

https://pachterlab.github.io/kallisto/

Package description

kallisto is a program for quantifying abundances of transcripts from RNA-Seq data, or more generally of target sequences using high-throughput sequencing reads.

Use latest version

module load kallisto

Use specific version

module load kallisto/0.48.0-gompi-2022a

lDDT

https://openstructure.org

Package description

The local Distance Difference Test (lDDT) is a superposition-free score which evaluates local distance differences in a model compared to a reference structure.

Use latest version

module load lDDT

Use specific version

module load lDDT/1.2

mTM-align

http://yanglab.nankai.edu.cn/mTM-align/

Package description

mTM-align is a server for for efficient protein structure comparisons, which includes are two related modules. The first is for fast database search with one input structure. The second is for multiple structure alignment with two or more input structures. For the first module, it takes about 2-5 minutes for a structure of a medium size (~300 residues). After the search is done, a multiple structure alignment is performed automatically with the top 10 structure, using the second module. The users are also able to select other structures from the returned list to perform multiple structure alignment. For the second module, it takes a few seconds for input of ~10 structures of medium size.

Use latest version

module load mTM-align

Use specific version

module load mTM-align/20220104

manta

https://github.com/Illumina/manta

Package description

Manta calls structural variants (SVs) and indels from mapped paired-end sequencing reads. It is optimized for analysis of germline variation in small sets of individuals and somatic variation in tumor/normal sample pairs. Manta discovers, assembles and scores large-scale SVs, medium-sized indels and large insertions within a single efficient workflow.

Use latest version

module load manta

Use specific version

module load manta/1.6.0-GCC-10.2.0-Python-2.7.18

minimap2

https://github.com/lh3/minimap2

Package description

Minimap2 is a fast sequence mapping and alignment program that can find overlaps between long noisy reads, or map long reads or their assemblies to a reference genome optionally with detailed alignment (i.e. CIGAR). At present, it works efficiently with query sequences from a few kilobases to ~100 megabases in length at an error rate ~15%. Minimap2 outputs in the PAF or the SAM format. On limited test data sets, minimap2 is over 20 times faster than most other long-read aligners. It will replace BWA-MEM for long reads and contig alignment.

Use latest version

module load minimap2

Use specific version

module load minimap2/2.22-GCCcore-11.2.0

mosdepth

https://github.com/brentp/mosdepth

Package description

Fast BAM/CRAM depth calculation for WGS, exome, or targeted sequencing

Use latest version

module load mosdepth

Use specific version

module load mosdepth/0.3.1-GCC-10.2.0

module load mosdepth/0.3.3-GCC-11.2.0

ncbi-vdb

https://github.com/ncbi/ncbi-vdb

Package description

The SRA Toolkit and SDK from NCBI is a collection of tools and libraries for using data in the INSDC Sequence Read Archives.

Use latest version

module load ncbi-vdb

Use specific version

module load ncbi-vdb/2.11.2-gompi-2021b

module load ncbi-vdb/3.0.2-gompi-2022a

parabricks

https://docs.nvidia.com/clara/parabricks/v3.6.1/text/software_overview.html#what-is-clara-parabricks

Package description

Parabricks is a software suite for performing secondary analysis of next generation sequencing (NGS) DNA and RNA data.

Use latest version

module load parabricks

Use specific version

module load parabricks/3.6.1

picard

https://broadinstitute.github.io/picard/

Package description

A set of tools (in Java) for working with next generation sequencing data in the BAM format.

Use latest version

module load picard

Use specific version

module load picard/2.26.10-Java-15

preseq

https://smithlabresearch.org/software/preseq

Package description

Software for predicting library complexity and genome coverage in high-throughput sequencing.

Use latest version

module load preseq

Use specific version

module load preseq/3.1.2-GCC-11.2.0

prodigal

https://github.com/hyattpd/Prodigal/

Package description

Prodigal (Prokaryotic Dynamic Programming Genefinding Algorithm) is a microbial (bacterial and archaeal) gene finding program developed at Oak Ridge National Laboratory and the University of Tennessee.

Use latest version

module load prodigal

Use specific version

module load prodigal/2.6.3-GCCcore-12.2.0

pyBigWig

https://github.com/deeptools/pyBigWig

Package description

A python extension, written in C, for quick access to bigBed files and access to and creation of bigWig files.

Use latest version

module load pyBigWig

Use specific version

module load pyBigWig/0.3.18-foss-2022a

pyGenomeTracks

https://pygenometracks.readthedocs.io

Package description

pyGenomeTracks aims to produce high-quality genome browser tracks that are highly customizable.

Use latest version

module load pyGenomeTracks

Use specific version

module load pyGenomeTracks/3.8-foss-2022a

pybedtools

https://daler.github.io/pybedtools

Package description

pybedtools wraps and extends BEDTools and offers feature-level manipulations from within Python.

Use latest version

module load pybedtools

Use specific version

module load pybedtools/0.9.0-GCC-11.3.0

pyfaidx

https://pypi.python.org/pypi/pyfaidx

Package description

pyfaidx: efficient pythonic random access to fasta subsequences

Use latest version

module load pyfaidx

Use specific version

module load pyfaidx/0.5.9.5-GCCcore-10.2.0

module load pyfaidx/0.7.0-GCCcore-11.2.0

module load pyfaidx/0.7.1-GCCcore-11.3.0

samblaster

https://github.com/GregoryFaust/samblaster

Package description

samblaster is a fast and flexible program for marking duplicates in read-id grouped1 paired-end SAM files.

Use latest version

module load samblaster

Use specific version

module load samblaster/0.1.26-GCC-10.2.0

smoove

https://github.com/brentp/smoove

Package description

smoove simplifies and speeds calling and genotyping SVs for short reads. It also improves specificity by removing many spurious alignment signals that are indicative of low-level noise and often contribute to spurious calls.

Use latest version

module load smoove

Use specific version

module load smoove/0.2.6-foss-2020b-Python-2.7.18

strelka

https://github.com/Illumina/strelka

Package description

Strelka2 is a fast and accurate small variant caller optimized for analysis of germline variation in small cohorts and somatic variation in tumor/normal sample pairs.

Use latest version

module load strelka

Use specific version

module load strelka/2.9.10-GCCcore-11.3.0-Python-2.7.18

svaba

https://github.com/walaj/svaba

Package description

SvABA - Structural variation and indel analysis by assembly

Use latest version

module load svaba

Use specific version

module load svaba/1.1.0-foss-2020b

module load svaba/1.1.0-foss-2022b

svtools

https://github.com/hall-lab/svtools

Package description

svtools is a suite of utilities designed to help bioinformaticians construct and explore cohort-level structural variation calls.

Use latest version

module load svtools

Use specific version

module load svtools/0.5.1-foss-2020b-Python-2.7.18

tMAE

https://github.com/mumichae/tMAE

Package description

Package containing functions to: perform a negative binomial test on allele-specific counts add gnomAD minor allele frequencies MAplot (FC vs total counts) of allele-specific counts and results allelic counts (ALT vs REF)

Use latest version

module load tMAE

Use specific version

module load tMAE/1.0.1-foss-2021b-R-4.1.2

wham

https://github.com/zeeev/wham

Package description

duphold: scalable, depth-based annotation and curation of high-confidence structural variant calls

Use latest version

module load wham

Use specific version

module load wham/1.8.0-GCC-10.2.0