Burrows-Wheeler Aligner

From HPC users
Jump to navigationJump to search

Introduction

BWA is a software package for mapping low-divergent sequences against a large reference genome, such as the human genome. It consists of three algorithms: BWA-backtrack, BWA-SW and BWA-MEM. The first algorithm is designed for Illumina sequence reads up to 100bp, while the rest two for longer sequences ranged from 70bp to 1Mbp. BWA-MEM and BWA-SW share similar features such as long-read support and split alignment, but BWA-MEM, which is the latest, is generally recommended for high-quality queries as it is faster and more accurate. BWA-MEM also has better performance than BWA-backtrack for 70-100bp Illumina reads.

Installed version

The currently installed version of BWA is 0.7.12.

Using BWA

If you want to find out more about BWA on the HPC cluster, you can use the command

module spider bwa.kit

This will show you basic informations e.g. a short description and the currently installed version.

To load the module, use the command

module load bwa.kit

There is only one version of BWA installed at the moment, if there should be more than one you would have to specify the right version to load:

module load bwa.kit/0.7.12

Always remember: this command is case sensitive!

Using BWA with the HPC cluster

If you want to use BWA on the HPC cluster, you will have to create a jobscript which is than processed by SLURM.

An example jobscript could look like this:

#!/bin/bash 

#SBATCH --ntasks=1
#SBATCH --mem=2G
#SBATCH --time=0-2:00  
#SBATCH --job-name=BWA-TEST
#SBATCH --output=bwa-test.%j.out        
#SBATCH --error=bwa-test.%j.err   

##YOUR SCRIPT STARTS HERE
bwa ..

You can find a short manual on how to use bwa here.

Documentation

For further informations, please visit the official manual.