Difference between revisions of "BioPerl"

From HPC users
Jump to navigationJump to search
Line 52: Line 52:


After job has finished (takes about 10-20 seconds), you will find three new files in the directory you started the job from:
After job has finished (takes about 10-20 seconds), you will find three new files in the directory you started the job from:
*''sequence.fasta
*''sequence.fasta''
*bioperl-test.JOBID.err
*''bioperl-test.JOBID.err''
*bioperl-test.JOBID.out''
*''bioperl-test.JOBID.out''


If the file "sequence.fasta" contains this:
If the file "sequence.fasta" contains this:

Revision as of 13:55, 15 May 2017

Introduction

BioPerl is a collection of Perl modules that facilitate the development of Perl scripts for bioinformatics applications. It has played an integral role in the Human Genome Project.

Installed version

The currently installed version is 1.7.0.

Using BioPerl

If you want to use BioPerl on the cluster, you will have to load its corresponding module first. You can do that with the command:

module load BioPerl

After that BioPerl is loaded and operational.

Using BioPerl on the cluster

If you want to run jobs on the cluster, you will have to create a jobscript which needs to be submitted to SLURM.

An example jobscript for BioPerl could look like this:

#!/bin/bash 

#SBATCH --ntasks=1
#SBATCH --mem=2G
#SBATCH --time=0-2:00  
#SBATCH --job-name=BIOPERL-TEST
#SBATCH --output=bioperl-test.%j.out        
#SBATCH --error=bioperl-test.%j.err   

module load BioPerl
perl seqio.pl

If you want to run this example, please create a file with the bioperl-test.job containing the code above. You will also need to create a file called "seqio.pl" with the following content:

#!/bin/perl -w

use Bio::Seq; use Bio::SeqIO; 

$seq_obj = Bio::Seq->new(-seq=>"aaaatgggggggggggccccgtt",
                         -display_id => "#12345",
                         -desc => "example 1",
                         -alphabet => "dna" ); 

$seqio_obj = Bio::SeqIO->new(-file => '>sequence.fasta',
                             -format => 'fasta' ); 

$seqio_obj->write_seq($seq_obj);

If you created both files you can submit your job with the following command:

sbatch -p carl.p bioperl-test.job

After job has finished (takes about 10-20 seconds), you will find three new files in the directory you started the job from:

  • sequence.fasta
  • bioperl-test.JOBID.err
  • bioperl-test.JOBID.out

If the file "sequence.fasta" contains this:

>#12345 example 1
aaaatgggggggggggccccgtt

everything went fine and you can start to create your own scripts.

Documentation

The full documentation can be found here.