HPC Tutorial No1

Latest revision as of 13:28, 6 January 2014

This page collects material related to the first tutorial on "High Performance Computing" (HPC), which took place at the University of Oldenburg from 19.11.2013 to 21.11.2013. The tutorial was split into three sessions. The first two sessions covered parts I and II (listed below) and were held in:

Seminar room W04 1-162:
Tue, 19.11.13, 14:00-16:00
Wed, 20.11.13, 16:00-18:00

The third session (part III) consisted of practical exercises meant to illustrate some of the content presented in the earlier parts, and was held in:

Computer lab W01 0-008:
Thu, 21.11.13, 09:00-12:00

The target audience of this first HPC tutorial was new users of the local HPC system. To benefit from the tutorial, it helped to be able to read and write C programs.

The full program of this first HPC tutorial was as follows:

Part I

1. Cluster Overview:

  1. System Overview
  2. Available compilers
  3. Modification of user environments via "module"
  4. Available parallel environments
  5. Available Libraries


NOTES:

  • The slides for the presentation are available in the wiki file Introduction_To_HPC_On_FLOW_and_HERO.pdf.

Part II

2. Introduction to the usage of SGE:

  1. General Job submission (specifying job requirements)
  2. Single Slot jobs (how to compile, submit, and monitor status)
  3. Parallel Jobs (openMPI, impi, smp)
  4. Monitoring and Controlling jobs (qstat, qrsh, qacct)

3. Debugging:

  1. Compiling programs for debugging
  2. Tracking memory issues
  3. Profiling

4. Misc:

  1. Logging in from outside the university
  2. Mounting the HPC home directory
  3. Parallel environment memory issue
  4. Importance of allocating proper resources

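NOTES:

  • The slides for the presentation are available in the wiki file HPC_tutorial_pt2.pdf.
  • The content of part II closely follows the HPC User-Wiki entry Brief Introduction to HPC Computing.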

Part III

5. Exercises (Computer-Lab):

  1. Simple example programs
  2. Practical aspects not covered in part II

NOTES:

  • The example programs can be downloaded as the wiki file HPC_tutorial_examples.tar.
  • Brief documentation of some of the examples (i.e. those not discussed on the Brief Introduction to HPC Computing wiki page) can be found on the wiki page HPC_Tutorial_Part3:_Examples.
  • In part III we also discussed how to monitor the current resource usage of a running job using an interactive session. A brief writeup of the whole procedure can now be found in the section "Monitoring the current resource usage for a running job" of the Brief Introduction to HPC Computing wiki page.