OpenMP
Arash Bakhtiari
bakhtiar@in.tum.de
2012-12-18 Tue
Introduction

- Chip manufacturers are rapidly moving to multi-core CPUs.

Figure: Quad-core Intel Sandy Bridge processor
Shared Memory Model

- All processors can access all memory in a global address space.
- Threads model: a single process can have multiple, concurrent execution paths.
- On a multi-core system, the threads run at the same time, with each core running a particular thread or task.

Figure: Shared Memory Model [1]
What is OpenMP?

- An Application Program Interface (API)
- Used to explicitly direct multi-threaded, shared-memory parallelism
- Provides a portable, scalable model
- Supports C/C++ and Fortran on a wide variety of architectures
Fork-Join Model

- An OpenMP program starts as a single thread.
- Additional threads (the team) are created when the master thread hits a parallel region.
- When all threads have finished the parallel region, the extra threads are given back to the runtime or operating system.
- The master thread continues after the parallel region.
Fork-Join Model (cont.)

Figure: Fork-Join Model [1]
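To see the fork and join in code, here is a minimal sketch (not from the original slides) that prints the team size before, inside, and after a parallel region; the size inside depends on OMP_NUM_THREADS and the runtime:

#include <iostream>
#include <omp.h>

int main()
{
    // Serial part: only the master thread exists, so the team size is 1.
    std::cout << "before: " << omp_get_num_threads() << " thread(s)\n";

#pragma omp parallel
    {
        // Fork: the master plus the additional team threads run this block.
#pragma omp master
        std::cout << "inside: " << omp_get_num_threads() << " thread(s)\n";
    }

    // Join: the extra threads are released; only the master continues.
    std::cout << "after: " << omp_get_num_threads() << " thread(s)\n";
    return 0;
}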
OpenMP API

Primary API components:

- Compiler directives:
  #pragma omp parallel
- Run-time library routines:
  int omp_get_num_threads(void);
- Environment variables:
  export OMP_NUM_THREADS=2
Example

Listing 1: OpenMP Hello World!

#include <iostream>
#include <omp.h>

int main(int argc, char *argv[])
{
#pragma omp parallel
    {
        // Each thread in the team executes this block once.
        std::cout << "THREAD: " << omp_get_thread_num() << "\tHello, World!\n";
    }
    return 0;
}
Listing 2: Compiling

g++ -o hello hello.c -fopenmp
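Running the compiled program with, e.g., OMP_NUM_THREADS=2 prints one line per thread; the order is nondeterministic, for example:

THREAD: 0    Hello, World!
THREAD: 1    Hello, World!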
Classification of Variables

- private(var-list): Variables in var-list are private to each thread.
- shared(var-list): Variables in var-list are shared among all threads.
- default(private | shared | none): Sets the default for all variables in this region. (In C/C++, only default(shared) and default(none) are allowed; default(private) exists only in Fortran.)
Example

Listing 3: OpenMP Private Variable

#include <iostream>
#include <omp.h>

int main(int argc, char *argv[])
{
    int i, j;
    i = 1;
    j = 2;
    std::cout << "BEFORE: i, j = " << i << ", " << j << std::endl;
#pragma omp parallel private(i)
    {
        i = 3;  // assigns each thread's own private copy of i
        j = 5;  // j is shared: all threads write to the same variable
        std::cout << "INLOOP: i, j = " << i << ", " << j << std::endl;
    }
    std::cout << "AFTER: i, j = " << i << ", " << j << std::endl;
    return 0;
}
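Since i is private, each thread works on its own copy and the assignment inside the region does not touch the outer i; j is shared, so the change is visible afterwards, and AFTER prints i, j = 1, 5. (The concurrent writes to the shared j are harmless here only because every thread writes the same value; in general such unsynchronized writes are a data race.)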
Work-Sharing Constructs

Work-sharing constructs distribute the specified work to all threads within the current team.

Types (see the sketch after this list):
- Parallel loop
- Parallel section
- Master region
- Single region
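As a rough illustration (not from the original slides), the sketch below combines three of these constructs in one parallel region: each section is executed by some thread of the team, the single block by exactly one thread, and the master block only by thread 0:

#include <iostream>
#include <omp.h>

int main()
{
#pragma omp parallel
    {
#pragma omp sections
        {
#pragma omp section
            std::cout << "section A on thread " << omp_get_thread_num() << "\n";
#pragma omp section
            std::cout << "section B on thread " << omp_get_thread_num() << "\n";
        }  // implicit barrier at the end of the sections construct

#pragma omp single
        std::cout << "single on thread " << omp_get_thread_num() << "\n";
        // implicit barrier at the end of single

#pragma omp master
        std::cout << "master on thread " << omp_get_thread_num() << "\n";
        // no implicit barrier after master
    }
    return 0;
}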
Parallel Loop

Syntax:

#pragma omp for [clause ...]

- The iterations of the loop are distributed to the threads.
- Possible scheduling strategies for the loop iterations: static, dynamic, guided, and runtime.
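A minimal sketch (not from the slides) of the combined parallel for form; the loop variable of the associated loop is implicitly private:

#include <iostream>
#include <omp.h>

int main()
{
    const int n = 8;
    int a[n];

    // Fork a team and split the iterations among its threads.
#pragma omp parallel for
    for (int i = 0; i < n; i++)
        a[i] = i * i;

    for (int i = 0; i < n; i++)
        std::cout << a[i] << " ";
    std::cout << std::endl;
    return 0;
}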
Scheduling Strategies

Schedule clause:

schedule(type [, size])

- static: Chunks of the specified size are assigned in a round-robin fashion to the threads.
- dynamic: The iterations are broken into chunks of the specified size. When a thread finishes the execution of a chunk, the next chunk is assigned to that thread.
- guided: Similar to dynamic, but the size of the chunks decreases exponentially. The size parameter specifies the smallest chunk; the initial chunk size is implementation dependent.
- runtime: The scheduling type and the chunk size are determined via environment variables.
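With schedule(runtime), the choice is deferred to program start and read from the standard OMP_SCHEDULE environment variable, for example:

export OMP_SCHEDULE="dynamic,4"
./hello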
Example

Listing 4: OpenMP Loop Scheduling

#include <iostream>
#include <cstdlib>
#include <ctime>
#include <omp.h>

#define CHUNKSIZE 100
#define N 1000

// Helper presumed defined elsewhere in the original slides; a minimal
// definition is supplied here so that the listing compiles.
double generate_random_double(double lo, double hi)
{
    return lo + (hi - lo) * rand() / RAND_MAX;
}

int main()
{
    int i, chunk;
    double a[N], b[N], c[N];

    srand(time(NULL));
    for (i = 0; i < N; i++) {
        a[i] = generate_random_double(0.0, 10.0);
        b[i] = generate_random_double(0.0, 10.0);
    }
    chunk = CHUNKSIZE;

#pragma omp parallel shared(a, b, c, chunk) private(i)
    {
#pragma omp for schedule(dynamic, chunk) nowait
        for (i = 0; i < N; i++)
            c[i] = a[i] + b[i];
    }
    return 0;
}
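The nowait clause removes the implicit barrier at the end of the for construct, so threads that finish their chunks early do not wait for the others; this is safe here because nothing else inside the parallel region reads c after the loop.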
References

[1] Blaise Barney, Lawrence Livermore National Laboratory,
    https://computing.llnl.gov/tutorials/openMP/