
Multicore & GPU Programming: An Integrated Approach

Shared-Memory Programming: OpenMP

By G. Barlas
Modifications by H. Weber

Objectives

! Learn how to use OpenMP compiler directives to introduce concurrency in a sequential program.
! Learn the most important OpenMP #pragma directives and associated clauses, for controlling the concurrent constructs generated by the compiler.
! Understand which loops can be parallelized with OpenMP directives.
! Address the dependency issues that OpenMP-generated threads face, using synchronization constructs.
! Learn how to use OpenMP to create function-parallel programs.
! Learn how to write thread-safe functions.
! Understand the issue of false sharing in caches and learn how to eliminate it.



Introduction

! The decomposition of a sequential program into components that can execute in parallel is a tedious enterprise.
! OpenMP has been designed to alleviate much of the effort involved, by accommodating the incremental conversion of sequential programs into parallel ones, with the assistance of the compiler.
! OpenMP relies on compiler directives for decorating portions of the code that the compiler will attempt to parallelize.


OpenMP History

! OpenMP (Open Multi-Processing) is an API for shared-memory programming.
! OpenMP was specifically designed for parallelizing existing sequential programs.
! It uses compiler directives and a library of functions to support its operation.
! OpenMP v1.0 was published in 1998.
! OpenMP v4.0 was published in 2013.
! The standard is controlled by the OpenMP Architecture Review Board (ARB).
! GNU C support:
− GCC 4.7 supports the OpenMP 3.1 specification.
− GCC 4.9 supports OpenMP 4.0.



OpenMP Paradigm

! OpenMP programs are Globally Sequential, Locally Parallel.
! Programs follow the fork-join paradigm:


OpenMP Essential Definitions

! Structured block: an executable statement or a compound block, with a single point of entry and a single point of exit.
! Construct: an OpenMP directive and the associated statement, for-loop or structured block that it controls.
! Region: all code encountered during the execution of a construct, including any called functions.
! Parallel region: a region executed simultaneously by multiple threads.
! A region is dynamic, but a construct is static.
! Master thread: the thread executing the sequential part of the program and spawning the child threads.
! Thread team: a set of threads that execute a parallel region.



„Hello World“ in OpenMP

! Can you match some of the previous definitions with parts of the program sketched below?
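The slide's listing is not reproduced in this transcript; the following is a minimal sketch of what it plausibly looks like, given that the next slide says the example uses a num_threads(numThr) clause. Reading numThr from the command line is an assumption.

#include <omp.h>
#include <cstdio>
#include <cstdlib>

int main (int argc, char *argv[])
{
    // Assumption: the team size numThr is taken from the command line.
    int numThr = (argc > 1) ? atoi (argv[1]) : 4;

    // Construct: this directive plus the structured block it controls.
    // At run time, the master thread forks a team of numThr threads here.
    #pragma omp parallel num_threads(numThr)
    {
        // Parallel region: executed simultaneously by every thread in the team.
        printf ("Hello from thread %i of %i\n",
                omp_get_thread_num (), omp_get_num_threads ());
    }
    // Implicit join: only the master thread continues past this point.
    return 0;
}

With GCC, OpenMP programs are compiled with the -fopenmp flag, e.g. g++ -fopenmp hello.cpp.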

„Hello World“ Sequence Diagram

! One of the possible execution sequences: (sequence diagram not reproduced here)



#pragma directives

! Pragma directives allow a programmer to access compiler-specific preprocessor extensions.
! For example, a common use of pragmas is in the management of include files, e.g.
#pragma once
! Pragma directives in OpenMP can have a number of optional clauses that modify their behavior. Several clauses can appear on a single directive, as sketched below.
! In the previous example the clause is num_threads(numThr).
! Compilers that do not support certain pragma directives ignore them.
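As an illustration (not from the slide), here is a complete toy program combining two standard OpenMP clauses on one directive; if() and num_threads() are both part of the OpenMP specification:

#include <omp.h>
#include <cstdio>

int main ()
{
    int n = 5000;
    // Two optional clauses on one directive: if() enables parallel
    // execution only when the condition holds; num_threads() requests
    // the team size.
    #pragma omp parallel if(n > 1000) num_threads(3)
    {
        printf ("thread %i of %i\n",
                omp_get_thread_num (), omp_get_num_threads ());
    }
    return 0;
}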


Thread Team Size Control

! Universally: via the OMP_NUM_THREADS environment variable:
$ echo ${OMP_NUM_THREADS} # to query the value
$ export OMP_NUM_THREADS=4 # to set it in BASH
! Program level: via the omp_set_num_threads function, called outside an OpenMP construct.
! Pragma level: via the num_threads clause.
! The omp_get_num_threads call returns the number of active threads in a parallel region. If it is called in a sequential part, it returns 1 (see the sketch below).
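A minimal sketch (not from the slides) exercising the program-level and pragma-level controls. The num_threads clause takes precedence over omp_set_num_threads, which in turn overrides OMP_NUM_THREADS:

#include <omp.h>
#include <cstdio>

int main ()
{
    // In a sequential part, omp_get_num_threads() returns 1.
    printf ("sequential part: %i thread(s)\n", omp_get_num_threads ());

    omp_set_num_threads (2);             // program-level control

    #pragma omp parallel                 // team of 2, from omp_set_num_threads
    {
        if (omp_get_thread_num () == 0)
            printf ("first region : %i threads\n", omp_get_num_threads ());
    }

    #pragma omp parallel num_threads(4)  // pragma-level: the clause wins
    {
        if (omp_get_thread_num () == 0)
            printf ("second region: %i threads\n", omp_get_num_threads ());
    }
    return 0;
}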



Variable Scope
! Outside the parallel regions, normal scope rules apply.
! OpenMP specifies the following types of variables:
− Shared: all variables declared outside a parallel region are shared by default. That does not mean that they are in any way "protected".
− Private: all variables declared inside a parallel region are allocated on the run-time stack of each thread, so there are as many copies of these variables as there are threads in the team. Private variables are destroyed upon the termination of a parallel region.
− Reduction: a reduction variable gets an individual copy for each thread running the corresponding parallel region. Upon the termination of the parallel region, an operation (e.g. summation) is applied to the individual copies to produce the value that will be stored in the shared variable.
! The default scope of variables can be modified by clauses in the pragma lines, as in the sketch below.
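A minimal sketch (not from the slides) showing the shared, private, and reduction clauses overriding the default scope rules:

#include <omp.h>
#include <cstdio>

int main ()
{
    int n = 100;       // shared by default (declared outside the region)
    int scratch = -1;  // made private below: each thread gets its own copy
    int sum = 0;       // reduction variable: per-thread copies, summed at the end

    #pragma omp parallel num_threads(4) shared(n) private(scratch) reduction(+:sum)
    {
        scratch = omp_get_thread_num ();  // the private copy; its initial value is undefined
        sum += scratch;                   // accumulates into this thread's private copy
    }
    // With a team of 4: sum == 0 + 1 + 2 + 3 == 6.
    // The original scratch is untouched by the private copies, so it is still -1.
    printf ("sum = %i, scratch = %i\n", sum, scratch);
    return 0;
}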

Parallel Function Integration

$$\int_{start}^{end} f(x)\,dx \;\approx\; \sum_{i=0}^{n-1} step \cdot \frac{f(x_i) + f(x_{i+1})}{2} \;=\; step \cdot \left( \frac{f(start) + f(end)}{2} + \sum_{i=1}^{n-1} f(x_i) \right)$$

where $x_0 = start$, $x_n = end$, and $step = (end - start)/n$.



Example: Function integrate()

! The sequential implementation (see integrate_seq.cpp):

double integrate (double st, double en, int div, double (*f) (double))
{
    double localRes = 0;
    double step = (en - st) / div;
    double x;

    x = st;
    localRes = f (st) + f (en);  // the two endpoints carry a weight of 1/2
    localRes /= 2;
    for (int i = 1; i < div; i++)
    {
        x += step;
        localRes += f (x);       // interior points carry a weight of 1
    }
    localRes *= step;

    return localRes;
}
//---------------------------------------
int main (int argc, char *argv[])
{
    . . .
    double finalRes = integrate (start, end, divisions, testf);

    cout << finalRes << endl;

OpenMP V.0: Manual partitioning

! Given the ID of each thread, we can calculate the sub-range each thread should integrate (see the sketch below).
! Race condition!
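The slide's listing is not reproduced; the following is an assumed reconstruction of what manual partitioning might look like, reusing integrate() from the previous slide (its definition is omitted here). Note the unsynchronized update of the shared accumulator, which is the race condition flagged above:

#include <omp.h>
#include <cstdio>

// Defined on the previous slide (integrate_seq.cpp).
double integrate (double st, double en, int div, double (*f) (double));
double testf (double x) { return x * x; }  // hypothetical test function

int main ()
{
    double start = 0, end = 1;
    int divisions = 1000000, numThr = 4;
    double finalRes = 0;  // shared accumulator

    #pragma omp parallel num_threads(numThr)
    {
        int id = omp_get_thread_num ();
        int nt = omp_get_num_threads ();
        int chunk = divisions / nt;  // assuming nt divides divisions evenly
        double step = (end - start) / divisions;
        double localStart = start + id * chunk * step;
        double localEnd = localStart + chunk * step;

        // RACE: concurrent read-modify-write of the shared finalRes.
        finalRes += integrate (localStart, localEnd, chunk, testf);
    }
    printf ("%f\n", finalRes);
    return 0;
}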
OpenMP V.1: Removing the race cond.

! Give each thread its own private storage; a sequential reduction is required afterwards (see the sketch below).

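Again an assumed reconstruction, not the slide's own listing: each thread writes its partial result to its own array slot, so no two threads touch the same location, and the master thread reduces the partial results sequentially after the implicit join.

#include <omp.h>
#include <cstdio>

// Defined on the integrate_seq.cpp slide.
double integrate (double st, double en, int div, double (*f) (double));
double testf (double x) { return x * x; }  // hypothetical test function

int main ()
{
    double start = 0, end = 1;
    int divisions = 1000000, numThr = 4;
    double partial[16] = {0};  // hypothetical upper bound on the team size

    #pragma omp parallel num_threads(numThr)
    {
        int id = omp_get_thread_num ();
        int nt = omp_get_num_threads ();
        int chunk = divisions / nt;  // assuming nt divides divisions evenly
        double step = (end - start) / divisions;
        double localStart = start + id * chunk * step;

        // Each thread has exclusive ownership of partial[id]: no race.
        partial[id] = integrate (localStart, localStart + chunk * step, chunk, testf);
    }

    // Sequential reduction: safe, since the thread team has already joined.
    double finalRes = 0;
    for (int i = 0; i < numThr; i++)
        finalRes += partial[i];
    printf ("%f\n", finalRes);
    return 0;
}

Note that adjacent partial[] slots typically sit on the same cache line, so this layout invites exactly the false sharing mentioned in the objectives.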
