Pitfalls and advanced OpenMP
Lukas Einkemmer
Department of Mathematics
University of Innsbruck
Shared memory parallelization with OpenMP – Day 2.
Link to slides: http://www.einkemmer.net/training.html
With special thanks to Rolf Rabenseifner (HLRS) on whose original slide
set parts of this course are based.
How to write correct OpenMP programs
Pitfalls
OpenMP is easy to write, but it is also easy to get wrong.
Our goal is to discuss common pitfalls and best practice to
avoid errors in OpenMP code.
Synchronization
A synchronization point in a parallel program coordinates the
work of two or more threads.
Types of synchronization points:
I Barrier: execution of the program cannot continue until all
threads have reached the barrier.
I Critical (and atomic): only one thread can execute the
critical region at a time.
I Lock functions: fine-grained control over synchronization.
Example of a barrier in OpenMP
// code
#pragma omp barrier
// no thread can execute this code until all threads
// have reached the barrier
OpenMP memory model
WRONG!
bool wait = false;
#pragma omp parallel for
for(int i=0;i<n;i++) {
    // busy wait
    while(wait)
        ;
    wait = true;
    // do some work
    wait = false;
}
The code tries to emulate a critical region.
The program is wrong because we have a race condition.
I Each thread reads and writes to the shared variable wait.
OpenMP memory model
The program, most likely, stops making any progress.
I This is called a deadlock.
Naive way to think about this program (two threads):
I Thread 0 finds wait == false, skips the busy wait, sets wait = true,
does its work, and finally sets wait = false.
I Thread 1 meanwhile spins in while(true); until thread 0 resets wait,
then sets wait = true and does its own work.
Under this naive reading the busy wait would behave like a critical region.
OpenMP memory model
The naive analysis is not correct. The code
while(wait)
;
compiles to
.L4:
jmp .L4
The result of the compilation is an infinite loop: the compiler keeps
wait in a register and hoists the test out of the loop.
Compiler Explorer: https://godbolt.org.
Full example: https://godbolt.org/z/5xvBcC.
OpenMP memory model
Accessing a shared variable from memory
[Figure: a shared variable can simultaneously live in main memory, in each
core's cache, and in a CPU register; thread 0 and thread 1 may therefore
see different values at the same time.]
OpenMP assumes that each thread can operate as if it were
executed sequentially.
In a sequential program
wait = true;
while(wait) ;
is equivalent to
while(true) ;
From a performance perspective, this is the only choice.
OpenMP memory model
At some point in a program a consistent view of memory is
required.
I This is called a flush.
A flush can be done explicitly by the
#pragma omp flush
directive. Explicit flushes are almost never necessary.
A flush is very expensive.
I All data in registers and caches have to be transferred back to
main memory.
I Frequent flushes thus remove the performance benefit of the
memory hierarchy.
OpenMP memory model
A flush is implied at
I barrier
I beginning and end of critical
I beginning and end of a parallel region
I end of a worksharing construct (for, do, sections, single,
workshare)
I immediately before and after a task scheduling point
No flush is implied at
I beginning of a worksharing construct (for, do, sections, single,
workshare)
I beginning and end of master
Recommendation: Use OpenMP directives (such as critical
regions) for synchronization. Avoid lock functions.
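For example, the broken busy-wait from the beginning of this section can be
written with a critical region instead (a minimal sketch, reusing the loop
from that example):

#pragma omp parallel for
for(int i=0;i<n;i++) {
    #pragma omp critical
    {
        // do some work; only one thread at a time executes this block
    }
}

Since a flush is implied at the beginning and end of critical, no manual
reasoning about the memory model is required.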
Race condition
A race condition occurs when multiple threads are allowed to
access the same memory location and at least one access is a write.
WRONG!
#pragma omp parallel
{
    #pragma omp for reduction(+:s) nowait
    for(int i=0;i<n;i++)
        s += v[i];

    int id = omp_get_thread_num();
    a[id] = f(s, id);
}
The nowait clause removes the implied barrier (and with it the flush) at
the end of the worksharing construct.
Recommendation: be careful, this might introduce a race condition. Here a
thread may read s in a[id] = f(s, id) before all threads have finished the
reduction.
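A sketch of one way to fix the example: dropping nowait restores the
implied barrier (and flush) at the end of the loop, so the reduction is
complete before any thread uses s.

#pragma omp parallel
{
    #pragma omp for reduction(+:s)   // implied barrier and flush at the end
    for(int i=0;i<n;i++)
        s += v[i];

    // here every thread sees the final value of s
    int id = omp_get_thread_num();
    a[id] = f(s, id);
}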
Race condition
Recommendation: declare variables where they are used.
Bad!
double x;
#pragma omp parallel for private(x)
for(int i=0;i<n;i++) {
    // code using x
}

Good!
#pragma omp parallel for
for(int i=0;i<n;i++) {
    double x;
    // code using x
}
Recommendation: force the explicit declaration of all variables.
!$OMP PARALLEL DEFAULT(NONE) SHARED(...) PRIVATE(...)
! code
!$OMP END PARALLEL
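The same can be enforced in C/C++ with the default(none) clause (a short
sketch; a and n stand for whatever variables the region actually uses):

#pragma omp parallel default(none) shared(a, n)
{
    // every variable used here must appear in a data-sharing clause,
    // otherwise the compiler reports an error
}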
Recommendation: use unit tests with different numbers of threads
and multiple runs to test your code.
Recommendation: use tools that can detect race conditions
(such as Intel Inspector).
Library functions
Race conditions can hide inside library functions.
WRONG!
#pragma omp parallel
{
    time_t t;
    time(&t);
    tm* ptm = gmtime(&t);
}
From http://www.cplusplus.com/reference/ctime/gmtime/
A pointer to a tm structure with its members filled with the values
that correspond to the UTC time representation of timer.
The returned value points to an internal object whose validity or value
may be altered by any subsequent call to gmtime or localtime.
Library functions
Internally gmtime might look like
tm* gmtime(const time_t* timer) {
    static tm t;  // a single object shared by all callers
    // code that populates t
    return &t;
}
gmtime_r is a thread-safe alternative to gmtime, but gmtime_r is
not part of the C++ standard (it is specified by POSIX).
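A sketch of the parallel region above rewritten with the POSIX function
gmtime_r, which writes its result into a caller-supplied (here
thread-private) buffer:

#pragma omp parallel
{
    time_t t;
    time(&t);
    tm buf;                        // private to each thread
    tm* ptm = gmtime_r(&t, &buf);  // thread safe: no internal static object
}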
Recommendation: make sure that library functions which are
called inside OpenMP parallel regions are thread safe.
Recommendation: avoid side effects/internal state in functions
that are called inside OpenMP parallel regions.
Implementation defined behavior
Certain behavior of the OpenMP runtime is not specified by the
OpenMP standard:
I default number of threads;
I default schedule;
I size of the first chunk in schedule(guided);
I default schedule for schedule(runtime);
I default for dynamic thread adjustment;
I number of levels for nested parallelism.
Recommendation: do not rely on implementation defined behavior.
Recommendation: write OpenMP code that does not assume a
certain number of threads, schedule, chunk size, etc.
How to write efficient OpenMP programs
Overhead of OpenMP
As a rule of thumb we pay the following penalties (in clock cycles):

Operation                   cost in cycles   scaling
arithmetics                 1
L1 hit                      1-10
function call               10-20
thread ID                   10-50            impl. dependent
L3 hit                      40
sin/cos                     100
static for, no barrier      100-200          constant
memory                      200
barrier                     200-500          log, linear
parallel                    500-1000         linear
dynamic for, no barrier     ~10^3            problem dependent
disk                        ~10^5
Exact numbers depend on the specific architecture.
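Such numbers can be estimated on a given machine, e.g. by timing an empty
parallel region with omp_get_wtime (a minimal sketch; the result depends on
compiler, runtime, and hardware):

#include <omp.h>
#include <stdio.h>

int main() {
    const int rep = 100000;
    double t0 = omp_get_wtime();
    for(int r=0;r<rep;r++) {
        #pragma omp parallel
        {
            // empty region: measures fork/join and implied barrier overhead
        }
    }
    double t1 = omp_get_wtime();
    printf("parallel region overhead: %g s\n", (t1 - t0)/rep);
    return 0;
}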
False sharing
Several threads access the same cache line.
[Figure: core 0 executes a[0]++ and core 1 executes a[1]++; both elements
lie in the same cache line, so the line is replicated in both caches and
bounces between them.]
L1 and L2 caches are (usually) distinct for each core.
I Cache coherence protocol moves the cache line
continuously between threads/cores.
This is associated with a large overhead.
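A typical pattern that triggers false sharing, and one common remedy
(a sketch; 64-byte cache lines and the function f are assumptions):

// Prone to false sharing: the per-thread slots are adjacent in memory
// and therefore share cache lines.
double partial[64];
#pragma omp parallel
{
    int id = omp_get_thread_num();
    partial[id] = 0.0;
    #pragma omp for
    for(int i=0;i<n;i++)
        partial[id] += f(i);   // neighbouring slots bounce between caches
}

// One remedy: pad each slot to a full cache line.
struct padded_double { double value; char pad[64 - sizeof(double)]; };

For this particular pattern the cleanest fix is of course a reduction
clause, which keeps the partial sums in thread-private storage.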
Heat equation
Heat equation
Our goal is to solve the heat equation
\partial_t u(t,x,y) = \partial_{xx} u(t,x,y) + \partial_{yy} u(t,x,y)
with boundary conditions u(t,x,0) = x, u(t,x,1) = x, u(t,0,y) = 0,
u(t,1,y) = 1 and initial condition u(0,x,y) = 0.
The solution is approximated by grid values u_{ij}^n.
Time discretization:
(\partial_t u)_{ij}^n \approx \frac{u_{ij}^{n+1} - u_{ij}^n}{\Delta t}
Space discretization:
(\partial_{xx} u)_{ij}^n \approx \frac{u_{i+1,j}^n - 2u_{ij}^n + u_{i-1,j}^n}{\Delta x^2}
Time step:
u_{ij}^{n+1} = u_{ij}^n
  + \frac{\Delta t}{\Delta x^2}\left(u_{i+1,j}^n - 2u_{ij}^n + u_{i-1,j}^n\right)
  + \frac{\Delta t}{\Delta y^2}\left(u_{i,j+1}^n - 2u_{ij}^n + u_{i,j-1}^n\right)
Heat equation
Goals:
I Parallelization of a more realistic application.
I Understand the performance of parallel programs.
Sequential program is provided
I C/C++: heat.c
I Fortran: heat.F
Compile flags to set the number of grid points
g++ -Dimax=250 -Dkmax=250 -O3 heat.c -o heat
Exercise 4a
Parallelize the program using the reduction clause.
Compile and run with 80 × 80 grid points.
Expected result (timings might be different):
I 0.4 s (sequential), 0.5 s (1 thread), 2.8 s (2 threads)
Why is the parallel implementation significantly slower than
the sequential implementation?
Solution 4a
The problem is in the sequential program
for(int k=0;k<kmax;k++)
    for(int i=0;i<imax;i++)
        dphi = (phi[i+1][k]+phi[i-1][k]-2.0*phi[i][k])*dy2i
             + (phi[i][k+1]+phi[i][k-1]-2.0*phi[i][k])*dx2i;
[Figure: memory access pattern of this loop nest vs. the access pattern
with the two loops interchanged.]
Order of the two loops is important.
I Compiler might be smart enough to interchange the loops.
I Not possible if the outer loop is parallelized by OpenMP.
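A sketch of the interchanged and parallelized loop nest (variable names as
in the fragment above; the rest of the loop body is only indicated):

#pragma omp parallel for reduction(max:dphimax)
for(int i=1;i<imax;i++)
    for(int k=1;k<kmax;k++) {
        double dphi = (phi[i+1][k]+phi[i-1][k]-2.0*phi[i][k])*dy2i
                    + (phi[i][k+1]+phi[i][k-1]-2.0*phi[i][k])*dx2i;
        // ... update phin[i][k] and dphimax as in the original program
    }

The parallelized outer loop now runs over i, while the inner loop over k
walks contiguously through phi[i][k].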
Exercise 4b
Tasks:
I Interchange nested loops.
I Investigate performance as a function of the problem size.
Expected results:
I No speedup for 80 × 80.
I Significant speedup for 250 × 250.
I Super-linear speedup for 1000 × 1000.
Why can we observe a speedup of more than 4 with
OMP_NUM_THREADS=4 (super-linear speedup)?
Solution 4b
Memory requirements: 2 · sizeof(double) · (10^3)^2 = 16 MB.
Problem does not fit into the cache of a single core anymore.
I By increasing the number of cores the amount of available
cache increases.
Super-linear speedup is typically observed for relatively small
problems.
Exercise 4c
Further optimize the code by moving the parallel region outside
of the time loop.
Time the numerical computation and the abort statement.
I Why does the abort statement require almost the same time
as the numerical computation?
I Use this knowledge to further optimize the program.
Solution 4c
#pragma omp parallel
for(it=1;it<=itmax;it++) {
    #pragma omp barrier
    #pragma omp single
    dphimax=0.;

    #pragma omp for reduction(max:dphimax)
    for(k=1;k<kmax;k++)
        for(i=1;i<imax;i++) {
            ...
        }

    #pragma omp for
    for(k=1;k<kmax;k++)
        for(i=1;i<imax;i++)
            phi[i][k] = phin[i][k];

    if(dphimax < eps)
        break;
}
Solution 4c
Evaluate the abort condition only every 20th iteration, so that the max
reduction (and its associated synchronization) is only paid occasionally.
Vectorization with OpenMP
Vectorization by the compiler
void vector_add(double* a, double* b) {
    a[0] += b[0]; a[1] += b[1];
    a[2] += b[2]; a[3] += b[3];
}
compiles to four separate add instructions – no vectorization!
void vector_add(double* __restrict a,
                double* __restrict b) {
    a[0] += b[0]; a[1] += b[1];
    a[2] += b[2]; a[3] += b[3];
}
compiles to
vmovupd ymm0, YMMWORD PTR [rsi] # loads 4 doubles
vaddpd ymm0, ymm0, YMMWORD PTR [rdi] # adds 4 doubles
vmovupd YMMWORD PTR [rdi], ymm0 # write 4 doubles
Full examples: https://godbolt.org/z/lIEVSj, https://godbolt.org/z/9JB6T2.
Vectorization by the compiler
The function
void vector_add(double* a, double* b) {
    a[0] += b[0]; a[1] += b[1];
    a[2] += b[2]; a[3] += b[3];
}
cannot be vectorized, since the following call is completely legal:
double* p;
vector_add(p, p+1);
which results in
p[0] = p[0] + p[1];
p[1] = p[1] + p[2]; // not independent of previous line
Automatic vectorization is a difficult problem for the
compiler!
Keyword restrict tells the compiler that all memory accesses
that change a are done explicitly through a – makes it much easier
for the compiler to reason about the code.
Vectorization using OpenMP
The simd directive is used to tell the compiler that the loop
iterations are independent.
#pragma omp simd
for(int i=0;i<n;i++)
    a[i] += b[i];
Is used in the same way as the for/do directives.
Programmer takes responsibility that loop iterations can be
parallelized.
I Responsibility to prove correctness is transferred to a human.
The clauses private, lastprivate, reduction, and collapse can be
used exactly as for a parallel for loop.
Full example: https://godbolt.org/z/AuNwOU.
Vectorization using OpenMP
WRONG!
#pragma omp simd
for(int i=5;i<n;i++)
    a[i] = a[i-5]*b[i];
Correct.
#pragma omp simd safelen(4)
for(int i=5;i<n;i++)
    a[i] = a[i-5]*b[i];
The safelen(m) clause specifies that no two iterations whose distance in
the iteration space is larger than m may be executed concurrently, i.e. at
most m + 1 consecutive iterations (distances 0 to m) can be grouped into
one SIMD chunk.
Vectorization using OpenMP
Functions declared with declare simd can be called inside an omp simd loop.
#pragma omp declare simd notinbranch
double dist(double x1, double y1, double x2, double y2) {
    return sqrt(pow(x1-x2,2) + pow(y1-y2,2));
}

#pragma omp simd
for(int i=0;i<n;i++)
    d[i] = dist(x1[i], y1[i], x2[i], y2[i]);
Vectorization using OpenMP
Modern CPUs can also vectorize branches
#pragma omp declare simd inbranch
double dist(double x1, double y1, double x2, double y2) {
    return sqrt(pow(x1-x2,2) + pow(y1-y2,2));
}

#pragma omp simd
for(int i=0;i<n;i++)
    if(x1[i] > x2[i])
        d[i] = dist(x1[i], y1[i], x2[i], y2[i]);
    else
        e[i] = dist(x1[i], y1[i], x2[i], y2[i]);
Whether such a statement is actually vectorized depends on the
compiler and the available instruction set.
Vectorization using OpenMP
Core based parallelism (MIMD) and vectorization (SIMD) can be
combined.
#pragma omp parallel for simd
for(int i=0;i<n;i++)
a[i] += b[i];
Array of struct vs struct of arrays
Array of struct (AoS):
struct state {
    double density;
    double momentum;
    // ...
};
vector<state> v_aos;

// No vectorization
#pragma omp simd
for(int i=0;i<n;i++)
    v_aos[i].density = f(v_aos[i].density);

Struct of arrays (SoA):
struct states {
    vector<double> density;
    vector<double> momentum;
    // ...
};
states v_soa;

// Vectorization
#pragma omp simd
for(int i=0;i<n;i++)
    v_soa.density[i] = f(v_soa.density[i]);

[Figure: memory access pattern for AoS (strided) vs. SoA (contiguous).]
Full example: https://godbolt.org/z/kik_VP.
Thread affinity in OpenMP
Thread affinity
In order to run an OpenMP program, threads have to be mapped
to cores.
I By default, threads can be moved from one core to another.
On modern systems moving threads can reduce performance.
I Core specific caches have to be invalidated.
I First touch principle is only beneficial if threads are fixed to
the same NUMA domain.
Disable thread movement:
export OMP_PROC_BIND=true
Support for mapping threads to the underlying hardware has
been added in OpenMP 4.0.
I Previously, a patchwork of different tools could be used to
accomplish this.
Thread affinity
cat /proc/cpuinfo
processors: 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
For example, processor 0 and processor 4 report the same physical id (0)
and the same core id (0): they are two hyperthreads of the same core.
[Figure: two sockets, each with its own memory and memory bus; every
socket contains 4 cores with 2 hyperthreads (H0, H1) per core.]
16 processors = 2 CPUs × 4 cores × 2 hyperthreads
OpenMP places and proc bind
Place partition:
OMP_PLACES = threads or cores or sockets
Threads can freely migrate within a place.
Placement options:
OMP_PROC_BIND = spread or close or master
I close: place threads as close together as possible.
I spread: place threads as far apart as possible.
I master: place all threads on the same place as the master thread.
Thread placement
Place all threads on the same NUMA node, one thread per
core.
OMP_NUM_THREADS=4
OMP_PLACES=cores
OMP_PROC_BIND=close
[Figure: all four threads T0–T3 are placed on the four cores of socket 0;
socket 1 remains idle.]
Threads can be moved between hyperthreads.
Thread placement
Spread threads equally among the two NUMA nodes, one
thread per core.
OMP_NUM_THREADS=4
OMP_PLACES=cores
OMP_PROC_BIND=spread
[Figure: T0 and T1 run on cores of socket 0, T2 and T3 on cores of
socket 1.]
Threads can be moved between hyperthreads.
Thread placement
One-to-one placement between threads and hyperthreads.
OMP_NUM_THREADS=16
OMP_PLACES=threads
OMP_PROC_BIND=close
[Figure: threads T0–T7 occupy the 8 hyperthreads of socket 0, threads
T8–T15 the 8 hyperthreads of socket 1; every hyperthread runs exactly one
thread.]
Thread placement
One thread per core.
OMP_NUM_THREADS=8
OMP_PLACES=threads
OMP_PROC_BIND=spread
[Figure: T0–T3 on the cores of socket 0, T4–T7 on the cores of socket 1;
one thread per core, each bound to a single hyperthread.]
Threads are fixed to a single hyperthread.
Thread placement
Recommendation: number of threads ≤ number of cores. One
thread per core.
Recommendation: for memory bound problems spread threads
across all NUMA domains to make full use of the available memory
bandwidth (requires first touch).
Recommendation: Hybrid MPI+OpenMP. One MPI process per
socket and one thread per core.
OMP_NUM_THREADS=4
OMP_PLACES=cores
OMP_PROC_BIND=close
Each MPI process runs on a single NUMA domain.
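A possible launch line for such a hybrid job (Open MPI's --map-by option is
assumed here; the mapping flags differ between MPI implementations, and
./heat_hybrid is a placeholder for the actual binary):

export OMP_NUM_THREADS=4
export OMP_PLACES=cores
export OMP_PROC_BIND=close
mpirun -np 2 --map-by socket ./heat_hybrid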
Thread placement for nested parallelism
OpenMP environment variables can specify different values for
nested parallel regions.
OMP_NUM_THREADS=2,4,2
OMP_PLACES=threads
OMP_PROC_BIND=spread,spread,close
The code
#pragma omp parallel // creates one thread/socket
#pragma omp parallel // creates one thread/core
#pragma omp parallel // creates one thread/hyperthread
//code
creates a total of 16 threads.
The taskloop directive
Remember tasks
struct node {
    node *left, *right;
};

void traverse(node* p) {
    if(p->left)
        #pragma omp task
        traverse(p->left); // this is created as a task
    if(p->right)
        #pragma omp task
        traverse(p->right); // this is created as a task
    process(p);
}

int main() {
    node tree;
    #pragma omp parallel // create a team of threads
    {
        #pragma omp single
        traverse(&tree); // started by a single thread; the tasks run in parallel
    }
}
Taskloop
Taskloop works like a parallel for loop and is used like a task
construct.
#pragma omp parallel
#pragma omp single
#pragma omp taskloop
for(int i=0;i<n;i++)
a[i] = b[i] + i;
We can control the number of tasks by setting either
I num_tasks: the number of tasks that are generated; or
I grainsize: how many loop iterations should be assigned to a
single task.
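For example, the task granularity of the loop above can be controlled
explicitly (a sketch; the value 1000 is only illustrative):

#pragma omp parallel
#pragma omp single
#pragma omp taskloop grainsize(1000)   // roughly 1000 iterations per task
for(int i=0;i<n;i++)
    a[i] = b[i] + i;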
private, collapse, etc. can be used as in a parallel for loop.
I reduction clause for taskloop has been added in OpenMP 5.0.
Many more tasks can be generated than threads are available.
I Load balancing similar to the dynamic scheduling strategy.
Taskloop
Main application of taskloop is to combine task based and
(traditional) loop based parallelism.
#pragma omp parallel
{
    #pragma omp sections
    {
        #pragma omp section
        {
            // MPI communication
            #pragma omp taskloop
            for(int i=0;i<n_b;i++)
                a[i] = ...;
        }
        #pragma omp section
        {
            #pragma omp taskloop
            for(int i=n_b;i<n;i++)
                a[i] = ...;
        }
    }
}

[Figure: inside the parallel region, section 1 performs the MPI
communication and then loop tasks, section 2 only loop tasks; the tasks
generated by both taskloops are executed by all threads of the team.]
Exercise 5
Goal:
I usage of taskloop construct.
Sequential program is provided in
I C/C++: pi_taskloop.c and pi_taskloop2.c
I Fortran: pi_taskloop.f90 and pi_taskloop2.f90
Use taskloop to parallelize pi_taskloop.[c|f90].
Use sections + 2× taskloop to parallelize pi_taskloop2.[c|f90].