Mutual Exclusion:
Classical Algorithms for Locks
Bill Scherer
Department of Computer Science
Rice University
scherer@cs.rice.edu
COMP 422 Lecture 20 March 2008
Motivation
Ensure that a block of code manipulating a data structure is
executed by only one thread at a time
• Why? avoid conflicting accesses to shared data (data races)
—read/write conflicts
—write/write conflicts
• Approach: critical section
• Mechanism: lock
—methods
– acquire
– release
• Usage
—acquire lock to enter the critical section
—release lock to leave the critical section
2
Problems with Locks
• Conceptual
—coarse-grained: poor scalability
—fine-grained: hard to write
• Semantic
—deadlock
—priority inversion
• Performance
—convoying
—intolerance of page faults and preemption
3
Lock Alternatives
• Transactional memory (TM)
+ Easy to use, well-understood metaphor
– High overhead (so far)
± Subject of much active research
• Ad hoc nonblocking synchronization (NBS)
+ Thread failure/delay cannot prevent progress
+ Can be faster than locks (stacks, queues)
– Notoriously difficult to write – every new algorithm is a publishable
result
+ Can be “canned” in libraries (e.g. java.util)
4
Synchronization Landscape
[figure: the synchronization landscape, plotting programmer effort against system performance for coarse-grained locks, fine-grained locks, canned NBS, ad hoc NBS, software TM (STM), and hardware TM]
5
Properties of Good Lock Algorithms
• Mutual exclusion (safety property)
—critical sections of different threads do not overlap
– cannot guarantee integrity of computation without this property
• No deadlock
—if some thread attempts to acquire the lock, then some thread will
acquire the lock
• No starvation
—every thread that attempts to acquire the lock eventually succeeds
– implies no deadlock
Notes
• Deadlock-free locks do not imply a deadlock-free program
—e.g., can create a circular wait involving a pair of “good” locks acquired in opposite orders (see the sketch below)
• Starvation freedom is desirable, but not essential
—practical locks: many permit starvation, although it is unlikely to occur
• Without a real-time guarantee, starvation freedom is a weak property
6
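A minimal illustration of the circular-wait note above, using C++ std::mutex as a stand-in for two individually deadlock-free locks (the thread bodies are hypothetical):

#include <mutex>
#include <thread>

// Each mutex is a perfectly good lock on its own, but acquiring the pair in
// opposite orders creates a circular wait: A holds m1 and waits for m2 while
// B holds m2 and waits for m1.
std::mutex m1, m2;

void threadA() {
    std::lock_guard<std::mutex> a(m1);
    std::lock_guard<std::mutex> b(m2);   // blocks if B already holds m2
}

void threadB() {
    std::lock_guard<std::mutex> a(m2);
    std::lock_guard<std::mutex> b(m1);   // blocks if A already holds m1
}

int main() {
    std::thread tA(threadA), tB(threadB); // may deadlock
    tA.join();
    tB.join();
}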
Topics for Today
Classical locking algorithms using load and store
• Steps toward a two-thread solution
—two partial solutions and their properties
• Peterson’s algorithm: a two-thread solution
• Filter lock: generalized Peterson
7
Classical Lock Algorithms
• Use atomic load and store only, no stronger atomic primitives
• Not used in practice
—locks based on stronger atomic primitives are more efficient
• Why study classical algorithms?
—understand the principles underlying synchronization
– subtle
– such issues are ubiquitous in parallel programs
8
Toward a Classical Lock for Two Threads
• First, consider two inadequate but interesting lock algorithms
—use load and store only
• Assumptions
—only two threads
—each thread has a unique value of self_threadid ∈ {0,1}
9
Lock1
class Lock1: public Lock {
private:
  volatile bool flag[2];
public:
  void acquire() {
    int other_threadid = 1 - self_threadid;
    flag[self_threadid] = true;            // set my flag
    while (flag[other_threadid] == true);  // wait until the other flag is false
  }
  void release() {
    flag[self_threadid] = false;
  }
}
10
Using Lock1
assume that initially
both flags are false
thread 0                        thread 1
flag[0] = true
while(flag[1] == true);
                                flag[1] = true
CS0                             while(flag[0] == true);
                                wait
flag[0] = false
                                CS1
                                flag[1] = false
11
Using Lock1
thread 0                        thread 1
flag[0] = true
                                flag[1] = true
while(flag[1] == true);         while(flag[0] == true);
wait                            wait
deadlock!
12
Summary of Lock1 Properties
• If one thread executes acquire before the other, works fine
—Lock1 provides mutual exclusion
• However, Lock1 is inadequate
—if both threads write flags before either reads → deadlock
13
Lock2
class Lock2: public Lock {
private:
  volatile int victim;
public:
  void acquire() {
    victim = self_threadid;
    while (victim == self_threadid);  // busy wait
  }
  void release() { }
}
14
Using Lock2
thread 0                        thread 1
victim = 0                      victim = 1
while(victim == 0);             while(victim == 1);
                                wait
victim = 0
while(victim == 0);
wait
15
Using Lock2
thread 0
victim = 0
while(victim == 0);
wait
deadlock!
16
Summary of Lock2 Properties
• If the two threads run concurrently, acquire succeeds for one
—provides mutual exclusion
• However, Lock2 is inadequate
—if one thread runs before the other, it will deadlock
17
Combining the Ideas
Lock1 and Lock2 complement each other
• Each succeeds under conditions that cause the other to fail
—Lock1 succeeds when CS attempts do not overlap
—Lock2 succeeds when CS attempts do overlap
• Design a lock protocol that leverages the strengths of both…
18
Peterson’s Algorithm: 2-way Mutual Exclusion
class Peterson: public Lock {
private:
  volatile bool flag[2];
  volatile int victim;
public:
  void acquire() {
    int other_threadid = 1 - self_threadid;
    flag[self_threadid] = true;   // I’m interested
    victim = self_threadid;       // you go first
    while (flag[other_threadid] == true &&
           victim == self_threadid);
  }
  void release() {
    flag[self_threadid] = false;
  }
}
Gary Peterson. Myths about the Mutual Exclusion Problem.
Information Processing Letters, 12(3):115-116, 1981.
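The listing above assumes sequentially consistent memory. A hedged C++ rendering that actually runs on modern hardware needs std::atomic rather than volatile; the harness below (class name, iteration count, and the counter are ours) is a sketch of how the lock would be exercised:

#include <atomic>
#include <thread>
#include <cstdio>

// Peterson’s lock with C++11 atomics; the default (sequentially consistent)
// ordering is what makes the flag/victim handshake correct on real hardware.
class PetersonLock {
  std::atomic<bool> flag[2];
  std::atomic<int> victim{0};
public:
  PetersonLock() { flag[0] = false; flag[1] = false; }
  void acquire(int self) {
    int other = 1 - self;
    flag[self].store(true);                 // I’m interested
    victim.store(self);                     // you go first
    while (flag[other].load() && victim.load() == self)
      ;                                     // spin
  }
  void release(int self) { flag[self].store(false); }
};

int main() {
  PetersonLock lock;
  long counter = 0;                         // protected by the lock
  auto work = [&](int self) {
    for (int i = 0; i < 100000; i++) {
      lock.acquire(self);
      counter++;                            // critical section
      lock.release(self);
    }
  };
  std::thread t0(work, 0), t1(work, 1);
  t0.join();
  t1.join();
  std::printf("counter = %ld\n", counter);  // expect 200000
}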
19
Peterson’s Lock: Serialized Acquires
thread 0                          thread 1
flag[0] = true
victim = 0
while(flag[1] == true
      && victim == 0);
                                  flag[1] = true
                                  victim = 1
                                  while(flag[0] == true
CS0                                     && victim == 1);
                                  wait
flag[0] = false
                                  CS1
                                  flag[1] = false
20
Peterson’s Lock: Concurrent Acquires
thread 0                          thread 1
flag[0] = true
victim = 0                        flag[1] = true
                                  victim = 1
while(flag[1] == true             while(flag[0] == true
      && victim == 0);                  && victim == 1);
CS0
                                  wait
flag[0] = false
                                  CS1
                                  flag[1] = false
21
From 2-way to N-way Mutual Exclusion
• Peterson’s lock provides 2-way mutual exclusion
• How can we generalize to N-way mutual exclusion, N > 2?
• Filter lock: direct generalization of Peterson’s lock
22
Filter Lock
class Filter: public Lock {
private:
  volatile int level[N];    // level[k] = highest level thread k is trying to enter (0 = not trying)
  volatile int victim[N];   // victim[j] = thread filtered out at level j (levels 1..N-1; entry 0 unused)
public:
  void acquire() {
    for (int j = 1; j < N; j++) {
      level[self_threadid] = j;
      victim[j] = self_threadid;
      // wait while conflicts exist
      while (sameOrHigher(self_threadid, j) &&
             victim[j] == self_threadid);
    }
  }
  bool sameOrHigher(int i, int j) {
    for (int k = 0; k < N; k++)
      if (k != i && level[k] >= j) return true;
    return false;
  }
  void release() {
    level[self_threadid] = 0;
  }
}
23
Understanding the Filter Lock
• Peterson’s lock used a two-element Boolean flag array
• Filter lock generalization: an N-element integer level array
—value of level[k] = highest level thread k is interested in entering
—each thread must pass through N-1 levels of exclusion
• Each level has its own victim flag to filter out one thread,
excluding it from the next level
—natural generalization of victim variable in Peterson’s algorithm
• Properties of levels
—at least one thread trying to enter level k succeeds
—if more than one thread is trying to enter level k, then at least one
is blocked
• For proofs, see Herlihy and Shavit’s manuscript
24
References
• Maurice Herlihy and Nir Shavit. “Multiprocessor
Synchronization and Concurrent Data Structures.” Chapter 3
“Mutual Exclusion.” Draft manuscript, 2005.
• Gary Peterson. Myths about the Mutual Exclusion Problem.
Information Processing Letters, 12(3):115-116, 1981.
25
Lock Synchronization with
Atomic Primitives
Bill Scherer
Department of Computer Science
Rice University
scherer@cs.rice.edu
COMP 422 Lecture 20 March 2008
Topics for Today
• Atomic primitives for synchronization
• Lock algorithms using atomic primitives
—test-and-set lock
—test-and-set with exponential backoff
—array-based queue locks
—MCS list-based queue lock
—CLH list-based queue lock
• Case study: performance of lock implementations
—BBN Butterfly and Sequent Symmetry
27
Atomic Primitives for Synchronization
Atomic read-modify-write primitives
• test_and_set(Word &M)
—writes a 1 into M
—returns M’s previous value
• swap(Word &M, Word V)
—replaces the contents of M with V
—returns M’s previous value
• fetch_and_Φ(Word &M, Word V)
—Φ can be ADD, OR, XOR
—replaces the value of M with Φ(old value, V)
—returns M’s previous value
• compare_and_swap(Word &M, Word oldV, Word newV)
—if (M == oldV) M ← newV
—returns TRUE if store was performed
—universal primitive: can implement any other read-modify-write operation (see the sketch below)
28
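For concreteness, here is the usual construction that makes compare_and_swap universal: any read-modify-write can be built from a read followed by a CAS retry loop. A minimal C++11 sketch (the function name fetch_and_add_via_cas is ours; std::atomic already provides fetch_add directly):

#include <atomic>

// fetch_and_add built from compare-and-swap: read the current value, then
// retry the CAS until no other thread has written in between.
int fetch_and_add_via_cas(std::atomic<int>& M, int v) {
  int old = M.load();
  while (!M.compare_exchange_weak(old, old + v)) {
    // on failure, compare_exchange_weak reloads 'old' with M's current value
  }
  return old;   // previous value, matching the fetch_and_Φ convention
}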
Load-Linked & Store Conditional
• load_linked(Word &M)
—sets a mark bit in M’s cache line
—returns M’s value
• store_conditional(Word &M, Word V)
—if mark bit is set for M’s cache line, store V into M, otherwise fail
—condition code indicates success or failure
—may fail spuriously, e.g., after
– a context switch, another load_linked, or eviction of the cache line
• Arbitrary read-modify-write operations with LL / SC
loop forever
    V := load_linked(M)
    compute a new value V’ from V (and any other values, arbitrary instructions)
    store_conditional(M, V’)
    if the store_conditional succeeded, exit loop
• Supported on Alpha, PowerPC, MIPS, and ARM
29
Test & Set Lock
type lock = (unlocked, locked)

procedure acquire_lock (L : ^lock)
    loop
        // NOTE: test_and_set returns the old value
        if test_and_set (L) = unlocked
            return

procedure release_lock (L : ^lock)
    L^ := unlocked
30
Test & Test & Set (TATAS) Lock
type lock = (unlocked, locked)

procedure acquire_lock (L : ^lock)
    loop
        // NOTE: test_and_set returns the old value
        if test_and_set (L) = unlocked
            return
        else
            loop until L^ <> locked   // spin with ordinary reads

procedure release_lock (L : ^lock)
    L^ := unlocked
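A hedged C++11 rendering of the TATAS idea (class and member names are ours); the point is that the spin happens on an ordinary read, so waiting processors spin in their own caches and only retry the atomic exchange when the lock looks free:

#include <atomic>

// Test-and-test-and-set spinlock sketch.
class TATASLock {
  std::atomic<bool> locked{false};
public:
  void acquire() {
    for (;;) {
      // exchange(true) plays the role of test_and_set: it returns the old value
      if (!locked.exchange(true, std::memory_order_acquire))
        return;                                 // observed unlocked: we now own the lock
      while (locked.load(std::memory_order_relaxed))
        ;                                       // spin on a plain read until it looks free
    }
  }
  void release() {
    locked.store(false, std::memory_order_release);
  }
};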
31
Test & Set Lock Notes
• Space: n words for n locks and p processes
• Lock acquire properties
—spin waits using atomic read-modify-write
• Starvation theoretically possible; unlikely in practice
—fairness, however, can be very uneven
• Poor scalability
—continual updates to a lock cause heavy network traffic
– on cache-coherent machines, each update causes an invalidation
—improved with the TATAS variant, but there is still a big spike of traffic on each
release of the lock, even on cache-coherent machines
32
Test & Set Lock with Exponential Backoff
type lock = (unlocked, locked)

procedure acquire_lock (L : ^lock)
    delay : integer := 1
    // NOTE: test_and_set returns the old value
    while test_and_set (L) = locked
        pause (delay)          // wait this many units of time
        delay := delay * 2     // double the delay each time

procedure release_lock (L : ^lock)
    L^ := unlocked
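A sketch of the same idea in C++11 (names and the use of sleep_for as the pause are ours), including the cap on the probe delay that the notes on the next slide recommend:

#include <atomic>
#include <chrono>
#include <thread>

// Test-and-set lock with capped exponential backoff: after each failed
// attempt, wait roughly twice as long before probing the lock again.
class BackoffLock {
  std::atomic<bool> locked{false};
  static constexpr int kMaxDelayUs = 1000;        // cap on the probe delay (assumption)
public:
  void acquire() {
    int delay_us = 1;
    while (locked.exchange(true, std::memory_order_acquire)) {
      std::this_thread::sleep_for(std::chrono::microseconds(delay_us));
      if (delay_us < kMaxDelayUs) delay_us *= 2;  // double, up to the cap
    }
  }
  void release() {
    locked.store(false, std::memory_order_release);
  }
};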
33
Test & Set Lock with Exp. Backoff Notes
• Similar to code developed by Tom Anderson
• Grants requests in unpredictable order
• Starvation is theoretically possible, but unlikely in practice
• Spins (with backoff) on remote locations
• Atomic primitives: test_and_set
• Pragmatics: need to cap probe delay to some maximum
IEEE TPDS, January 1990
34
Array-based Lock Notes
• Grants requests in FIFO order (see the sketch below)
• Space: O(pn) space for p processes and n locks
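The array-based lock itself is not listed on this slide; the sketch below shows the usual construction (in the spirit of Anderson's lock), in which each acquirer draws a ticket with an atomic fetch-and-increment and then spins only on its own slot. All names, the MAX_THREADS bound, and the omission of per-slot cache-line padding are assumptions of this sketch:

#include <atomic>

// Array-based queue lock: FIFO hand-off through an array of per-slot flags.
class ArrayQueueLock {
  static constexpr int MAX_THREADS = 64;       // assumed bound on concurrent contenders
  std::atomic<bool> can_enter[MAX_THREADS];    // in practice, pad each slot to its own cache line
  std::atomic<unsigned> next_ticket{0};
public:
  ArrayQueueLock() {
    can_enter[0].store(true);                  // the first acquirer may proceed immediately
    for (int i = 1; i < MAX_THREADS; i++) can_enter[i].store(false);
  }
  // returns the slot index, which the caller must pass back to release()
  int acquire() {
    int slot = static_cast<int>(next_ticket.fetch_add(1) % MAX_THREADS);
    while (!can_enter[slot].load(std::memory_order_acquire))
      ;                                        // spin only on our own slot
    return slot;
  }
  void release(int slot) {
    can_enter[slot].store(false);              // re-arm this slot for the next lap around the array
    can_enter[(slot + 1) % MAX_THREADS].store(true, std::memory_order_release);  // grant next waiter
  }
};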
35
The MCS List-based Queue Lock
type qnode = record
next : ^qnode
locked : Boolean
type lock = ^qnode // initialized to nil
// parameter I, below, points to a qnode record allocated (in an enclosing scope) in
// shared memory locally-accessible to the invoking processor
procedure acquire_lock (L : ^lock, I : ^qnode)
    I->next := nil
    predecessor : ^qnode := fetch_and_store (L, I)
    if predecessor != nil              // queue was non-empty
        I->locked := true
        predecessor->next := I
        repeat while I->locked         // spin

procedure release_lock (L : ^lock, I : ^qnode)
    if I->next = nil                   // no known successor
        if compare_and_swap (L, I, nil) return
        // compare_and_swap returns true iff it stored
        repeat while I->next = nil     // spin until the successor links itself in
    I->next->locked := false
36
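Before the queue walkthrough on the next slides, here is the same algorithm rendered as a C++11 sketch (class names, field names, and memory-order choices are ours, not part of the original pseudocode):

#include <atomic>

// MCS list-based queue lock: each thread spins on its own node.
struct QNode {
  std::atomic<QNode*> next{nullptr};
  std::atomic<bool> locked{false};
};

class MCSLock {
  std::atomic<QNode*> tail{nullptr};             // nullptr means the lock is free
public:
  void acquire(QNode* I) {
    I->next.store(nullptr, std::memory_order_relaxed);
    // fetch_and_store: swap ourselves in as the new tail
    QNode* pred = tail.exchange(I, std::memory_order_acq_rel);
    if (pred != nullptr) {                       // queue was non-empty
      I->locked.store(true, std::memory_order_relaxed);
      pred->next.store(I, std::memory_order_release);   // link behind our predecessor
      while (I->locked.load(std::memory_order_acquire))
        ;                                        // spin only on our own node
    }
  }
  void release(QNode* I) {
    if (I->next.load(std::memory_order_acquire) == nullptr) {
      // no known successor: try to swing the tail back to "free"
      QNode* expected = I;
      if (tail.compare_exchange_strong(expected, nullptr, std::memory_order_acq_rel))
        return;                                  // nobody was waiting
      while (I->next.load(std::memory_order_acquire) == nullptr)
        ;                                        // a successor is arriving; wait for its link
    }
    I->next.load(std::memory_order_relaxed)->locked.store(false, std::memory_order_release);
  }
};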
MCS Lock In Action - I
[diagram: queue nodes 1 (run), 2 (spin), 3 (spin); 4 (arriving); tail points to node 3]
Process 4 arrives, attempting to acquire lock
37
MCS Lock In Action - II
[diagram: queue nodes 1 (run), 2 (spin), 3 (spin), 4 (arriving); tail now points to node 4]
• Process 4 swaps self into tail pointer
• Acquires pointer to predecessor (3) from swap on tail
• Note: 3 can’t leave without noticing that one or more
successors will link in behind it because the tail no longer
points to 3
38
MCS Lock In Action - III
[diagram: queue nodes 1 (run), 2 (spin), 3 (spin), 4 (arriving); 4 is now linked as 3’s successor]
4 links behind predecessor (3)
39
MCS Lock In Action - IV
[diagram: queue nodes 1 (run), 2 (spin), 3 (spin), 4 (spin)]
4 now spins until 3 signals that the lock is available
by setting a flag in 4’s lock record
40
MCS Lock In Action - V
[diagram: queue nodes 1 (leaving), 2 (spin), 3 (spin), 4 (spin)]
• Process 1 prepares to release the lock
—if its next field is set, signal the successor directly
—suppose 1’s next pointer is still null
– attempt a compare_and_swap on the tail pointer
– it finds that the tail no longer points to self
– it waits until the successor pointer is valid (already points to 2 in the diagram)
– it signals the successor (process 2)
41
MCS Lock In Action - VI
[diagram: queue nodes 1 (leaving), 2 (run), 3 (spin), 4 (spin)]
42
MCS Lock Notes
• Grants requests in FIFO order
• Space: 2p + n words of space for p processes and n locks
• Requires a local "queue node" to be passed in as a parameter
—alternatively, additional code can allocate these dynamically in
acquire_lock and look them up in a table in release_lock
• Spins only on local locations
—on both cache-coherent and non-cache-coherent machines
• Atomic primitives
—fetch_and_store and (ideally) compare_and_swap
ASPLOS, April 1991
ACM TOCS, February 1991
43
Impact of the MCS Lock
• Key lesson: importance of reducing memory traffic in
synchronization
—local spinning technique influenced virtually all practical scalable
synchronization algorithms since
• 2006 Edsger Dijkstra Prize in distributed computing
—“an outstanding paper on the principles of distributed computing, whose
significance and impact on the theory and/or practice of distributed
computing has been evident for at least a decade”
—“probably the most influential practical mutual exclusion algorithm ever”
—“vastly superior to all previous mutual exclusion algorithms”
—fast, scalable, and fair in a wide variety of multiprocessor systems
—avoids need to pre-allocate memory for a fixed, maximum # of threads
—widely used: e.g., monitor locks used in Java VMs are variants of MCS
44
CLH List-based Queue Lock
type qnode = record
prev : ^qnode
succ_must_wait : Boolean
type lock = ^qnode // initialized to point to an unowned qnode
procedure acquire_lock (L : ^lock, I : ^qnode)
    I->succ_must_wait := true
    pred : ^qnode := I->prev := fetch_and_store (L, I)
    repeat while pred->succ_must_wait   // spin on the predecessor's flag

procedure release_lock (ref I : ^qnode)
    pred : ^qnode := I->prev
    I->succ_must_wait := false
    I := pred                           // take pred's qnode
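A C++11 sketch of the same algorithm (names and memory orders are ours). Each thread starts with a freshly allocated node and, as in the pseudocode's I := pred, walks away from release with its predecessor's node for the next acquisition:

#include <atomic>

// CLH list-based queue lock: each thread spins on its predecessor's node.
struct CLHNode {
  CLHNode* prev{nullptr};
  std::atomic<bool> succ_must_wait{false};
};

class CLHLock {
  std::atomic<CLHNode*> tail;
public:
  CLHLock() : tail(new CLHNode()) {}             // initially points to an unowned node
  void acquire(CLHNode*& I) {
    I->succ_must_wait.store(true, std::memory_order_relaxed);
    CLHNode* pred = I->prev = tail.exchange(I, std::memory_order_acq_rel);  // fetch_and_store
    while (pred->succ_must_wait.load(std::memory_order_acquire))
      ;                                          // spin on the predecessor's flag
  }
  void release(CLHNode*& I) {
    CLHNode* pred = I->prev;
    I->succ_must_wait.store(false, std::memory_order_release);  // hand the lock to the successor
    I = pred;                                    // recycle the predecessor's node
  }
};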
45
CLH Lock In Action
[diagram: four queued nodes with states run, spin, spin, spin; tail points to the last node]
46
CLH Queue Lock Notes
• Discovered twice, independently
—Travis Craig (University of Washington)
– TR 93-02-02, February 1993
—Anders Landin and Eric Hagersten (Swedish Institute of CS)
– IPPS, 1994
• Space: 2p + 3n words of space for p processes and n locks
—MCS lock requires 2p + n words
• Requires a local "queue node" to be passed in as a parameter
• Spins only on local locations on a cache-coherent machine
• Local-only spinning possible when lacking coherent cache
—can modify implementation to use an extra level of indirection
(local spinning variant not shown)
• Atomic primitives: fetch_and_store
47
Case Study:
Evaluating Lock
Implementations for the
BBN Butterfly and Sequent
Symmetry
J. Mellor-Crummey and M. Scott. Algorithms for scalable
synchronization on shared-memory multiprocessors. ACM
Transactions on Computer Systems, 9(1):21-65, Feb. 1991.
48
BBN Butterfly
• 8 MHz MC68000
• 24-bit virtual address space
• 1-4 MB memory per PE
• log4 depth switching network
• Packet switched, non-blocking
• Remote reference
—4 µs (no contention)
—5x local reference
• Collisions in network
—1 reference succeeds
—others aborted and retried later
• 16-bit atomic operations
—fetch_clear_then_add
—fetch_clear_then_xor
49
Sequent Symmetry
• 16 MHz Intel 80386
• Up to 30 CPUs
• 64KB 2-way set associative cache
• Snoopy coherence
• Atomic operations: various logical and arithmetic ops
—no return values, condition codes only
50
Lock Comparison
BBN Butterfly: distributed memory, no coherent caches
[graph: lock performance, empty critical section]
51
Lock Comparison (Selected Locks Only)
BBN Butterfly: distributed memory, no coherent caches
[graph: lock performance, empty critical section]
52
Lock Comparison (Selected Locks Only)
Sequent Symmetry: shared-bus, coherent caches
[graph: lock performance, empty critical section]
53
Lock Comparison (Selected Locks Only)
Sequent Symmetry: shared-bus, coherent caches
[graph: lock performance, small critical section]
54
References
• J. Mellor-Crummey and M. L. Scott. Synchronization without Contention. ASPLOS, pages 269-
278, April 1991.
• J. Mellor-Crummey and M. Scott. Algorithms for scalable synchronization on shared-
memory multiprocessors. ACM Transactions on Computer Systems, 9(1):21-65, Feb.
1991.
• T. E. Anderson, The performance of spin lock alternatives for shared-memory
multiprocessors. IEEE Transactions on Parallel and Distributed Systems, 1(1):6-16,
Jan. 1990.
• Gary Graunke and Shreekant Thakkar, Synchronization Algorithms for Shared-
Memory Multiprocessors, Computer, 23(6):60-69, June 1990.
• Travis Craig, Building FIFO and priority queuing spin locks from atomic swap.
University of Washington, Dept. of Computer Science, TR 93-02-02, Feb. 1993.
• Anders Landin and Eric Hagersten. Queue locks on cache coherent multiprocessors.
International Parallel Processing Symposium, pages 26-29, 1994.
55
Lemma: For j, 0 ≤ j ≤ n-1, there are at most n - j threads at level j
• Proof by induction on j.
• Base case: j = 0 is trivially true.
• Induction hypothesis: at most n-j+1 threads at level j-1
• Induction step:
—show that at least one thread cannot progress to level j
—argue by contradiction: assume there are n-j+1 threads at level j
– let A be the last thread at level j to write to victim[j]
– because A is last, for any other B at level j:
write_B(victim[j] = B) → write_A(victim[j] = A)
– B writes level[B] = j before it writes victim[j], so:
write_B(level[B] = j) → write_B(victim[j] = B) → write_A(victim[j] = A)
– when A subsequently tests its waiting condition, it reads level[B] ≥ j and victim[j] = A,
so A remains stuck in the waiting loop and never reaches level j, a contradiction
57
• Evaluation criteria
—hardware support
—performance: latency, throughput
—fairness
• Mutual exclusion
—load-store based protocols
—test and set locks
—ticket locks
—queuing locks
• Barriers
—centralized barriers: counters and flags
—software combining trees
—tournament barrier
—dissemination barrier
• Problems and solutions
—re-initialization via sense switching
—handling counter overflow
58
Maintain the integrity of shared data structures
• Goal: avoid conflicting updates
—read/write conflicts
—write/write conflicts
59