Ctec2909 Data Structures and Algorithms: Lecture Week 3 Friday Hash Maps

The document discusses hash maps, which store information with keys. A hash function maps keys to locations in the hash map to allow for fast insertion, deletion, and searching. Collisions can occur if multiple keys map to the same location. Collision resolution techniques include open addressing, which finds another location, and separate chaining, which stores multiple items in a linked list at each location. The efficiency depends on the load factor, which should be less than 2/3.

CTEC2909 DATA STRUCTURES AND ALGORITHMS
Lecture Week 3 Friday Hash Maps
Hash Maps
A hash map (or hash table) stores information with keys.

The key is used to find the location of the item in the hash map.

[Figure: three records stored in a hash map under their keys: Robert Culp (ID: 121535), Ingrid Bergman (ID: 685865), Ricardo Montalban (ID: 34637).]
Hash Maps
The aim of hash maps is to get O(1) performance on average for insertion, deletion and searching.
insertion: calculate the address for the given key, then store the item at that address
deletion: calculate the address for the given key, then delete the item at that address
search: calculate the address for the given key, then return the item at that address
Hash Maps
insert(Key key, Data item){
    int i = findAddress(key);
    store item+key at map[i];
}

find(Key key){
    int i = findAddress(key);
    if (key at map[i] == key){
        return item at map[i];
    }
    return null;
}

delete(Key key){
    int i = findAddress(key);
    if (key at map[i] == key){
        delete map[i];
    }
}
Hash Functions
A hash function is used to map keys to locations.

[Figure: the hash function maps the keys of Robert Culp (ID: 121535), Ingrid Bergman (ID: 685865) and Ricardo Montalban (ID: 34637) to locations in the hash map.]
Hash Functions
How does the hash function work?
It could be done by having a unique location for every single key.
However, this could require a large amount of space, e.g. for ID
numbers ranging from 111111 to 999999.
In most cases, there will be far fewer data items than the total
possible, e.g. only 100 people with ID numbers in the above range.
Solution: Use a smaller size hash map and map the keys into the
smaller range.
Hash Functions
A perfect hash function maps every key into a unique location, but this
is only possible if all the keys that occur in this set are known in advance.
Otherwise, collisions could occur:
[Figure: the hash function maps Robert Culp (ID: 121535), Ingrid Bergman (ID: 685865), Ricardo Montalban (ID: 34637) and Peter Falk (ID: 46363) to locations in the hash map, and a collision occurs.]
Hash Functions
Some simple hash functions for integer keys:
• Selecting particular digits – e.g. select the 3rd and 7th digit
– depending on the data, this might not evenly distribute the items,
e.g. for phone numbers where some digits are area codes.
• Folding digits – add all the digits, e.g. 2523 gives 2 + 5 + 2 + 3 = 12
– or to get larger values, group and then add, e.g. 25 + 23 = 48.
• Mod function – number mod size, where size is the size of the hash
map, e.g. 482567 mod 10 = 7.
– Using prime number sizes distributes the items more evenly, e.g. 101
instead of 100.
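The three approaches above can be sketched in Java roughly as follows. The class and method names are made up for illustration, keys are assumed to be non-negative, and digit positions are assumed to be counted from the left starting at 1.

```java
// Illustrative sketch of the simple integer hash functions described above.
// Names are hypothetical; keys are assumed to be non-negative.
final class SimpleIntHashes {

    // Digit selection: join the digits at two 1-based positions (from the left).
    static int selectDigits(long key, int firstPos, int secondPos) {
        String digits = Long.toString(key);
        int a = digits.charAt(firstPos - 1) - '0';
        int b = digits.charAt(secondPos - 1) - '0';
        return a * 10 + b;
    }

    // Folding: add all the decimal digits, e.g. 2523 gives 2 + 5 + 2 + 3 = 12.
    static int foldDigits(long key) {
        int sum = 0;
        while (key > 0) {
            sum += (int) (key % 10);
            key /= 10;
        }
        return sum;
    }

    // Folding in groups of two digits, e.g. 2523 gives 25 + 23 = 48.
    static int foldPairs(long key) {
        int sum = 0;
        while (key > 0) {
            sum += (int) (key % 100);
            key /= 100;
        }
        return sum;
    }

    // Mod function: key mod size, where size is the size of the hash map.
    static int modHash(long key, int size) {
        return (int) (key % size);
    }
}
```

For example, `SimpleIntHashes.modHash(482567, 10)` gives 7, matching the slide. The results of digit selection and folding would normally still be reduced with mod so that they fit the map size.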
Hash Functions
Hash functions for non-integers:
• For strings: Change each character into a number, e.g. its ASCII value.
- Adding the values will mean “abcd” and “dacb” get mapped to the same
location.
- Alternative: change into binary and concatenate, e.g. N = 14 = 01110 and
T = 20 = 10100 (here each letter is numbered by its position in the
alphabet), so NT = 0111010100. Then convert back to decimal (468) and use mod.
- Another way: 14 * 32^1 + 20 * 32^0 = 468. (Horner’s Rule)
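A minimal Java sketch of the string hashing ideas above, assuming the input contains only the letters A to Z (numbered 1 to 26 as in the N = 14, T = 20 example) and using base 32. The class and method names are illustrative only.

```java
// Illustrative sketch: additive hashing versus Horner's rule for strings.
// Assumes the string contains only letters A-Z (case is ignored).
final class StringHash {

    // Position in the alphabet: A = 1, ..., N = 14, ..., T = 20, ..., Z = 26.
    static int letterValue(char c) {
        return Character.toUpperCase(c) - 'A' + 1;
    }

    // Adding the values: anagrams such as "abcd" and "dacb" collide.
    static int additive(String s) {
        int sum = 0;
        for (int i = 0; i < s.length(); i++) {
            sum += letterValue(s.charAt(i));
        }
        return sum;
    }

    // Horner's rule without mod; only safe for short strings (may overflow).
    static long encode(String s) {
        long value = 0;
        for (int i = 0; i < s.length(); i++) {
            value = value * 32 + letterValue(s.charAt(i));
        }
        return value;
    }

    // Horner's rule with mod applied at each step, so the value stays small.
    static int hash(String s, int size) {
        long h = 0;
        for (int i = 0; i < s.length(); i++) {
            h = (h * 32 + letterValue(s.charAt(i))) % size;
        }
        return (int) h;
    }
}
```

Here `StringHash.encode("NT")` evaluates 14 * 32 + 20 = 468, the same value as the binary concatenation. Applying mod inside the loop gives the same final location as applying it once at the end, while avoiding very large intermediate numbers.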
Collisions
What should be done if collisions occur?
- can’t just not add the item since collisions can occur even if the map has
only 1 item!
[Figure: the hash function maps Robert Culp (ID: 121535), Ingrid Bergman (ID: 685865), Peter Falk (ID: 46363) and Ricardo Montalban (ID: 34637) to locations in the hash map, and a collision occurs.]
Collision Resolution
Two options:
 Find another space in the hash map.
 Allow the hash map to store multiple items at each location.

Option 1: Find another space in the hash map.

This is called Open Addressing. If the location is taken, look for another location.
- Remember that it has to be easy to find it again!
Collision Resolution – Open Addressing
Option 1: Linear Probing
Search the hash map locations sequentially starting from where it
should have been put.
e.g. if the key maps to position 2, and that location is full, try
position 3, then position 4, etc.
[Figure: hash map with Robert Culp ("Key maps to here"), Ingrid Bergman, then a free location ("Item added here").]
searching: start with the expected location, then keep trying the next
ones until an empty location is reached or the item is found.
- if it is empty then the search failed.
Collision Resolution – Open Addressing
Option 1: Linear Probing
deleting: start at the expected location and keep trying the next
ones until the item is found, then delete it.
[Figure: deleting Peter Falk. Before: Robert Culp ("Key maps to here"), Ingrid Bergman, Peter Falk ("Item found here"), Ricardo Montalban. After: Robert Culp, Ingrid Bergman, an empty location, Ricardo Montalban.]

Collision Resolution – Open Addressing
Option 1: Linear Probing
Now suppose that Stephanie Zimbalist was supposed to be in
position 2. How would a search for Stephanie’s key work?
[Figure: hash map holding Robert Culp, Ingrid Bergman, an empty location where Peter Falk was deleted, Ricardo Montalban and Stephanie Zimbalist. Labels: "Key maps to here", "Search stops here!".]
Solution: have states full, empty, deleted
– search keeps going if it reaches a deleted slot
– insert uses empty or deleted slots
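The full / empty / deleted idea can be sketched in Java as below. This is only one possible layout, assuming int keys, String items, a fixed-size map and a simple mod hash function; the class and method names are hypothetical.

```java
import java.util.Arrays;

// Illustrative sketch of linear probing with full / empty / deleted states.
// Assumes int keys, String items and a fixed-size map (no resizing).
final class LinearProbingMap {
    private enum State { EMPTY, FULL, DELETED }

    private final State[] states;
    private final int[] keys;
    private final String[] items;

    LinearProbingMap(int size) {
        states = new State[size];
        Arrays.fill(states, State.EMPTY);
        keys = new int[size];
        items = new String[size];
    }

    private int findAddress(int key) {
        return Math.floorMod(key, states.length);
    }

    // Returns the slot holding key, or -1. The search keeps going past
    // DELETED slots and only stops early at an EMPTY slot.
    private int locate(int key) {
        int i = findAddress(key);
        for (int probes = 0; probes < states.length; probes++) {
            if (states[i] == State.EMPTY) {
                return -1;
            }
            if (states[i] == State.FULL && keys[i] == key) {
                return i;
            }
            i = (i + 1) % states.length;
        }
        return -1;
    }

    // Insert uses the first EMPTY or DELETED slot; returns false if the map is full.
    boolean insert(int key, String item) {
        int existing = locate(key);
        if (existing >= 0) {
            items[existing] = item; // key already present: replace its item
            return true;
        }
        int i = findAddress(key);
        for (int probes = 0; probes < states.length; probes++) {
            if (states[i] != State.FULL) {
                states[i] = State.FULL;
                keys[i] = key;
                items[i] = item;
                return true;
            }
            i = (i + 1) % states.length;
        }
        return false;
    }

    String find(int key) {
        int i = locate(key);
        return i < 0 ? null : items[i];
    }

    // Delete marks the slot DELETED rather than EMPTY so later searches continue.
    boolean delete(int key) {
        int i = locate(key);
        if (i < 0) {
            return false;
        }
        states[i] = State.DELETED;
        items[i] = null;
        return true;
    }
}
```

With a map of size 7, the keys 2, 9 and 16 all map to position 2. After inserting all three and deleting key 9, a search for key 16 still succeeds because it passes over the deleted slot instead of stopping there.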
Collision Resolution
Linear probing can lead to clustering – lots of items next to each other.
– This keeps getting worse (large clusters get larger).
– It decreases performance as each search requires searching through
the clusters.
– called primary clustering.

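Quadratic probing avoids primary clustering.

Instead of trying the next space, try location+1^2 then location+2^2 then location+3^2 etc., where location is the position the key hashes to.
– can lead to secondary clustering – when two items hash to the same
location they follow the same sequence of probes. This is a much smaller problem than primary clustering.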
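As a rough illustration, the sequence of locations tried by quadratic probing can be computed as below. The sketch assumes the offsets 1, 4, 9, ... are added to the location the key hashes to and wrapped around with mod; the class and method names are hypothetical.

```java
// Illustrative sketch: the locations tried by quadratic probing.
final class QuadraticProbe {

    // Returns the first `count` locations tried for a key whose hash location is `home`.
    static int[] sequence(int home, int size, int count) {
        int[] slots = new int[count];
        for (int i = 0; i < count; i++) {
            // Offsets are 0, 1, 4, 9, ... from the home location, wrapped around.
            slots[i] = (home + i * i) % size;
        }
        return slots;
    }
}
```

For a home location of 2 in a map of size 11, `QuadraticProbe.sequence(2, 11, 4)` gives 2, 3, 6, 0. Any other key that hashes to 2 follows exactly the same sequence, which is the secondary clustering mentioned above.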
Collision Resolution
Double hashing – Use one hash function for finding the initial location
and then another hash function to find the step size for finding the next
space.

e.g. hash function 1 is key mod 11 and hash function 2 is 7 – (key mod 7).
Key = 58 - therefore it maps to location 3 and if that is full, try steps of 5,
i.e. 3, 8, 2 (wrapped around), 7, etc.
Key = 14 - therefore it maps to location 3 and if that is full, try steps of 7,
i.e. 3, 10, 6, 2 (wrapped around), 9, etc.
So both keys map initially to the same place but then different places.
If more than one hash function is used it is called rehashing.
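The worked example above can be checked with a short Java sketch. It hard-codes the two hash functions from the slide (key mod 11 and 7 – (key mod 7)) and assumes non-negative keys; the class and method names are illustrative.

```java
// Illustrative sketch of the double hashing example: map size 11,
// hash function 1 = key mod 11, hash function 2 = 7 - (key mod 7).
final class DoubleHashingProbe {
    static final int SIZE = 11;

    static int h1(int key) {
        return key % SIZE;
    }

    static int h2(int key) {
        return 7 - (key % 7);
    }

    // Returns the first `count` locations tried for the given key.
    static int[] sequence(int key, int count) {
        int[] slots = new int[count];
        int slot = h1(key);
        int step = h2(key);
        for (int i = 0; i < count; i++) {
            slots[i] = slot;
            slot = (slot + step) % SIZE; // wrap around the end of the map
        }
        return slots;
    }
}
```

`DoubleHashingProbe.sequence(58, 4)` gives 3, 8, 2, 7 and `DoubleHashingProbe.sequence(14, 5)` gives 3, 10, 6, 2, 9, matching the two sequences above. Note that the second hash function here never returns 0, so the probe always moves to a different location.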
Collision Resolution
Option 2: Allowing several items to be stored at each location.

Buckets - Each location is an array.

[Figure: hash map containing Anthony; the keys of Lisa and Zach both map to the same location, which stores them in an array in the order Lisa, Zach.]

Problem: Could lead to wasted space if the arrays are too large,
or collisions again if the arrays are too small.
Collision Resolution
Option 2: Allowing several items to be stored at each location.
Separate Chaining - Each location is a linked list.

[Figure: hash map containing Anthony; the keys of Lisa and Zach both map to the same location, which holds a linked list in the order Zach, Lisa.]
Note the different order of Zach and Lisa.
Now only the required amount of space is used.
Collision Resolution – Separate Chaining
Implementation of Separate Chaining:
public class Node{
    private Object key;
    private Object data;
    private Node next;

    public Node(Object key, Object data, Node next){
        this.key = key;
        this.data = data;
        this.next = next;
    }
    // get and set methods.
}
Collision Resolution – Separate Chaining
Implementation of Separate Chaining:
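void insert(Object key, Object item){
    // A linked list can always grow, so no isFull() check is needed.
    int index = hashFunction(key);
    // The new node goes at the front of the list and points to the old first node.
    Node n = new Node(key, item, map[index]);
    map[index] = n;
}
[Figure: the new node n (Zach) is linked in front of Lisa, and map[index] now points to n.]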

Collision Resolution – Separate Chaining
Implementation of Separate Chaining:
Object find(Object key){
    int index = hashFunction(key);
    Node n = map[index];
    // Walk along the list until the key is found or the list ends.
    while ((n != null) && (!n.getKey().equals(key))){
        n = n.getNext();
    }
    if (n != null){
        return n.getData();
    }
    return null;
}
[Figure: n moves along the list Zach, Lisa.]
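The slides show insert and find for separate chaining but not delete. One possible way to write it is sketched below as a small self-contained class. The hash function (hashCode() mod the map size), the nested Node layout and the class name are assumptions made for this example, not the lecture's implementation.

```java
// Illustrative sketch of separate chaining with insert, find and delete.
// The hash function and class layout are assumptions for this example.
final class ChainedMap {
    private static final class Node {
        final Object key;
        final Object data;
        Node next;

        Node(Object key, Object data, Node next) {
            this.key = key;
            this.data = data;
            this.next = next;
        }
    }

    private final Node[] map;

    ChainedMap(int size) {
        map = new Node[size];
    }

    private int hashFunction(Object key) {
        return Math.floorMod(key.hashCode(), map.length);
    }

    // Adds the new node at the start of the list, as in the slides.
    void insert(Object key, Object item) {
        int index = hashFunction(key);
        map[index] = new Node(key, item, map[index]);
    }

    Object find(Object key) {
        Node n = map[hashFunction(key)];
        while (n != null && !n.key.equals(key)) {
            n = n.next;
        }
        return n == null ? null : n.data;
    }

    // Unlinks the first node with the given key; returns false if it is absent.
    boolean delete(Object key) {
        int index = hashFunction(key);
        Node prev = null;
        Node n = map[index];
        while (n != null && !n.key.equals(key)) {
            prev = n;
            n = n.next;
        }
        if (n == null) {
            return false;
        }
        if (prev == null) {
            map[index] = n.next; // the node was first in the list
        } else {
            prev.next = n.next;  // bypass the node
        }
        return true;
    }
}
```

Delete has to remember the previous node so that the list can be joined back together around the removed one. Creating the map with size 1 forces every key into the same list, which is a simple way to try out collisions.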
Hashing Efficiency
The efficiency of a hash map depends on its load factor:
load factor = no. of items in the map / size of map
As the load factor increases, the number of collisions increases, so the
performance decreases.
Select a hash map size so that the load factor is less than 2/3 for the
estimated largest number of items.
Probing methods:
Unsuccessful searches take more time than successful searches.
Smaller hash maps can be used for quadratic probing/ double hashing
than for linear probing.
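As a small illustration of the sizing advice, the sketch below computes the load factor and picks a map size for an estimated number of items. It combines the two suggestions from these slides (a load factor below 2/3 and a prime size); combining them this way, and the class and method names, are assumptions made for this example.

```java
// Illustrative sketch: load factor and a possible way to choose the map size.
final class TableSizing {

    static double loadFactor(int items, int size) {
        return (double) items / size;
    }

    static boolean isPrime(int n) {
        if (n < 2) {
            return false;
        }
        for (int d = 2; (long) d * d <= n; d++) {
            if (n % d == 0) {
                return false;
            }
        }
        return true;
    }

    // Smallest prime size for which expectedItems / size is below 2/3.
    static int chooseSize(int expectedItems) {
        int size = expectedItems * 3 / 2 + 1; // first size with load factor < 2/3
        while (!isPrime(size)) {
            size++;
        }
        return size;
    }
}
```

For an estimated maximum of 100 items, `TableSizing.chooseSize(100)` gives 151, for a load factor of about 0.66.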
Hashing Efficiency
Separate Chaining:

Inserting is O(1) because it always adds to the start of the linked list.
Searching and deleting are O(n) where n is the size of the linked list, so
shorter lists are better.
Load factor = no. of items / no. of linked lists
The load factor is the average length of the linked lists.
In the worst case, all the items hash to the same location, so that linked
list contains all the items.
Java Hashmap
Map<Integer, String> map = new HashMap<>();
Adding to the map: (using some int id and String name)
map.put(id, name);
Getting an item from the map: (using some int id)
String str = map.get(id);
Also: containsKey; size; clear
Note: All Java objects have a hash code (the hashCode() method).
Iterating through a map:
// needs java.util.Map, java.util.HashMap, java.util.Set and java.util.Iterator
Set<Map.Entry<Integer, String>> s = map.entrySet();
Iterator<Map.Entry<Integer, String>> it = s.iterator();
while (it.hasNext()){
    Map.Entry<Integer, String> currentEntry = it.next();
    int currentKey = currentEntry.getKey();
    String currentItem = currentEntry.getValue();
}
Java Hashmap
Example: A student grade system.
Map<Integer, Integer> grades = new HashMap<>();

public void setGrade(int studentID, int grade){
    grades.put(studentID, grade);
}

public int getGrade(int studentID){
    // Throws a NullPointerException if no grade is stored for studentID.
    return grades.get(studentID);
}
Java Hashmap
public void printAllGrades(){
    // Iterate over the entries of the grades map declared above.
    Iterator<Map.Entry<Integer, Integer>> it = grades.entrySet().iterator();
    while (it.hasNext()){
        Map.Entry<Integer, Integer> currentEntry = it.next();
        int student = currentEntry.getKey();
        int grade = currentEntry.getValue();
        System.out.println("Student: " + student + " Grade: " + grade);
    }
}
Java Hashmap
Example: A student record system.
Map<Integer, Student> studentInfo = new HashMap<>();

public void storeInfo(int studentID, String name, String address){
    Student s = new Student(name, address);
    studentInfo.put(studentID, s);
}

// Returns the student's name, so the return type is String rather than int.
public String getName(int studentID){
    Student s = studentInfo.get(studentID);
    return s.getName();
}
