0% found this document useful (0 votes)

13 views29 pages

Week 3

Uploaded by

hiraazhar2030

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

13 views29 pages

Week 3

Uploaded by

hiraazhar2030

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 29

Data Structure & Algorithms

Hashed List
Searches
Outline

• What is hashing
• What are hashed list searches
• Hashing methods
• Direct
• Subtraction
• Modulo division
• Digit extraction
• Mid square
• Folding
• Rotation
• Pseudorandom
• Collision Resolution
• Collision Resolution Methods
Hashed List Searches

• List searches require a lot of tests before the data is found

• Ideal case: We know where our data is and we may access it directly

• Goal of Hashed Search: Find data in one test

What is Hashing?

• Generating address from a key

• Key is a record’s property on which a record is stored in memory (called
prime area)
• A hash search is a search in which the key is modified (using an
algorithm) into a memory address (called home address)
• The conversion process of converting a key to a memory address is
called Hashing
Hashing Terms

• Usually there are more keys than can be accommodated in memory

• More than one keys that hash to the same location are called synonyms
• A hash collision occurs when an key is inserted into an occupied memory
location
• Calculation of a memory address and its test for collision is called a probe
Concept of Hashing
Hash Collision
Hashing Methods
Direct Method

• The key is the address without any algorithmic modification (it starts from 0 or
1 and increments by 1)
• Example:
class Sales {
public:
Sales(const int day=0,
const int amount=0):day(day),amount(amount){}
private:
int day, amount;
};
Sale sale(0,100);
Sale dailySales[10];
dailySales[sale.day] += sale.amount;
Subtraction Method

• If keys do not start from 0 or 1, you get an index by subtracting a constant

value from the key

class Employee {
public:
Employee():id(count++) {}
const int getID() const { return id; }
void setName(const std::string& n) { name = n; }

private:
int id;
string name;
static int count;
};

int Employee::count = 1000;

Subtraction Example

Employee employees[100];
Employee e;
employees[e.getID()-1000].setName(“Yasir”);
Modulo Division Method

• Also known as division remainder or modulo-division method

• Divides the key by list/array size and uses the remainder as index

• Works with any size but prime numbers give less collisions
Modulo Division Example
Modulo Division Example

• Assume array size of 307

• Lets consider the following keys 121267,045128 and 379452

Input Hash Calculation Index/Address

121267 121267 % 307 2

045128 045128 % 307 306

379452 379452 % 307 0

Digit Extraction Method

• Extract certain digits and then use the obtained number as address
• Assuming that we have an array size of 1000 (indices from 0 to 999) we
can use three digits
• Example: (extracting 1st , 3rd and 4th digit)
• 145674 => 156
• 344565 => 345
• 132554 => 125
Midsquare Method

• The key or a portion of it is squared and the address is selected from the
mid of the squared number
• Example:
• Assuming we have a three digit key and three digit address

Input Square Index/Address

121 14641 464

365 133225 322

749 561001 100

Folding Method
• Two methods
• Fold shift
• Fold boundary
• Fold shift
• Key is divided into parts whose size matches the required
address size
• The left and right parts are shifted and added to the middle part
• Fold boundary
• The key is divided into parts as in fold shift
• The digits of left and right parts are reversed and then added to
the middle part
• In both folding methods, overflowing digits are discarded
Folding Example
Rotation Method

• Generally used in conjunction with other methods

• Useful when keys are assigned serially as in employee ids and part
numbers etc.
• Such numbers usually end up with identical numbers that differ by 1 digit
only
• When these numbers are used in hashing, they tend to hash to the same
address
• To solve this, the last digits are rotated to the front of the key
• Usually performed by bitwise shift (<<, >>) and bitwise and (&) operator
Rotation Example
Collision Resolution

• Apart from direct and subtraction methods, all other hashing methods
suffer from collisions
• Some hashed list terms:
𝑘
• Load factor (): Percentage of the number of elements in list divided by the total list
size ∝= ∗ 100
𝑛

• Clustering: Clustering is a build up of data unevenly across the hashed list due to
collision resolution.
• Primary Clustering: When clustering is at home address
• Secondary Clustering: When clustering is at collision path
Collision Resolution Methods
Open Addressing

• Resolves collision in the prime area (where all home address are stored)
• When collision occurs, prime area is searched for empty cells
• Basic two types
• Linear probe
• Quadratic probe
Linear Probe

• In case of collision, add 1 to the current address/index

• If the new address is occupied add 1 again to find an empty address
• An alternate scheme adds 1, if collision is found, it subtracts 2, if another
collision is found, it adds 3 and so on
• Generated address must be within the address range
• Advantages
• Simple to implement
• Data remains near home address
• Disadvantages
• Create primary clusters
• Make search algorithm complex due to addition and deletion
Quadratic Probe

• Instead of adding 1, we add the collision probe number squared

• For 1 collision probe, add 12=1, for 2 collision probe, add 22=4, for 3 collision
probe, add 32=9 and so on
• Continued until we find an empty address or we don’t have enough elements
• Advantages
• Easy to implement
• Avoids primary and secondary clustering
• Disadvantages
• More time required for squaring the probe number which can be avoided by just
doing a scalar multiplication
• It is not possible to generate a new address for every element
Linked List Resolution

• General disadvantage of open addressing is that each collision resolution

increases chances of future collision
• In linked list resolution, a separate area (called overflow area) is used for
collisions
• All synonyms are chained into a linked list and are then attached to the prime area
using a pointer
• In case of collision, one element is stored in prime area and it is chained to its
corresponding linked list in the overflow area through a linked list pointer
Bucket Hashing (overview)

• Buckets can contain multiple data occurrences

• Advantages
• Can store multiple data items during collision
• Disadvantages
• Takes significantly more space as a lot of buckets remain empty or partially empty
• The memory is fragmented considerably due to partial empty buckets
• It cannot completely resolve collisions

DSA - Unit 1
No ratings yet
DSA - Unit 1
43 pages
11 Hash Tables Slides
No ratings yet
11 Hash Tables Slides
34 pages
File Organization
No ratings yet
File Organization
49 pages
08 Hashing
No ratings yet
08 Hashing
26 pages
Ds 17hashing
No ratings yet
Ds 17hashing
27 pages
Suresh
No ratings yet
Suresh
15 pages
Hashing
No ratings yet
Hashing
44 pages
Hashing
No ratings yet
Hashing
25 pages
Lecture 27 - Hashing
No ratings yet
Lecture 27 - Hashing
48 pages
Chapter 11 Hashing
No ratings yet
Chapter 11 Hashing
42 pages
Hashing
No ratings yet
Hashing
30 pages
Lab 09 - Hashing
No ratings yet
Lab 09 - Hashing
47 pages
Dsa Day 7 - Search, Hash, Store
100% (2)
Dsa Day 7 - Search, Hash, Store
31 pages
Hashing (DASTAL)
No ratings yet
Hashing (DASTAL)
27 pages
Unit 1 Hashing
No ratings yet
Unit 1 Hashing
61 pages
Unit 4 Hashing
No ratings yet
Unit 4 Hashing
35 pages
CH 4
No ratings yet
CH 4
58 pages
2,2 Hashing
No ratings yet
2,2 Hashing
30 pages
Unit 5
No ratings yet
Unit 5
50 pages
What Is Hashing
No ratings yet
What Is Hashing
11 pages
Module-4 Dictionaries and Hash Tables
No ratings yet
Module-4 Dictionaries and Hash Tables
31 pages
Unit - 7 Searching and Hasing (CSIT)
No ratings yet
Unit - 7 Searching and Hasing (CSIT)
14 pages
AW47 SIL ISERV DOC1700134 r4
No ratings yet
AW47 SIL ISERV DOC1700134 r4
27 pages
Module-6 Searching Techniques
No ratings yet
Module-6 Searching Techniques
44 pages
UNIT V - Hashing
No ratings yet
UNIT V - Hashing
20 pages
Final Hashing
No ratings yet
Final Hashing
41 pages
Hashing
No ratings yet
Hashing
20 pages
MODULE 5 - BCS304 - HASHING - Leftisht Trees - OBST - Notes
No ratings yet
MODULE 5 - BCS304 - HASHING - Leftisht Trees - OBST - Notes
32 pages
Hash Tables
No ratings yet
Hash Tables
37 pages
Cse373 10 Hashing
No ratings yet
Cse373 10 Hashing
36 pages
Module 5: HASHING: Functions. The Values Are Then Stored in A Data Structure Called Hash Table
No ratings yet
Module 5: HASHING: Functions. The Values Are Then Stored in A Data Structure Called Hash Table
39 pages
Hashing in Data Structure
No ratings yet
Hashing in Data Structure
43 pages
Maps
No ratings yet
Maps
36 pages
Algo Cha 8
No ratings yet
Algo Cha 8
20 pages
Hashing PDF
No ratings yet
Hashing PDF
65 pages
Ads-Unit I
No ratings yet
Ads-Unit I
16 pages
ADI Hashing
No ratings yet
ADI Hashing
47 pages
Collision
No ratings yet
Collision
24 pages
CSD203 Hashing
No ratings yet
CSD203 Hashing
32 pages
3 Hashing
No ratings yet
3 Hashing
20 pages
DSA Chapter 08 (Searching)
No ratings yet
DSA Chapter 08 (Searching)
65 pages
Hashing PDF
No ratings yet
Hashing PDF
56 pages
Hashing Techniques
No ratings yet
Hashing Techniques
13 pages
Theory PDF
No ratings yet
Theory PDF
18 pages
Study Material On Hashing
No ratings yet
Study Material On Hashing
4 pages
Group 15 Hash Tables
No ratings yet
Group 15 Hash Tables
42 pages
Task 2 - Hashing and Linear Probing
No ratings yet
Task 2 - Hashing and Linear Probing
16 pages
Topic 1: Hashing - Introduction: Hashing Is A Method of Storing and Retrieving Data From A Database Efficiently
No ratings yet
Topic 1: Hashing - Introduction: Hashing Is A Method of Storing and Retrieving Data From A Database Efficiently
31 pages
Manual Alumno QRQC v2
100% (1)
Manual Alumno QRQC v2
35 pages
DSA MK Lect2 PDF
No ratings yet
DSA MK Lect2 PDF
92 pages
Microelectronic Circuit Design 4th Edition Jaeger Solutions Manual Instant Download
100% (1)
Microelectronic Circuit Design 4th Edition Jaeger Solutions Manual Instant Download
44 pages
Gynecologists Who Will Perform A Tubal Sterilization - United States
No ratings yet
Gynecologists Who Will Perform A Tubal Sterilization - United States
75 pages
Hashing
No ratings yet
Hashing
34 pages
Hashing Slide
No ratings yet
Hashing Slide
16 pages
Handout 9 - Hashing
No ratings yet
Handout 9 - Hashing
11 pages
Implementation Priority Queue Using Array
No ratings yet
Implementation Priority Queue Using Array
3 pages
Hashing ClassNotes
No ratings yet
Hashing ClassNotes
8 pages
Hashing PPT For Student
No ratings yet
Hashing PPT For Student
53 pages
Lecture 14 Hashing
No ratings yet
Lecture 14 Hashing
44 pages
Hashing
No ratings yet
Hashing
37 pages
Administrative Theory
No ratings yet
Administrative Theory
584 pages
Lab 2
No ratings yet
Lab 2
10 pages
DBQ - First American Industrial Revolution PDF
50% (2)
DBQ - First American Industrial Revolution PDF
13 pages
Triage
67% (3)
Triage
40 pages
Cruz2007 Chapter3 SM Final
50% (4)
Cruz2007 Chapter3 SM Final
20 pages
Investigating AI-Integrated Instruction in Improving Academic Performance of Senior High School Students in The Philippines
No ratings yet
Investigating AI-Integrated Instruction in Improving Academic Performance of Senior High School Students in The Philippines
7 pages
Abnormal Psychology Summary (Chapter 3 - 4)
No ratings yet
Abnormal Psychology Summary (Chapter 3 - 4)
15 pages
Principle of Acc ch-3 IS
No ratings yet
Principle of Acc ch-3 IS
6 pages
BN 311
No ratings yet
BN 311
33 pages
Four Acid To Aqua Regia
No ratings yet
Four Acid To Aqua Regia
2 pages
Juvelline Deligency tr1
No ratings yet
Juvelline Deligency tr1
9 pages
Scribd Download - Com Computerized Student Information System
0% (1)
Scribd Download - Com Computerized Student Information System
72 pages
Rillettes of Finnan Haddie With Oyster and Caviar: Ingredients: Method
No ratings yet
Rillettes of Finnan Haddie With Oyster and Caviar: Ingredients: Method
6 pages
Methods of Interrogation
No ratings yet
Methods of Interrogation
16 pages
Sathyan Resume
No ratings yet
Sathyan Resume
3 pages
ANSIB16 5Class150Slip-OnFlanges
No ratings yet
ANSIB16 5Class150Slip-OnFlanges
2 pages
Test 2-Review Unit 123
No ratings yet
Test 2-Review Unit 123
3 pages
Canon Underswap Research (Summary)
No ratings yet
Canon Underswap Research (Summary)
7 pages
Administrative Behavior in Education
No ratings yet
Administrative Behavior in Education
2 pages
Rahmat Bano vs. CPIO ITO Central Information Commission New Delhi
No ratings yet
Rahmat Bano vs. CPIO ITO Central Information Commission New Delhi
5 pages
CVENG 423 - Module 4 - Construction Estimates and Values Engineering
No ratings yet
CVENG 423 - Module 4 - Construction Estimates and Values Engineering
7 pages
Spelling 2021 - Lista de Palabras
No ratings yet
Spelling 2021 - Lista de Palabras
7 pages
Waste Collection Best Practivces - Musical Garbage Trucks in Taiwan China
No ratings yet
Waste Collection Best Practivces - Musical Garbage Trucks in Taiwan China
1 page
Tim's Pronunciation Workshop Plosives: BBC Learning English
No ratings yet
Tim's Pronunciation Workshop Plosives: BBC Learning English
2 pages
SAFETY DATA SHEET (Date of Issue 7/30/2014)
No ratings yet
SAFETY DATA SHEET (Date of Issue 7/30/2014)
2 pages
Technical Rider - Sound and Light Requirements Alcatraz: PA-System
No ratings yet
Technical Rider - Sound and Light Requirements Alcatraz: PA-System
3 pages
Jonathon Engels Design How To: 20 Garden Hacks For The Quirky and Pragmatic Permaculturalist
No ratings yet
Jonathon Engels Design How To: 20 Garden Hacks For The Quirky and Pragmatic Permaculturalist
3 pages
Basic Math Notes
From Everand
Basic Math Notes
Ernest Bywater
5/5 (2)
Master Fundamental Concepts of Math Olympiad: Maths, #1
From Everand
Master Fundamental Concepts of Math Olympiad: Maths, #1
Subbalakshmi Devaki
No ratings yet
The Beginners Math for GRE & GMAT: Maths, #1
From Everand
The Beginners Math for GRE & GMAT: Maths, #1
Subbalakshmi Devaki
No ratings yet

Week 3

Uploaded by

Week 3

Uploaded by

Data Structure & Algorithms

• List searches require a lot of tests before the data is found

• Goal of Hashed Search: Find data in one test

• Generating address from a key

• Usually there are more keys than can be accommodated in memory

• If keys do not start from 0 or 1, you get an index by subtracting a constant

int Employee::count = 1000;

• Also known as division remainder or modulo-division method

• Assume array size of 307

Input Hash Calculation Index/Address

121267 121267 % 307 2

045128 045128 % 307 306

379452 379452 % 307 0

Input Square Index/Address

121 14641 464

365 133225 322

749 561001 100

• Generally used in conjunction with other methods

• In case of collision, add 1 to the current address/index

• Instead of adding 1, we add the collision probe number squared

• General disadvantage of open addressing is that each collision resolution

• Buckets can contain multiple data occurrences

You might also like