Hashing

Uploaded by

Saumya

Hashing

Tushar B. Kute,
http://tusharkute.com
Hashing

• Hashing is the process of transforming a given key
or string of characters into another value.
• The result is usually a shorter, fixed-length
value or key that represents the original string and
makes it easier to find or use.
Hashing

• The most popular use for hashing is the
implementation of hash tables.
• A hash table stores key and value pairs in a list that is
accessible through its index.
• Because the number of possible keys is unlimited but the
table size is fixed, the hash function maps the keys onto
the table size.
• A hash value then becomes the index for a specific
element.
Hashing

• A hash function generates a new value according to a
mathematical hashing algorithm. This value is known as
a hash value or simply a hash.
• Where the original key must not be recoverable from the
hash, as in cryptographic uses, a good hash uses a one-way
hashing algorithm.
• Hashing is relevant to, but not limited to, data
indexing and retrieval, digital signatures,
cybersecurity and cryptography.
Hashing

• Hashing uses functions or algorithms to map object
data to a representative integer value.
• A hash can then be used to narrow down searches
when locating these items on that object data map.
• For example, in hash tables, developers store data
(perhaps a customer record) in the form of key and
value pairs.
• The key identifies the data and serves as the input to
the hashing function, while the resulting hash code
(an integer) is then mapped to a range of fixed size.
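
The key-to-index flow described above can be sketched as follows. This is only an illustration: `string_hash`, `TABLE_SIZE` and the customer record are made-up names and data, not part of the original slides, and real hash tables typically use more carefully designed hash functions.

```python
def string_hash(key: str) -> int:
    """Hypothetical hash function: fold the characters into one integer."""
    h = 0
    for ch in key:
        h = h * 31 + ord(ch)  # 31 is an arbitrary small multiplier
    return h


TABLE_SIZE = 8  # assumed fixed table size
table = [None] * TABLE_SIZE

# Made-up customer record stored as a key and value pair.
key, value = "C1001", {"name": "Asha", "city": "Pune"}

# The hash code can be any non-negative integer; the modulo maps it
# into the fixed range 0 .. TABLE_SIZE - 1.
index = string_hash(key) % TABLE_SIZE
table[index] = (key, value)

print(index, table[index])
```

The same key always produces the same index, which is what lets a later lookup go straight to the right slot.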
Hashing: Applications

• Databases: Hashing can be used to index data in a database,
which makes it faster to search for and retrieve data.
• Cryptography: Hashing can be used to create digital
signatures and other cryptographic primitives.
• Caching: Hashing can be used to implement caches, which are
data structures that store frequently accessed data in
memory for faster access.
• Bloom filters: Hashing can be used to implement Bloom
filters, which are a space-efficient probabilistic data structure
used to test whether an element is a member of a set.
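
As a rough illustration of the Bloom filter application, here is a minimal sketch. The class name, the bit-array size, the number of hash functions and the trick of salting SHA-256 with a counter are all assumptions chosen for brevity, not something the slides prescribe.

```python
import hashlib


class BloomFilter:
    """Minimal Bloom filter sketch (sizes are illustrative assumptions)."""

    def __init__(self, num_bits: int = 64, num_hashes: int = 3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = [False] * num_bits

    def _positions(self, item: str):
        # Derive several bit positions by salting one hash with a counter.
        for i in range(self.num_hashes):
            digest = hashlib.sha256(f"{i}:{item}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, item: str) -> None:
        for pos in self._positions(item):
            self.bits[pos] = True

    def might_contain(self, item: str) -> bool:
        # False means "definitely absent"; True means "possibly present".
        return all(self.bits[pos] for pos in self._positions(item))


bf = BloomFilter()
bf.add("apple")
print(bf.might_contain("apple"))
```

An item that was added is always reported as possibly present; an item that was never added is usually, but not always, reported as absent. That is the probabilistic trade-off mentioned above.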
Hash Table

• A hash table is a data structure that maps keys to
values. In the chained form described here, it is
implemented using an array, where each element of the
array is a linked list of key-value pairs.
• The keys are hashed into a bucket index within the
fixed-size array using a hash function.
• To insert a new key-value pair into the hash table, the
hash function is used to generate a bucket index.
• The new key-value pair is then added to the linked
list at the corresponding bucket index.
Hash Table

• To search for a value in the hash table, the hash function
is used to generate a bucket index.
• The linked list at the corresponding bucket index is then
searched for the key. If the key is found, the
corresponding value is returned. Otherwise, null is
returned.
• Hash tables are a very efficient way to store and retrieve
data, especially when the keys are evenly distributed.
• However, if the keys are not evenly distributed, the hash
table can become inefficient.
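
The insert and search steps above can be sketched as a small class. This is a simplified illustration: `ChainedHashTable` is a hypothetical name, Python lists stand in for the linked lists, and Python's built-in `hash()` is assumed as the hash function.

```python
class ChainedHashTable:
    """Hash table sketch with one chain (list) per bucket."""

    def __init__(self, size: int = 8):
        # Each bucket holds a list of (key, value) pairs.
        self.buckets = [[] for _ in range(size)]

    def _index(self, key) -> int:
        # Map the hash code to a bucket index.
        return hash(key) % len(self.buckets)

    def put(self, key, value) -> None:
        bucket = self.buckets[self._index(key)]
        for i, (k, _) in enumerate(bucket):
            if k == key:
                bucket[i] = (key, value)  # overwrite an existing key
                return
        bucket.append((key, value))

    def get(self, key):
        # Search only the chain at the key's bucket index.
        for k, v in self.buckets[self._index(key)]:
            if k == key:
                return v
        return None  # key not found


t = ChainedHashTable(size=4)
t.put(1, "one")
t.put(5, "five")  # 1 and 5 share a bucket when size is 4
print(t.get(1), t.get(5), t.get(9))
```

When keys spread evenly, each chain stays short and a lookup touches only a few pairs; when many keys land in one bucket, the lookup degrades towards scanning a single long list.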
Hash Table

• Here are some examples of how hash tables are used:

– In a database, a hash table can be used to index the
data, which makes it faster to search for and
retrieve data.
– In a web browser, a hash table can be used to cache
frequently accessed web pages.
– In a compiler, a hash table can be used to store the
symbols in the program.
– In a programming language, a hash table can be
used to implement a dictionary or associative array.
Types of Hash Functions

• The primary types of hash functions are:

– Division Method.
– Mid Square Method.
– Folding Method.
– Multiplication Method.
Types of Hash Functions

• Division Method
– The easiest and quickest way to create a hash
value is through division. The key value k is divided
by M in this hash function, and the remainder is used.
• Formula:
h(k) = k mod M
• (where k = key value and M = the size of the hash
table)
Types of Hash Functions

• Advantages:
– This method works for any value of M.
– The division method requires only one
operation, so it is quite fast.
• Disadvantages:
– Because consecutive keys map to consecutive hash
values, this method can result in poor
performance.
– The value of M sometimes has to be chosen with
extra care.
Types of Hash Functions

• Example:
k = 1987
M = 13
h(1987) = 1987 mod 13
h(1987) = 11
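
A minimal sketch of the division method, useful for checking the arithmetic by machine. The function name `division_hash` is an assumption made for this illustration.

```python
def division_hash(k: int, m: int) -> int:
    """Division method: the remainder of k divided by the table size m."""
    return k % m


# The key and table size from the example above.
print(division_hash(1987, 13))

# Consecutive keys map to consecutive slots (wrapping around at m).
print([division_hash(k, 13) for k in range(1987, 1991)])
```

The second line of output illustrates the disadvantage noted earlier: runs of consecutive keys produce runs of consecutive indices.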
Types of Hash Functions

• Mid Square Method

• The following steps are required to calculate this
hash method:
– Square the value of k, that is, compute k x k.
– Take the middle r digits of the result as the hash
value.
• Formula:
h(k) = middle r digits of (k x k)
(where k = key value and r = number of digits kept)
Types of Hash Functions

• Advantages:
– This technique works well because most or all of the
digits in the key value affect the result: all of them
contribute to the middle digits of the squared result.
– The result is not dominated by the top or bottom digits of
the initial key value.
• Disadvantages:
– The size of the key is one of the limitations of this method;
if the key is large, its square will contain about twice as many
digits.
– Collisions are likely to occur repeatedly.
Types of Hash Functions

• Example:
k = 60
Therefore,
k x k = 60 x 60
k x k = 3600
Thus, taking the middle r = 2 digits of 3600,
h(60) = 60
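
The mid square steps can be sketched as below. How to pick the "middle" when the digits cannot be centred exactly is not specified in the slides; this sketch assumes the window is shifted one digit to the left in that case, and `mid_square_hash` is a hypothetical name.

```python
def mid_square_hash(k: int, r: int) -> int:
    """Mid square method: square k and keep the middle r digits."""
    squared = str(k * k)
    if r >= len(squared):
        return k * k  # not enough digits to cut a middle window
    # Assumption: when the window cannot be centred exactly,
    # it sits one digit left of centre.
    start = (len(squared) - r) // 2
    return int(squared[start:start + r])


print(mid_square_hash(60, 2))  # 60 x 60 = 3600, middle two digits
```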
Types of Hash Functions

• Folding Method
• The process involves two steps:
– Divide the key value k into a predetermined
number of parts k1, k2, k3, ..., kn, each with the
same number of digits, except possibly the last
part, which may have fewer digits than the
others.
– Add the parts together. The hash value is the
sum, without taking into account the final
carry, if any.
Types of Hash Functions

• Formula:
k = k1, k2, k3, k4, ..., kn
s = k1 + k2 + k3 + k4 + ... + kn
h(k) = s
(where s = sum of the parts of key k)
Types of Hash Functions

• Advantages:
– Creates a simple hash value by splitting
the key value into equal-sized segments.
– Works independently of how keys are distributed
in the hash table.
• Disadvantages:
– Efficiency can suffer when there are too many
collisions.
Types of Hash Functions

• Example:
k = 678912
k1 = 67; k2 = 89; k3 = 12
Therefore,
s = k1 + k2 + k3
s = 67 + 89 + 12
s = 168
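
A minimal sketch of the folding method follows. The function name `folding_hash` and the `drop_carry` switch are assumptions for this illustration: the formula uses the plain sum s, while the description mentions ignoring the final carry, so both behaviours are shown.

```python
def folding_hash(k: int, part_len: int = 2, drop_carry: bool = False) -> int:
    """Folding method: split the digits of k into parts and add them."""
    digits = str(k)
    # The last part may be shorter than part_len.
    parts = [int(digits[i:i + part_len])
             for i in range(0, len(digits), part_len)]
    s = sum(parts)
    if drop_carry:
        # Assumed reading of "ignore the final carry":
        # keep only the lowest part_len digits of the sum.
        s %= 10 ** part_len
    return s


# A key whose two-digit parts are 67, 89 and 12.
print(folding_hash(678912))
print(folding_hash(678912, drop_carry=True))
```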
Types of Hash Functions

• Multiplication Method
– Choose a constant value A, where 0 < A < 1.
– Multiply the key value k by A.
– Take the fractional part of kA.
– Multiply the result of the preceding step by M,
the hash table's size, and take the floor.
• Formula:
h(k) = floor (M (kA mod 1))
(where M = size of the hash table, k = key value and
A = constant value)
Types of Hash Functions

• Advantages:
– Any value of A between 0 and 1 can be used,
although some values seem to yield better
outcomes than others.
• Disadvantages:
– The multiplication method is mainly appropriate
when the table size is a power of two, since that
is the case in which the index can be computed
quickly from the key.
Types of Hash Functions

• Example:
k = 5678
A = 0.6829
M = 200
Now, calculating the value of h(5678):
h(5678) = floor[200(5678 x 0.6829 mod 1)]
h(5678) = floor[200(3877.5062 mod 1)]
h(5678) = floor[200(0.5062)]
h(5678) = floor[101.24]
h(5678) = 101
So, with these values, h(5678) is 101.
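
The multiplication method can be sketched in a few lines, which also gives a way to check the arithmetic of the example by machine. The function name `multiplication_hash` is an assumption, and floating-point rounding means this is only suitable as an illustration.

```python
import math


def multiplication_hash(k: int, a: float, m: int) -> int:
    """Multiplication method: floor(m * fractional part of k * a)."""
    fractional = (k * a) % 1  # keep only the part after the decimal point
    return math.floor(m * fractional)


# The key, constant and table size from the example above.
print(multiplication_hash(5678, 0.6829, 200))
```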
Hash Collision

• When a hash algorithm generates the same hash value for
two separate input values, this is known as a hash
collision.
• However, it's crucial to note that collisions are not a
defect; rather, they are an inherent part of hashing
algorithms.
• Collisions happen because the hashing methods used in
data structures convert each input into a fixed-length
code regardless of its length.
• Since there are an infinite number of inputs and a
finite number of outputs, any hashing algorithm will
eventually yield repeating hashes.
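
A small sketch makes the collision idea concrete. It assumes the division method with a table size of 13 and a handful of made-up keys, then groups the keys by the index they hash to.

```python
from collections import defaultdict

M = 13  # assumed table size
keys = [1987, 2000, 26, 39, 5]  # made-up sample keys

# Group keys by their hash value under h(k) = k mod M.
buckets = defaultdict(list)
for k in keys:
    buckets[k % M].append(k)

# Any index that received more than one key is a collision.
collisions = {idx: ks for idx, ks in buckets.items() if len(ks) > 1}
print(collisions)
```

With only 13 possible outputs, any set of more than 13 distinct keys is guaranteed to contain at least one collision.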
Types of Hashing in Data Structures

• The two main hashing methods used in a data
structure are:
– Open hashing/separate chaining/closed
addressing
– Open addressing/closed hashing
Types of Hashing in Data Structures

• Open hashing/separate chaining/closed addressing
– Separate chaining is a typical collision handling
technique that links elements with the
same hash using linked lists.
– It is also known as closed addressing and
employs an array of linked lists to handle
hash collisions.
Types of Hashing in Data Structures

• Closed hashing (Open addressing)
– Instead of using linked lists, open addressing
stores each entry in the array itself. The hash value
gives only the starting slot, not necessarily the slot
where the entry ends up.
– To insert, it checks the array beginning from the
hashed index and then searches for an empty slot
by following a probe sequence.
– The probe sequence is the order in which entries
are examined; the gap between subsequent probes
depends on the probing method.
Types of Hashing in Data Structures

• There are three methods for dealing with collisions
in closed hashing:
– Linear Probing
– Quadratic Probing
– Double-Hashing
Linear Probing

• Linear probing inspects the hash table sequentially, starting
from the hashed index. If the requested slot is already
occupied, the next slot is tried. The distance between
probes in linear probing is fixed (often set to a value of
1).
• Formula:
index = key % hashTableSize
• Sequence:
index = (hash(n) % T)
(hash(n) + 1) % T
(hash(n) + 2) % T
(hash(n) + 3) % T … and so on.
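
The probe sequence above can be sketched as an insert routine. It assumes integer keys, `key % size` as the hash function, a step of 1 and `None` as the marker for an empty slot; `linear_probe_insert` is a hypothetical name.

```python
def linear_probe_insert(table: list, key: int) -> int:
    """Insert key using linear probing; return the slot that was used."""
    size = len(table)
    start = key % size  # hashed index
    for i in range(size):
        idx = (start + i) % size  # step of 1, wrapping around
        if table[idx] is None:
            table[idx] = key
            return idx
    raise RuntimeError("hash table is full")


table = [None] * 7
# 10, 17 and 24 all hash to slot 3 when the table size is 7.
print([linear_probe_insert(table, k) for k in (10, 17, 24)])
```

Colliding keys end up in neighbouring slots, which is the clustering behaviour that the other probing methods try to reduce.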
Quadratic Probing

• The only difference between linear and quadratic probing is the
distance between subsequent probes or entry slots. If the hashed
slot is already taken, you keep probing until you find an available
slot for the entry record. The distance from the original hashed
index is given by the successive values of a polynomial, here i x i
for the i-th probe.
• Formula:
index = (hash(n) + i x i) % hashTableSize, for i = 0, 1, 2, ...
• Sequence:
index = ( hash(n) % T)
(hash(n) + 1 x 1) % T
(hash(n) + 2 x 2) % T
(hash(n) + 3 x 3) % T … and so on
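
A comparable sketch for quadratic probing, under the same assumptions as before (integer keys, `key % size` as the hash function, `None` for an empty slot). The name `quadratic_probe_insert` is hypothetical.

```python
def quadratic_probe_insert(table: list, key: int) -> int:
    """Insert key using quadratic probing; return the slot that was used."""
    size = len(table)
    start = key % size  # hashed index
    for i in range(size):
        idx = (start + i * i) % size  # offsets 0, 1, 4, 9, ...
        if table[idx] is None:
            table[idx] = key
            return idx
    # Quadratic probing can miss free slots, so this is not
    # the same as the table being completely full.
    raise RuntimeError("no free slot found along the probe sequence")


table = [None] * 7
# 10, 17 and 24 all hash to slot 3 when the table size is 7.
print([quadratic_probe_insert(table, k) for k in (10, 17, 24)])
```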
Double Hashing

• The distance between probes is determined by a second
hash function. Double hashing is an optimized technique
for decreasing clustering. The increments for the probing
sequence are computed using this extra hash function.
• Formula:
(firstHash(key) + i * secondHash(key)) % size of the table
• Sequence:
index = hash(x) % S
(hash(x) + 1*hash2(x)) % S
(hash(x) + 2*hash2(x)) % S
(hash(x) + 3*hash2(x)) % S … and so on
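
Double hashing can be sketched in the same style. The second hash function used here, `1 + (key % (size - 1))`, is an assumed choice that never returns 0; the sketch also assumes a prime table size so that every slot can be reached. Neither detail comes from the slides.

```python
def double_hash_insert(table: list, key: int) -> int:
    """Insert key using double hashing; return the slot that was used."""
    size = len(table)  # assumed to be prime
    h1 = key % size  # first hash: starting slot
    h2 = 1 + (key % (size - 1))  # assumed second hash: step size, never 0
    for i in range(size):
        idx = (h1 + i * h2) % size
        if table[idx] is None:
            table[idx] = key
            return idx
    raise RuntimeError("hash table is full")


table = [None] * 7
# 10, 17 and 24 share the starting slot 3 but use different step sizes.
print([double_hash_insert(table, k) for k in (10, 17, 24)])
```

Because the step size depends on the key, keys that collide on the first hash usually follow different probe sequences, which is how double hashing reduces clustering.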
Thank you
This presentation is created using LibreOffice Impress 7.4.1.2, can be used freely as per GNU General Public License

Web Resources
https://mitu.co.in
http://tusharkute.com

contact@mitu.co.in
tushar@tusharkute.com
