Dshash

Hashing is a data structure technique that allows efficient storage and retrieval of data using a hash function to map keys to indices in a hash table, enabling average O(1) time complexity for search, insert, and delete operations. Key components include the key, hash function, and hash table, while hash collisions occur when multiple keys map to the same index, resolved through techniques like direct chaining and open addressing. Applications of hashing include cryptography, password verification, and various data structures in programming languages.

Uploaded by

bprajna64

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views4 pages

Dshash

Uploaded by

bprajna64

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 4

INTRODUCTION TO HASHING

Syllabus: Hashing - Perfect hashing functions. Hash table, Hash Functions, Operations, Hash collision,
Application.
Hashing
 Hashing is a technique used in data structures that efficiently stores and retrieves data in a way that
allows for quick access.
 Hashing involves mapping data to a specific index in a hash table (an array of items) using a hash
function. It enables fast retrieval of information based on its key.
 The great thing about hashing is, we can achieve all three operations (search, insert and delete) in
O(1) time on average.
 Hashing is mainly used to implement a set of distinct items (only keys) and dictionaries (key value
pairs).

Components of Hashing
There are majorly three components of hashing:
 Key: A Key can be anything string or integer which is fed as input in the hash function the
technique that determines an index or location for storage of an item in a data structure.
 Hash Function: Receives the input key and returns the index of an element in an array called a hash
table. The index is known as the hash index.
 Hash Table: Hash table is typically an array of lists. It stores values corresponding to the keys. Hash
stores the data in an associative manner in an array where each data value has its own unique index.

Hash Table
A Hash table is defined as a data structure used to insert, look up, and remove key-value pairs quickly. It
operates on the hashing concept, where each key is translated by a hash function into a distinct index in an
array. The index functions as a storage location for the matching value. In simple words, it maps the keys
with the value.
Hash Function
 Hash functions are a fundamental concept in computer science and play a crucial role in various
applications such as data storage, retrieval, and cryptography.
 A hash function creates a mapping from an input key to an index in hash table.
 Properties,
a. Deterministic: A hash function must consistently produce the same output for the same input.

Subject: SEPP Notes/20CS43P pg. 1

b. Fixed Output Size: The output of a hash function should have a fixed size, regardless of the size
of the input.
c. Efficiency: The hash function should be able to process input quickly.
d. Uniformity: The hash function should distribute the hash values uniformly across the output
space to avoid clustering.
e. Pre-image Resistance: It should be computationally infeasible to reverse the hash function, i.e.,
to find the original input given a hash value.
f. Collision Resistance: It should be difficult to find two different inputs that produce the same
hash value.
g. Avalanche Effect: A small change in the input should produce a significantly different hash
value.
Hash Collision
Hash functions arc used to map each key to a different address space, but practically it is not possible to
create such a hash function and the problem is called Collision.
Collision is the condition where two records are stored in the same location.
Collision Resolution Techniques
The process of finding an alternate location is called collision resolution. Even though hash tables have
collision problems, they are more efficient in many cases compared to all other data structures, like search
trees.
There are a number of collision resolution techniques, and the most popular are direct chaining and open
addressing.
1) Direct Chaining: An array of linked list application
o Separate chaining
2) Open Addressing: Array-based implementation
o Linear probing (linear search)
o Quadratic probing (nonlinear search)
o Double hashing (use two hash functions)
1. Direct Chaining
Separate Chaining
Collision resolution by chaining combines linked representation with hash table.
When two or more records hash to the same location, these records are constituted into a singly-linked list
called a chain.
Open Addressing
In open addressing all keys are stored in the hash table itself. This approach is also known as closed
hashing. This procedure is based on probing. A collision is resolved by probing.
Linear Probing
The interval between probes is fixed at 1. In linear probing, we search the hash table sequentially. starting
from the original hash location. If a location is occupied, we check the next location. We wrap around from
the last table location to the first table location if necessary. The function for rehashing is the following:
rehash(key) = (n + 1) % table size
One of the problems with line r probing is that table items tend to cluster together in the hash table. This
means that the table contains groups of consecutively occupied locations that are called clustering.
Clusters can get close to one another, and merge into a larger cluster. Thus, the one part of the table might
be quite dense, even though another part has relatively few items. Clustering causes long probe searches

Subject: SEPP Notes/20CS43P pg. 2

and therefore decreases the overall efficiency. The next location to be probed is determined by step-size,
where other step-sizes (more than one) arc possible. The step-size should be relatively prime to the table
size, i.e. their greatest common divisor should be equal to 1. If we choose the table size to be a prime
number, then any step-size is relatively prime to the table size. Clustering cannot be avoided by larger step-
sizes.
Quadratic Probing
The interval between probes increases proportionally to the hash value (the interval thus increasing
linearly, and the indices are described by a quadratic function). The problem of Clustering can be
eliminated if we use the quadratic probing method. In quadratic probing, we start from the original hash
location i. If a location is occupied, we check the locations i+12, i+22, i+32 , i+42 ..... We wrap around
from the last table location to the first table location if necessary. The function for rehashing is
the following: rehash(key) = (n + k2) % table size
Comparisons between Open Addressing methods

Applications
Hashing provides constant time search, insert and delete operations on average. This is why hashing is one
of the most used data structure, example problems are, distinct elements, counting frequencies of items,
finding duplicates, etc.
There are many other applications of hashing, including modern day cryptography hash functions. Some of
these applications are listed below:
 Message Digest : This is an application of cryptographic Hash Functions
 Password Verification : Cryptographic hash functions are very commonly used in password
verification.
 Data Structures(Programming Languages) : Various programming languages have hash table-based
Data Structures. The basic idea is to create a key-value pair where key is supposed to be a unique
value, whereas value can be same for different keys.
 Compiler Operation : To differentiate between the keywords of a programming language(if, else,
for, return etc.) and other identifiers and to successfully compile the program, the compiler stores
all these keywords in a set which is implemented using a hash table.
 Rabin-Karp Algorithm : This is basically a string-searching algorithm which uses hashing to find
any one set of patterns in a string.

Subject: SEPP Notes/20CS43P pg. 3

 Linking File name and path together : In order to store the correspondence between file_name and
file_path the system uses a map(file_name, file_path)which is implemented using a hash table.
 Game Boards: In a game like Tic-Tac-Toe or chess the position of the game may be stored using
hash table.

QUESTIONS
1. Define hashing, hash table, hash collision, hash function
2. List the properties of hash function.
3. Explain components of hashing.
4. List & explain applications of hashing.
5. Code
6. Compare open addressing methods.

Subject: SEPP Notes/20CS43P pg. 4

Product Manual B406-4 ARTEX
100% (1)
Product Manual B406-4 ARTEX
70 pages
Bellabee Explained in Easy Terms and Business Model To Practitioners 2
No ratings yet
Bellabee Explained in Easy Terms and Business Model To Practitioners 2
8 pages
Memo For Hand Receipt Holders
No ratings yet
Memo For Hand Receipt Holders
10 pages
Dsa Labtask 12
No ratings yet
Dsa Labtask 12
5 pages
Hashing Updated
No ratings yet
Hashing Updated
26 pages
Hashing: Amar Jukuntla
No ratings yet
Hashing: Amar Jukuntla
22 pages
Ads M Tech Mid 2
No ratings yet
Ads M Tech Mid 2
26 pages
Hash Tables in DS
No ratings yet
Hash Tables in DS
14 pages
Hash Tables: Dr. Dibakar Saha
No ratings yet
Hash Tables: Dr. Dibakar Saha
26 pages
Chapter One - Hashing PDF
No ratings yet
Chapter One - Hashing PDF
30 pages
Hashing PPT For Student
No ratings yet
Hashing PPT For Student
53 pages
Unit 1 Hashing
No ratings yet
Unit 1 Hashing
61 pages
Hashing
No ratings yet
Hashing
30 pages
Hashing Techniques
No ratings yet
Hashing Techniques
13 pages
11 Hashing
No ratings yet
11 Hashing
60 pages
Hashing
No ratings yet
Hashing
4 pages
DS Module-X
No ratings yet
DS Module-X
74 pages
Hashing
No ratings yet
Hashing
20 pages
Hashing: Data Structure
No ratings yet
Hashing: Data Structure
17 pages
Hashing New
No ratings yet
Hashing New
48 pages
Exp 5 - Dsa Lab File
No ratings yet
Exp 5 - Dsa Lab File
10 pages
Cse373 10 Hashing
No ratings yet
Cse373 10 Hashing
36 pages
Hashing
No ratings yet
Hashing
37 pages
3 Hashing
No ratings yet
3 Hashing
20 pages
Wa0024.
No ratings yet
Wa0024.
11 pages
DS Lecture - 6 (Hashing)
No ratings yet
DS Lecture - 6 (Hashing)
32 pages
Hashing: Data Structure
No ratings yet
Hashing: Data Structure
17 pages
SORTING PROGRAMS - Counting + Bucket + Heap
No ratings yet
SORTING PROGRAMS - Counting + Bucket + Heap
27 pages
DS Lecture - 6 (Hashing)
No ratings yet
DS Lecture - 6 (Hashing)
26 pages
Hashing and Indexing
No ratings yet
Hashing and Indexing
28 pages
Matrix Hashing With Two Level of Collision Resolution: National Institute of Technology Raipur
No ratings yet
Matrix Hashing With Two Level of Collision Resolution: National Institute of Technology Raipur
7 pages
Hash Table Data Structure
No ratings yet
Hash Table Data Structure
34 pages
DS Lecture - 6 (Hashing)
No ratings yet
DS Lecture - 6 (Hashing)
27 pages
Lab5 Hashing Algos
No ratings yet
Lab5 Hashing Algos
10 pages
Lect Hashing
No ratings yet
Lect Hashing
36 pages
HAshing (Satish Sir)
No ratings yet
HAshing (Satish Sir)
52 pages
DS Lecture 01.1 Fall-24-35
No ratings yet
DS Lecture 01.1 Fall-24-35
20 pages
Hashing v2 12032018
No ratings yet
Hashing v2 12032018
23 pages
Hash Function
No ratings yet
Hash Function
9 pages
Hashing
No ratings yet
Hashing
23 pages
Module 5
No ratings yet
Module 5
33 pages
Lab 2
No ratings yet
Lab 2
10 pages
Hash Table: Didih Rizki Chandranegara
No ratings yet
Hash Table: Didih Rizki Chandranegara
33 pages
Hashing
No ratings yet
Hashing
23 pages
DS - Unit 5 - Notes
No ratings yet
DS - Unit 5 - Notes
8 pages
Unit-5 2
No ratings yet
Unit-5 2
9 pages
Hashing Part1 - 241021 - 152911
No ratings yet
Hashing Part1 - 241021 - 152911
10 pages
Hashing Algorithms
No ratings yet
Hashing Algorithms
22 pages
Hashing Slide
No ratings yet
Hashing Slide
16 pages
Dsa Hashing (21CS32)
No ratings yet
Dsa Hashing (21CS32)
16 pages
Lec 11 Hashing and Collision
No ratings yet
Lec 11 Hashing and Collision
16 pages
Implementation Priority Queue Using Array
No ratings yet
Implementation Priority Queue Using Array
3 pages
AST20105 Data Structure and Algorithms: Chapter 9 - Hash Table
No ratings yet
AST20105 Data Structure and Algorithms: Chapter 9 - Hash Table
39 pages
ADI Hashing
No ratings yet
ADI Hashing
47 pages
Ds 5 Update
No ratings yet
Ds 5 Update
26 pages
Unit 5
No ratings yet
Unit 5
50 pages
Algo Cha 8
No ratings yet
Algo Cha 8
20 pages
DSAL Manual Assignment 4
No ratings yet
DSAL Manual Assignment 4
6 pages
Hashing
No ratings yet
Hashing
7 pages
6 Dec. 24 Unit 5 DSA
No ratings yet
6 Dec. 24 Unit 5 DSA
56 pages
Unit V
No ratings yet
Unit V
14 pages
Hashing
From Everand
Hashing
Prakash Hegade
No ratings yet
Mastering Data Structures and Algorithms in C and C++
From Everand
Mastering Data Structures and Algorithms in C and C++
Sachin Naha
No ratings yet
Oop 1
No ratings yet
Oop 1
7 pages
Seppp
No ratings yet
Seppp
8 pages
Seppp 7
No ratings yet
Seppp 7
7 pages
Recursion: Syllabus
No ratings yet
Recursion: Syllabus
7 pages
Sepp 12
No ratings yet
Sepp 12
9 pages
COVID-19, Online Teaching, and Deepening Digital Divide in India
No ratings yet
COVID-19, Online Teaching, and Deepening Digital Divide in India
4 pages
Quotation For MR - Sandeep Bangalore
No ratings yet
Quotation For MR - Sandeep Bangalore
4 pages
07 9100-1.0 Preformed Joint Sealants PD - Exterior Shell
No ratings yet
07 9100-1.0 Preformed Joint Sealants PD - Exterior Shell
41 pages
Final Need Assessment Report of Project Sites, Enhanced Management and Enforcement of Ethiopia's Protected Areas Estate Project
No ratings yet
Final Need Assessment Report of Project Sites, Enhanced Management and Enforcement of Ethiopia's Protected Areas Estate Project
68 pages
Sem - 4 - Jadavpur University Questions Paper
0% (1)
Sem - 4 - Jadavpur University Questions Paper
13 pages
Terminal GTWIN Ver.2.F2 (Full Version)
No ratings yet
Terminal GTWIN Ver.2.F2 (Full Version)
1 page
Tere Naam Ka Shajar by Ayesha Aftab Ali Complete
No ratings yet
Tere Naam Ka Shajar by Ayesha Aftab Ali Complete
303 pages
Grade 11 Functions Notes
No ratings yet
Grade 11 Functions Notes
53 pages
P.E. MCQs XII
No ratings yet
P.E. MCQs XII
69 pages
Natural Cures
100% (3)
Natural Cures
390 pages
Week 4 - General Physics Damped Oscillations PDF
100% (1)
Week 4 - General Physics Damped Oscillations PDF
77 pages
Practice Test - 2
No ratings yet
Practice Test - 2
4 pages
142535
No ratings yet
142535
10 pages
SPCV019598 - in The Matter of Property Seized in Wall Lake PDF
No ratings yet
SPCV019598 - in The Matter of Property Seized in Wall Lake PDF
6 pages
5 Surprise Test
No ratings yet
5 Surprise Test
2 pages
The Roya Thread PDF
No ratings yet
The Roya Thread PDF
46 pages
10g Install in Linux 5
No ratings yet
10g Install in Linux 5
5 pages
Analytical Chemistry
No ratings yet
Analytical Chemistry
7 pages
Cbse Class 10 Science Assertion - Reason Questions For Term 2 Exam 2022
0% (1)
Cbse Class 10 Science Assertion - Reason Questions For Term 2 Exam 2022
2 pages
Syllabus - Energy Supply Systems For Buildings
No ratings yet
Syllabus - Energy Supply Systems For Buildings
13 pages
Magnetic Resonance Brain Imaging Modeling and Data Analysis Using R Jörg Polzehl 2024 Scribd Download
No ratings yet
Magnetic Resonance Brain Imaging Modeling and Data Analysis Using R Jörg Polzehl 2024 Scribd Download
55 pages
Ee Laws Codes and Professional Ethics
No ratings yet
Ee Laws Codes and Professional Ethics
19 pages
Guide For The Preparation and Bend Testing of Welder and Welding Procedure Qualification Test Specimens
No ratings yet
Guide For The Preparation and Bend Testing of Welder and Welding Procedure Qualification Test Specimens
2 pages
Revision of Tenses
0% (1)
Revision of Tenses
4 pages
Participate in Environmental Work Practices
100% (1)
Participate in Environmental Work Practices
20 pages
Knitting For Prems
No ratings yet
Knitting For Prems
3 pages
85001-0527 - SIGA-SEC2 Security Module
No ratings yet
85001-0527 - SIGA-SEC2 Security Module
4 pages

Dshash

Uploaded by

Dshash

Uploaded by

INTRODUCTION TO HASHING

Subject: SEPP Notes/20CS43P pg. 1

Subject: SEPP Notes/20CS43P pg. 2

Subject: SEPP Notes/20CS43P pg. 3

Subject: SEPP Notes/20CS43P pg. 4

You might also like