Sign Language to Text Conversion – A Survey
Abhishek Kulkarni, Vishwatej Harer, Deep Thombare
Under the guidance of Prof. Kopal Gangrade
Department of Computer Science and Engineering,
Pune Institute Of Computer Technology, Pune, MH
Abstract. Sign languages are languages that use the visual-manual modality to convey meaning. They are communication
systems based on gestures, used primarily by deaf and mute people to communicate with each other and with others. Since sign
language is not widely known, only people well versed in it can interpret the gestures and communicate with those who sign.
Hence, a need arises to bridge this gap, and that is our aim. We plan to develop a web application that reads in sign language
and converts it to text that most people understand. Most of the techniques available today have drawbacks such as low
accuracy and sensitivity to skin tone, motion gestures, clutter and variability. Our main aim is to develop a web application
that converts sign language to text while also trying to mitigate these drawbacks to some extent.
Keywords: Sign to Text Conversion, Convolutional neural networks, Deep learning, Web application.
I. Introduction
Sign language detection and conversion is a multi-step process that includes object detection, image processing and feature extraction.
Object detection is a computer vision technique that locates objects such as hand signs, faces, etc. in an image. Image processing is a set
of methods that perform operations on a given image to enhance it or to extract information useful to us. Once an object is detected, we
apply image processing techniques to remove noise and clutter and obtain a simplified version of the image. Feature extraction is a
process by which we obtain relevant information from data and represent it in a lower-dimensional space. Once we have the enhanced
image, we apply feature extraction techniques to it to get useful information.
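To make this pipeline concrete, the following is a minimal sketch of preprocessing and feature extraction with OpenCV; the 64x64 image size, the Otsu thresholding step and the sample file name are illustrative assumptions rather than the exact choices discussed in this survey.

```python
# Minimal preprocessing/feature-extraction sketch (illustrative only; the file
# name, image size and thresholding choices are assumptions, not prescriptions).
import cv2
import numpy as np

def extract_features(frame_bgr):
    """Denoise a captured frame, segment the hand region, and return a
    low-dimensional feature vector."""
    gray = cv2.cvtColor(frame_bgr, cv2.COLOR_BGR2GRAY)        # drop colour channels
    blurred = cv2.GaussianBlur(gray, (5, 5), 0)               # suppress sensor noise/clutter
    _, mask = cv2.threshold(blurred, 0, 255,
                            cv2.THRESH_BINARY + cv2.THRESH_OTSU)  # binarise hand vs. background
    small = cv2.resize(mask, (64, 64))                         # fixed size for the classifier
    return (small.astype(np.float32) / 255.0).ravel()          # flatten to a feature vector

frame = cv2.imread("gesture_sample.jpg")                       # hypothetical input frame
features = extract_features(frame)
print(features.shape)                                          # (4096,)
```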
Once feature extraction is done, we use this relevant information to train a deep learning model. Deep learning is a branch of machine
learning and artificial intelligence in which multi-layer neural networks learn representations directly from data. These networks can be
Convolutional Neural Networks, Recurrent Neural Networks, Generative Adversarial Networks, etc. A CNN is the most suitable choice for
converting sign language to text, as it is a type of neural network that uses stacked layers of perceptrons and convolutional filters to analyze
data and train a model. It applies to image processing, natural language processing and many other tasks related to cognitive capabilities.
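As a rough illustration of such a classifier, the sketch below defines a small Keras CNN with a softmax output trained with the ADAM optimizer; the layer sizes, the 64x64 grayscale input and the 26-class alphabet output are assumptions made for the example, not the architecture of any surveyed system.

```python
# Minimal CNN sketch in Keras for classifying fixed-size gesture images into
# sign classes; layer sizes and the 26-class output are illustrative assumptions.
import tensorflow as tf
from tensorflow.keras import layers, models

def build_model(num_classes=26, input_shape=(64, 64, 1)):
    model = models.Sequential([
        layers.Conv2D(32, (3, 3), activation="relu", input_shape=input_shape),
        layers.MaxPooling2D((2, 2)),                      # subsample spatial features
        layers.Conv2D(64, (3, 3), activation="relu"),
        layers.MaxPooling2D((2, 2)),
        layers.Flatten(),
        layers.Dense(128, activation="relu"),
        layers.Dense(num_classes, activation="softmax"),  # one probability per sign
    ])
    model.compile(optimizer="adam",
                  loss="sparse_categorical_crossentropy",
                  metrics=["accuracy"])
    return model

model = build_model()
model.summary()
# model.fit(train_images, train_labels, epochs=10, validation_split=0.1)
```

The commented model.fit call indicates where labelled gesture images and their class indices would be supplied.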
Once a trained model is obtained, it is deployed on the cloud, and an interface has to be developed that uses the model to detect and
understand gestures. We also need a camera to record the gestures and an output device to display the text. Since most people use
smartphones and laptops, developing a web application is a very convenient way to achieve this. A web application is developed in
which we record the sign language gestures and pass them on to the model on the cloud for conversion. The resulting text can then
be displayed on the screen.
The proposed system would work like this:
Fig 1: The proposed system to convert sign language to text.
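On the server side, a minimal sketch of such a cloud-hosted conversion endpoint is shown below using Flask; the /predict route, the sign_model.h5 file and the alphabet label list are hypothetical names used only for illustration.

```python
# Sketch of a server endpoint that accepts an uploaded frame and returns the
# predicted sign as text; endpoint name, model path and labels are assumptions.
import cv2
import numpy as np
import tensorflow as tf
from flask import Flask, request, jsonify

app = Flask(__name__)
model = tf.keras.models.load_model("sign_model.h5")        # hypothetical trained model
LABELS = [chr(ord("A") + i) for i in range(26)]            # assumed alphabet classes

@app.route("/predict", methods=["POST"])
def predict():
    raw = np.frombuffer(request.files["frame"].read(), np.uint8)
    img = cv2.imdecode(raw, cv2.IMREAD_GRAYSCALE)           # decode the uploaded frame
    img = cv2.resize(img, (64, 64)).astype(np.float32) / 255.0
    probs = model.predict(img[np.newaxis, ..., np.newaxis]) # shape (1, 64, 64, 1)
    return jsonify({"text": LABELS[int(probs.argmax())]})

if __name__ == "__main__":
    app.run()
```

The browser side would capture webcam frames, POST them to this endpoint and display the returned text on the page.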
II. Related Works
Bantupalli et al. [1] have used a convolutional neural network for spatial feature extraction, long short-term memory (LSTM) recurrent neural
networks for temporal feature extraction, and then the ADAM optimizer with a softmax layer for prediction. Modi et al. [2] have developed a system
that obtains frames from a video every 4 seconds and matches them against a database to find the entry with the least error. Abdulla et al. [3] have used
sensors with RF transmitters that send signals based on hand movement to a receiver that translates them into the Arabic language. Dutta et al. [4] have
developed a system that calculates eigenvalues for images, takes the pre-processed image as input and checks for the maximum match.
Padmavathi et al. [5] have developed a system which converts a video into frames and applies HSI color model based segmentation to each; a neural
network is then used to predict the character. Anand et al. [6] have developed a system which creates an image feature vector through binarization,
noise removal and hand detection in the image, and then compares it with an existing database. For speech to sign conversion, noise is removed from
the audio, which is converted to text and then compared with the existing database. Ong et al. [7] have developed a system which detects hand signs,
hand shape and the position of the hand across all positions and scales in the given image after removing erroneous values. Bhat et al. [8] have
developed a system which uses sensors in gloves to pick up gestures, converts them to text with an analog-to-digital converter and microcontrollers,
and sends the text to a phone via Bluetooth, which then converts the text to speech. Pramada et al. [9] have developed a system which captures an
image, performs RGB color detection, converts it to a binary image and then performs pattern matching and text-to-speech conversion. Huang et al.
[10] have developed a system which uses a Microsoft Kinect as the input device, providing color and depth video streams over five inputs; their CNN
takes 9 frames as input and has 8 layers, with convolution and subsampling performed multiple times. Madhuri et al. [11] have developed a system in
which an image is obtained using a mobile camera, image processing is performed to extract and match the hand sign, and the corresponding audio
file is then played. Wu et al. [12] have developed a system in which a classifier is trained on both positive and negative data; in each run the weak
classifier with the lowest error rate is chosen, and all the classifiers are then combined into a strong classifier to detect the meaning of the gestures.
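Several of the approaches above ([5], [9]) rely on color-model based segmentation of the hand region. The following is a hedged sketch of that idea using OpenCV's HSV color space as a stand-in for HSI; the skin-tone bounds are assumptions and would typically need tuning per camera and lighting.

```python
# Illustrative colour-model based hand segmentation in the spirit of [5] and [9];
# the skin-tone bounds below are assumed values, not those used by the authors.
import cv2
import numpy as np

def segment_hand(frame_bgr):
    hsv = cv2.cvtColor(frame_bgr, cv2.COLOR_BGR2HSV)
    lower = np.array([0, 40, 60], dtype=np.uint8)      # assumed lower skin-tone bound
    upper = np.array([25, 255, 255], dtype=np.uint8)   # assumed upper skin-tone bound
    mask = cv2.inRange(hsv, lower, upper)               # binary hand mask
    mask = cv2.morphologyEx(mask, cv2.MORPH_OPEN,
                            np.ones((5, 5), np.uint8))  # remove small speckles
    return cv2.bitwise_and(frame_bgr, frame_bgr, mask=mask)
```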
Jarndal et al. [13] have developed two systems for conversion to dual-language (English and Arabic) text and voice: a vision based system and a
wireless-interfaced glove based system. Hays et al. [14] have developed a mobile application for real-time sign language to text conversion from a
video input using the classification algorithms Locality Preserving Projections (LPP) and Support Vector Machine (SVM). Vijayalakshmi et al. [15]
have developed a flex sensor, tactile sensor and accelerometer based, HMM driven sign language to text and speech conversion model.
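Where a classical classifier is applied to extracted feature vectors, as in [14], that step can be sketched with scikit-learn's SVM; the synthetic data below merely stands in for real gesture features, and the LPP dimensionality reduction from [14] is omitted.

```python
# Sketch of SVM classification over gesture feature vectors; the random data
# is a placeholder for real extracted features and labels.
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

rng = np.random.default_rng(0)
X = rng.random((500, 256))             # placeholder feature vectors
y = rng.integers(0, 26, size=500)      # placeholder alphabet labels

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)
clf = SVC(kernel="rbf", C=1.0)         # RBF-kernel support vector classifier
clf.fit(X_train, y_train)
print("held-out accuracy:", clf.score(X_test, y_test))
```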
III. Comparison of different sign language translation methods
The following Table 1 gives an idea of the different methods used in the field of sign language detection and translation by different
authors. It also illustrates some of the recommendations we think could be implemented.
Table 1. Comparison of different methods used for sign language conversion

[1] Bantupalli et al.
Objective: Create a vision based application that offers sign language translation to text, aiding communication.
Methodology: A CNN named 'Inception' is used for spatial feature extraction from the video; an LSTM/RNN model then extracts temporal features from the outputs of the softmax and pool layers of the CNN.
Results/Outcome: Accuracy was higher for the softmax layer than for the pool layer across various sample sizes.
Advantage: As the CNN and RNN are trained independently, the cross-entropy cost function is minimized using ADAM.
Disadvantage: The model faced problems while testing with different skin tones; accuracy dropped if it had not been trained on a particular skin tone.

[2] Modi et al.
Objective: A method to enable translating sign language finger-spellings to English text and to enable finger-spelling to digital, audio or text conversion.
Methodology: A frame is extracted from the video every 4 seconds and processed; the extracted features are compared with a database of finger-spellings. The error is calculated, and the entry with the minimum error is the best match.
Results/Outcome: This approach gives a clear comparison of each finger-spelling with all other database images. It resulted in 96% accuracy.
Advantage: Simple mechanism to detect the finger-spelling. Easy to implement, with the desired output probability of 0.96.
Disadvantage: More gestures and features need to be added so that motion is supported too.

[3] Abdulla et al.
Objective: To develop a device which uses gloves to convert sign language to Arabic text.
Methodology: Five flex sensors detect the bending of each finger. An Arduino NANO interfaced with an RF transmitter transmits the signals to a receiver; the received signals generate Arabic letters, which are displayed on an LCD screen.
Results/Outcome: When a person wearing the smart glove makes an Arabic letter gesture, the LCD displays the letter and the speaker outputs the voice when the sound button is clicked.
Advantage: Low cost, and it can be used to represent a wider range of words.
Disadvantage: Two gloves need to be combined instead of one, since a single glove does not cover a wide range of signs, and the use of smart gloves is a compulsion.

[4] Dutta et al.
Objective: To develop a system trained to convert single and double handed sign language to text and later to speech.
Methodology: The Min Eigenvalue method is applied to 5 images of each alphabet and to the pre-processed input image. Interesting points are extracted and checked for the maximum match.
Results/Outcome: The test image and the database images are compared by matching the feature points, and the matching database image is displayed.
Advantage: It is carried out with bare hands and the results are background and person independent.
Disadvantage: -

[5] Padmavathi et al.
Objective: To convert Indian sign language hand gestures to appropriate text messages.
Methodology: The video is converted to frames and HSI color model based segmentation is applied to each. Features like the centroid of the hand are extracted and fed to a neural network to recognize the particular character.
Results/Outcome: The accuracy obtained is 99%, with precision 89.47%, recall 89.78% and specificity 97.54%.
Advantage: Accuracy is high, and the approach gave better results with a sigmoid transfer function.
Disadvantage: Improper segmentation when hands are overlapped, which results in varying robustness.

[6] Anand et al.
Objective: Ease communication between deaf/dumb and normal people without the use of sophisticated devices like data gloves.
Methodology: Create an image feature vector through noise removal and hand detection in the image, and compare it with an existing database. For speech to sign conversion, remove noise from the audio, convert it to text and compare with the existing database.
Results/Outcome: Not implemented yet.
Advantage: A convenient way of communication between deaf/dumb and normal people with two-way translation.
Disadvantage: Should be extended to words and sentences. Difficult to implement the image processing technique on mobile phones.

[7] Ong et al.
Objective: Train a detector to recognize the human hand in an image and also classify the hand shape.
Methodology: Exhaustive detection across all positions and scales, image thresholding, and connected component analysis to detect the position of the hand. The sub-image in the area of the detected hand is given to hand shape detectors.
Results/Outcome: 99.8% success rate on hand detection and 97.4% success rate on hand shape classification.
Advantage: Unsupervised approach trained using the K-medoid algorithm; very efficient on gray-level images.
Disadvantage: No motion or background models; accuracy not evaluated in environments with more clutter and variability.

[8] Bhat et al.
Objective: Improve communication in Indian Sign Language using flex sensor technology.
Methodology: Sensors in gloves pick up gestures, which are converted to text with an ADC and microcontrollers and sent to a phone via Bluetooth, which then converts the text to speech.
Results/Outcome: Successfully converts Indian Sign Language, numbers and symbols to text and displays them on a mobile phone.
Advantage: Reliable, user independent and portable system that consumes less power compared to other systems.
Disadvantage: -
IV. Conclusion
Sign language is a visual language. Visual information is the most important type of information perceived, processed and
interpreted by the human brain. Digital image processing, as a computer-based technology, has applications in a variety of
fields such as image sharpening, restoration, medical imaging and remote sensing. Likewise, deep learning, as a subfield of
machine learning, tries to imitate the workings of the human brain and is used in fields like speech and image recognition.
After going through the papers listed above on sign language conversion, we feel it is safe to say that there are many
techniques for converting sign language to various output types, each with its own advantages and drawbacks. Our plan is to
develop a web application that helps convert sign language to text output.
V. Acknowledgment
The authors express their gratitude to the mentors and faculty members who guided us throughout this research and helped us
achieve the desired results.
VI. References
1. Kshitij Bantupalli, Ying Xie: “American Sign Language Recognition using Deep Learning and Computer Vision”, 2018 IEEE Conference
on Big Data.
2. Krishna Modi, Amrita More: “Translation of Sign Language Finger-Spelling to Text using Image Processing”, International Journal of
Computer Applications, 11 September 2013, vol. 77.
3. Dalal Abdulla, Shahrazad Abdulla, Rameesa Manaf, Anwar H. Jarndal: “Design and Implementation of A Sign to Speech/Text System
for Deaf and Dumb People”, 2016 Fifth International Conference on Electronic Devices, Systems and Applications (ICEDSA).
4. Kusumika Krori Dutta, Satheesh Kumar Raju K, Anil Kumar G S, Sunny Arokia Swamy B: “Double Handed Indian Sign Language to
Speech and Text”, 2015 Third International Conference on Image Information Processing.
5. Padmavathi. S, Saipreethy M S, Valliammai V: “Indian Sign Language character recognition using Neural Networks”, IJCA Special Issue
on Recent Trends in Pattern Recognition and Image Analysis (RTPRIA).
6. M Suresh Anand, A. Kumaresan, Dr. N Mohan Kumar: “An integrated two way ISL (Indian Sign Language) translation system - A new
approach”, International Journal of Advanced Research in Computer Science, Jan/Feb 2013, Vol. 4 Issue 1, pp. 7-12.
7. Eng-Jon Ong, Richard Bowden: “A Boosted Classifier tree for Hand Shape Detection”, Sixth IEEE International Conference on
Automatic Face and Gesture Recognition, 2004.
8. Sachin Bhat, Amruthesh M, Ashik Chidanandas, Sujith: “Translating Indian Sign Language to text and voice messages using flex
sensors”, International Journal of Advanced Research in Computer and Communication Engineering, May 2015, Vol. 4 Issue 5.
9. Sawaant Pramada, Deshpande Saylee, Naale Pranita, Nerkar Samiksha, Mrs. Archana S Vaidya: “Intelligent Sign Language recognition
using Image Processing”, IOSR Journal of Engineering, Feb 2013, Vol. 3 Issue 2, pp 45-51.
10. Jie Huang, Wengang Zhou, Houqiang Li, Weiping Li: “Sign Language Recognition using 3D Convolutional Neural Networks”, 2015
IEEE Conference on Multimedia and Expo (ICME), 1-6, 2015.
11. Yellapu Madhuri, Anitha G, Anburajan M: “Vision-based Sign Language Translation Device”, 2013 International Conference on
Information Communication and Embedded Systems (ICICES), 565-568, 2013.
12. Shuqiong Wu, Hiroshi Nagahashi: “Real-time 2D hands detection and tracking for Sign Language Recognition”, Proceedings of the 2013
8th International Conference on System of Systems Engineering, Maui, Hawaii, USA, Jun 2-6, 2013.
13. Anwar Jarndal, Ahmed Al-Maflehi: “On Design and Implementation of A Sign-to-Speech/Text System”, 2017 International Conference
on Electrical, Electronics, Communication, Computer and Optimization Techniques (ICEECCOT).
14. Philip Hays, Raymond Ptucha, Roy Melton: “Mobile Device to Cloud co-processing of ASL Finger Spelling to Text Conversion”, 2013
IEEE Western New York Image Processing Workshop (WNYIPW), 22-23 Nov, 2013.
15. Vijayalakshmi P, Aarthi M: “Sign Language to Speech Conversion”, 2016 International Conference on Recent Trends in Information
Technology, 8-9 April, 2016.