[go: up one dir, main page]

IN2014MU00739A - - Google Patents

Download PDF

Info

Publication number
IN2014MU00739A
IN2014MU00739A IN739MU2014A IN2014MU00739A IN 2014MU00739 A IN2014MU00739 A IN 2014MU00739A IN 739MU2014 A IN739MU2014 A IN 739MU2014A IN 2014MU00739 A IN2014MU00739 A IN 2014MU00739A
Authority
IN
India
Prior art keywords
cvr
real
modification
signal processing
consonant
Prior art date
Application number
Inventor
C Pandey Prem
R Jayan A
Tiwari Nitya
Original Assignee
Indian Inst Technology Bombay
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Indian Inst Technology Bombay filed Critical Indian Inst Technology Bombay
Priority to IN739MU2014 priority Critical patent/IN2014MU00739A/en
Priority to US15/121,599 priority patent/US10176824B2/en
Priority to PCT/IN2015/000048 priority patent/WO2015132798A2/en
Publication of IN2014MU00739A publication Critical patent/IN2014MU00739A/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L21/0232Processing in the frequency domain
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0316Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
    • G10L21/0364Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0264Noise filtering characterised by the type of parameter measurement, e.g. correlation techniques, zero crossing techniques or predictive techniques
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/21Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L25/87Detection of discrete points within a voice signal

Landscapes

  • Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Cable Transmission Systems, Equalization Of Radio And Reduction Of Echo (AREA)
  • Monitoring And Testing Of Transmission In General (AREA)

Abstract

Increasing the level of the consonant segments relative to the nearby vowel segments, known as consonant-vowel ratio (CVR) modification, is reported to be effective in improving speech intelligibility by listeners in noisy backgrounds and by hearing-impaired listeners. A method along with a system for real-time CVR modification using the rate of change of spectral centroid for detection of spectral transitions is disclosed. A preferred embodiment of the invention using a 16-bit fixed point processor with on-chip FFT hardware is also presented for real-time signal processing. It can be integrated with other FFT-based signal processing in communication devices, hearing aids, and other systems for improving speech perception under adverse listening conditions.
IN739MU2014 2014-03-04 2015-01-27 IN2014MU00739A (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
IN739MU2014 IN2014MU00739A (en) 2014-03-04 2015-01-27
US15/121,599 US10176824B2 (en) 2014-03-04 2015-01-27 Method and system for consonant-vowel ratio modification for improving speech perception
PCT/IN2015/000048 WO2015132798A2 (en) 2014-03-04 2015-01-27 Method and system for consonant-vowel ratio modification for improving speech perception

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
IN739MU2014 IN2014MU00739A (en) 2014-03-04 2015-01-27

Publications (1)

Publication Number Publication Date
IN2014MU00739A true IN2014MU00739A (en) 2015-09-25

Family

ID=54055960

Family Applications (1)

Application Number Title Priority Date Filing Date
IN739MU2014 IN2014MU00739A (en) 2014-03-04 2015-01-27

Country Status (3)

Country Link
US (1) US10176824B2 (en)
IN (1) IN2014MU00739A (en)
WO (1) WO2015132798A2 (en)

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170294185A1 (en) * 2016-04-08 2017-10-12 Knuedge Incorporated Segmentation using prior distributions
TWI622978B (en) * 2017-02-08 2018-05-01 宏碁股份有限公司 Voice signal processing apparatus and voice signal processing method
KR102017244B1 (en) * 2017-02-27 2019-10-21 한국전자통신연구원 Method and apparatus for performance improvement in spontaneous speech recognition
CN109346061B (en) * 2018-09-28 2021-04-20 腾讯音乐娱乐科技(深圳)有限公司 Audio detection method, device and storage medium
CN111429935B (en) * 2020-02-28 2023-08-29 北京捷通华声科技股份有限公司 Voice caller separation method and device
WO2022034139A1 (en) * 2020-08-12 2022-02-17 Dolby International Ab Automatic detection and attenuation of speech-articulation noise events
KR102338563B1 (en) * 2021-02-05 2021-12-13 이기헌 System for visualizing voice for english education and method thereof
CN113707156B (en) * 2021-08-06 2024-04-05 武汉科技大学 Vehicle-mounted voice recognition method and system

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4454609A (en) 1981-10-05 1984-06-12 Signatron, Inc. Speech intelligibility enhancement
US5737719A (en) 1995-12-19 1998-04-07 U S West, Inc. Method and apparatus for enhancement of telephonic speech signals
AUPQ366799A0 (en) 1999-10-26 1999-11-18 University Of Melbourne, The Emphasis of short-duration transient speech features
US7920697B2 (en) * 1999-12-09 2011-04-05 Broadcom Corp. Interaction between echo canceller and packet voice processing
US6889186B1 (en) 2000-06-01 2005-05-03 Avaya Technology Corp. Method and apparatus for improving the intelligibility of digitally compressed speech
US8023557B2 (en) * 2007-12-31 2011-09-20 Silicon Laboratories Inc. Hardware synchronizer for 802.15.4 radio to minimize processing power consumption
ES2678415T3 (en) * 2008-08-05 2018-08-10 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and procedure for processing and audio signal for speech improvement by using a feature extraction
US9084893B2 (en) * 2009-02-03 2015-07-21 Hearworks Pty Ltd Enhanced envelope encoded tone, sound processor and system
US8260220B2 (en) * 2009-09-28 2012-09-04 Broadcom Corporation Communication device with reduced noise speech coding
JPWO2011055489A1 (en) * 2009-11-04 2013-03-21 パナソニック株式会社 hearing aid
JP5665780B2 (en) * 2012-02-21 2015-02-04 株式会社東芝 Speech synthesis apparatus, method and program
US9177559B2 (en) * 2012-04-24 2015-11-03 Tom Stephenson Method and apparatus for analyzing animal vocalizations, extracting identification characteristics, and using databases of these characteristics for identifying the species of vocalizing animals

Also Published As

Publication number Publication date
US10176824B2 (en) 2019-01-08
WO2015132798A2 (en) 2015-09-11
US20160365099A1 (en) 2016-12-15
WO2015132798A3 (en) 2015-11-12

Similar Documents

Publication Publication Date Title
IN2014MU00739A (en)
EP2846225A3 (en) Systems and methods for visual processing of spectrograms to generate haptic effects
EP3726525A4 (en) Electronic device for analyzing meaning of speech, and operation method therefor
EP2804177A3 (en) Method for processing an audio signal and audio receiving circuit
WO2013162994A3 (en) Systems and methods for audio signal processing
EP4053500A4 (en) Object recognition system, signal processing method of object recognition system, and electronic device
EP3438623A4 (en) Abnormal sound detection learning device, acoustic feature value extraction device, abnormal sound sampling device, and method and program for same
EP3713182A4 (en) Processing method and device for cache synchronous exception
EP3669289A4 (en) Method and electronic device for translating speech signal
EP3204944A4 (en) Method, device, and system of noise reduction and speech enhancement
EP3484141A4 (en) Image processing device, image processing method, and image processing circuit
EP3588797A4 (en) Electronic device, communication apparatus, and signal processing method
EP2925016A3 (en) Microphone device and microphone unit
TW201615036A (en) Ear pressure sensors integrated with speakers for smart sound level exposure
UA114027C2 (en) SYSTEMS AND METHODS OF IMPLEMENTATION OF ADJUSTMENT
EP3304548A4 (en) Electronic device and method of audio processing thereof
EP3663905A4 (en) Information processing device, speech recognition system, and information processing method
EP3602555B8 (en) Apparatus and method for determining a predetermined characteristic related to a spectral enhancement processing of an audio signal
EP3457402A4 (en) Signal processing method and device adaptive to noise environment and terminal device employing same
EP3596939A4 (en) Sound output apparatus and signal processing method thereof
EP2663095A3 (en) Hearing aid with distributed processing in ear piece
EP3508949A4 (en) Signal processing device, signal processing method, program, and electronic device
EP3253034A4 (en) Method and apparatus for controlling sound collection range of multi-microphone de-noising of terminal
WO2015183728A3 (en) Enhancing intelligibility of speech content in an audio signal
EP3882657A4 (en) Signal processing device and signal processing method