[go: up one dir, main page]

GB2617613B - An audio processing method and apparatus - Google Patents

An audio processing method and apparatus Download PDF

Info

Publication number
GB2617613B
GB2617613B GB2205590.9A GB202205590A GB2617613B GB 2617613 B GB2617613 B GB 2617613B GB 202205590 A GB202205590 A GB 202205590A GB 2617613 B GB2617613 B GB 2617613B
Authority
GB
United Kingdom
Prior art keywords
processing method
audio processing
audio
processing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
GB2205590.9A
Other versions
GB2617613A (en
GB202205590D0 (en
Inventor
Zorila Tudor-Catalin
Sanand Doddipatla Rama
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Toshiba Corp
Original Assignee
Toshiba Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Toshiba Corp filed Critical Toshiba Corp
Priority to GB2205590.9A priority Critical patent/GB2617613B/en
Publication of GB202205590D0 publication Critical patent/GB202205590D0/en
Priority to CN202310163454.XA priority patent/CN116913296A/en
Priority to JP2023028408A priority patent/JP7551805B2/en
Publication of GB2617613A publication Critical patent/GB2617613A/en
Application granted granted Critical
Publication of GB2617613B publication Critical patent/GB2617613B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0272Voice signal separating
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/27Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
    • G10L25/30Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique using neural networks
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • G10L25/60Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for measuring the quality of voice signals

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Electrically Operated Instructional Devices (AREA)
  • Measurement Of The Respiration, Hearing Ability, Form, And Blood Characteristics Of Living Organisms (AREA)
  • Image Processing (AREA)
  • Control Of Amplification And Gain Control (AREA)
GB2205590.9A 2022-04-14 2022-04-14 An audio processing method and apparatus Active GB2617613B (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
GB2205590.9A GB2617613B (en) 2022-04-14 2022-04-14 An audio processing method and apparatus
CN202310163454.XA CN116913296A (en) 2022-04-14 2023-02-16 Audio processing method and device
JP2023028408A JP7551805B2 (en) 2022-04-14 2023-02-27 Audio processing method and apparatus

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
GB2205590.9A GB2617613B (en) 2022-04-14 2022-04-14 An audio processing method and apparatus

Publications (3)

Publication Number Publication Date
GB202205590D0 GB202205590D0 (en) 2022-06-01
GB2617613A GB2617613A (en) 2023-10-18
GB2617613B true GB2617613B (en) 2024-10-30

Family

ID=81753229

Family Applications (1)

Application Number Title Priority Date Filing Date
GB2205590.9A Active GB2617613B (en) 2022-04-14 2022-04-14 An audio processing method and apparatus

Country Status (3)

Country Link
JP (1) JP7551805B2 (en)
CN (1) CN116913296A (en)
GB (1) GB2617613B (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9741360B1 (en) * 2016-10-09 2017-08-22 Spectimbre Inc. Speech enhancement for target speakers
WO2022056226A1 (en) * 2020-09-14 2022-03-17 Pindrop Security, Inc. Speaker specific speech enhancement

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4162604B2 (en) 2004-01-08 2008-10-08 株式会社東芝 Noise suppression device and noise suppression method
JP6764028B2 (en) 2017-07-19 2020-09-30 日本電信電話株式会社 Mask calculation device, cluster weight learning device, mask calculation neural network learning device, mask calculation method, cluster weight learning method and mask calculation neural network learning method
JP7404657B2 (en) 2019-05-28 2023-12-26 沖電気工業株式会社 Speech recognition device, speech recognition program, and speech recognition method
CN111261146B (en) 2020-01-16 2022-09-09 腾讯科技(深圳)有限公司 Speech recognition and model training method, device and computer readable storage medium

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9741360B1 (en) * 2016-10-09 2017-08-22 Spectimbre Inc. Speech enhancement for target speakers
WO2022056226A1 (en) * 2020-09-14 2022-03-17 Pindrop Security, Inc. Speaker specific speech enhancement

Also Published As

Publication number Publication date
JP7551805B2 (en) 2024-09-17
GB2617613A (en) 2023-10-18
CN116913296A (en) 2023-10-20
JP2023157845A (en) 2023-10-26
GB202205590D0 (en) 2022-06-01

Similar Documents

Publication Publication Date Title
EP4054177C0 (en) AUDIO PROCESSING METHOD AND APPARATUS
EP4270897A4 (en) Message processing method and related apparatus
PL4320877T3 (en) Audio apparatus and method therefor
GB2610461B (en) Processing method and apparatus
EP4273863A4 (en) Audio processing method and apparatus, and electronic device
EP4236533A4 (en) Signal processing method and apparatus
EP4236513A4 (en) Signal processing method and apparatus
EP4120561A4 (en) Signal processing method and apparatus
EP4383697A4 (en) Audio processing method and apparatus
GB2617081B (en) Audio signal processing method and apparatus
EP4213428A4 (en) Signal processing method and apparatus
EP4195558A4 (en) Signal processing method and apparatus
EP4184797A4 (en) Signal processing method and apparatus
GB2617613B (en) An audio processing method and apparatus
EP4194894A4 (en) Signal processing method and apparatus
EP4332585A4 (en) Signal processing method and apparatus
EP4156510A4 (en) Signal processing method and apparatus
GB202412986D0 (en) Audio signal processing method and apparatus
GB2607992B (en) Speech processing method and apparatus
GB2620117B (en) Data processing apparatus and method
GB2621305B (en) Data processing apparatus and method
GB2618815B (en) Data processing apparatus and method
ZA202209411B (en) Audio de-amplification method and apparatus
GB202411146D0 (en) Methods and processing apparatus
GB202305509D0 (en) Audio processing system and method