SG11202010803VA - System and method for determining voice characteristics - Google Patents
System and method for determining voice characteristicsInfo
- Publication number
- SG11202010803VA SG11202010803VA SG11202010803VA SG11202010803VA SG11202010803VA SG 11202010803V A SG11202010803V A SG 11202010803VA SG 11202010803V A SG11202010803V A SG 11202010803VA SG 11202010803V A SG11202010803V A SG 11202010803VA SG 11202010803V A SG11202010803V A SG 11202010803VA
- Authority
- SG
- Singapore
- Prior art keywords
- voice characteristics
- determining voice
- determining
- voice
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/04—Training, enrolment or model building
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/02—Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/06—Decision making techniques; Pattern matching strategies
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/18—Artificial neural networks; Connectionist approaches
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/20—Pattern transformations or operations aimed at increasing system robustness, e.g. against channel noise or different working conditions
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- Business, Economics & Management (AREA)
- Game Theory and Decision Science (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Machine Translation (AREA)
- Image Analysis (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
- Testing, Inspecting, Measuring Of Stereoscopic Televisions And Televisions (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Telephonic Communication Services (AREA)
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2019/114812 WO2020035085A2 (en) | 2019-10-31 | 2019-10-31 | System and method for determining voice characteristics |
Publications (1)
Publication Number | Publication Date |
---|---|
SG11202010803VA true SG11202010803VA (en) | 2020-11-27 |
Family
ID=69525955
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
SG11202010803VA SG11202010803VA (en) | 2019-10-31 | 2019-10-31 | System and method for determining voice characteristics |
SG11202013135XA SG11202013135XA (en) | 2019-10-31 | 2020-01-09 | System and method for personalized speaker verification |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
SG11202013135XA SG11202013135XA (en) | 2019-10-31 | 2020-01-09 | System and method for personalized speaker verification |
Country Status (5)
Country | Link |
---|---|
US (3) | US10997980B2 (en) |
CN (2) | CN111712874B (en) |
SG (2) | SG11202010803VA (en) |
TW (1) | TWI737462B (en) |
WO (2) | WO2020035085A2 (en) |
Families Citing this family (32)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108806696B (en) * | 2018-05-08 | 2020-06-05 | 平安科技(深圳)有限公司 | Method and device for establishing voiceprint model, computer equipment and storage medium |
US11556848B2 (en) * | 2019-10-21 | 2023-01-17 | International Business Machines Corporation | Resolving conflicts between experts' intuition and data-driven artificial intelligence models |
SG11202010803VA (en) * | 2019-10-31 | 2020-11-27 | Alipay Hangzhou Inf Tech Co Ltd | System and method for determining voice characteristics |
US11651767B2 (en) | 2020-03-03 | 2023-05-16 | International Business Machines Corporation | Metric learning of speaker diarization |
US11443748B2 (en) * | 2020-03-03 | 2022-09-13 | International Business Machines Corporation | Metric learning of speaker diarization |
CN111833855B (en) * | 2020-03-16 | 2024-02-23 | 南京邮电大学 | Multi-to-multi speaker conversion method based on DenseNet STARGAN |
CN111540367B (en) * | 2020-04-17 | 2023-03-31 | 合肥讯飞数码科技有限公司 | Voice feature extraction method and device, electronic equipment and storage medium |
CN111524525B (en) * | 2020-04-28 | 2023-06-16 | 平安科技(深圳)有限公司 | Voiceprint recognition method, device, equipment and storage medium of original voice |
US20220067279A1 (en) * | 2020-08-31 | 2022-03-03 | Recruit Co., Ltd., | Systems and methods for multilingual sentence embeddings |
US12165311B2 (en) * | 2020-11-04 | 2024-12-10 | Samsung Sds America, Inc. | Unsupervised representation learning and active learning to improve data efficiency |
CN112487384B (en) * | 2020-11-25 | 2024-12-03 | 华为技术有限公司 | Identity verification method and system |
CN112418173A (en) * | 2020-12-08 | 2021-02-26 | 北京声智科技有限公司 | Abnormal sound recognition method, device and electronic device |
CN113555032B (en) * | 2020-12-22 | 2024-03-12 | 腾讯科技(深圳)有限公司 | Multi-speaker scene recognition and network training method and device |
US11605369B2 (en) * | 2021-03-10 | 2023-03-14 | Spotify Ab | Audio translator |
US11689868B2 (en) * | 2021-04-26 | 2023-06-27 | Mun Hoong Leong | Machine learning based hearing assistance system |
CN113345454B (en) * | 2021-06-01 | 2024-02-09 | 平安科技(深圳)有限公司 | Training and application methods, devices, equipment and storage medium of voice conversion model |
CN114067803B (en) * | 2021-10-21 | 2025-01-03 | 北京工业大学 | A speaker verification method based on distance correlation metric learning |
CN114023343B (en) * | 2021-10-30 | 2024-04-30 | 西北工业大学 | Voice conversion method based on semi-supervised feature learning |
TWI795173B (en) * | 2022-01-17 | 2023-03-01 | 中華電信股份有限公司 | Multilingual speech recognition system, method and computer readable medium |
CN114512128B (en) * | 2022-02-08 | 2025-07-04 | 招商银行股份有限公司 | Speech recognition method, device, equipment and computer-readable storage medium |
CN114529191B (en) * | 2022-02-16 | 2024-10-22 | 支付宝(杭州)信息技术有限公司 | Method and device for risk identification |
CN114566170B (en) * | 2022-03-01 | 2024-12-10 | 北京邮电大学 | A lightweight voice spoofing detection algorithm based on one-class classification |
CN114639372B (en) * | 2022-03-07 | 2024-10-25 | 哈尔滨理工大学 | Language Identification Method Based on Adjusted Cosine Mutual Information Estimation |
CN114694658B (en) * | 2022-03-15 | 2025-05-02 | 青岛海尔科技有限公司 | Speaker recognition model training, speaker recognition method and device |
US12277939B2 (en) | 2022-05-02 | 2025-04-15 | Tencent America LLC | Progressive contrastive learning framework for self-supervised speaker verification |
US20230386479A1 (en) * | 2022-05-27 | 2023-11-30 | Tencent America LLC | Techniques for improved zero-shot voice conversion with a conditional disentangled sequential variational auto-encoder |
US20230402041A1 (en) * | 2022-06-10 | 2023-12-14 | International Business Machines Corporation | Individual recognition using voice detection |
CN115035890B (en) * | 2022-06-23 | 2023-12-05 | 北京百度网讯科技有限公司 | Training method and device of voice recognition model, electronic equipment and storage medium |
TWI878975B (en) * | 2023-07-13 | 2025-04-01 | 新唐科技股份有限公司 | Speech recognition device and method |
CN116863961A (en) * | 2023-08-07 | 2023-10-10 | 中国信息通信研究院 | A method of forgery detection based on language processing |
CN117495571B (en) * | 2023-12-28 | 2024-04-05 | 北京芯盾时代科技有限公司 | Data processing method and device, electronic equipment and storage medium |
CN118053432B (en) * | 2024-03-21 | 2024-10-11 | 中科南京智能技术研究院 | Method for obtaining long and short speech universal speaker embedded layer model and speaker identification method |
Family Cites Families (55)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE69322894T2 (en) * | 1992-03-02 | 1999-07-29 | At & T Corp., New York, N.Y. | Learning method and device for speech recognition |
US5640429A (en) * | 1995-01-20 | 1997-06-17 | The United States Of America As Represented By The Secretary Of The Air Force | Multichannel non-gaussian receiver and method |
CN1302427A (en) | 1997-11-03 | 2001-07-04 | T-内提克斯公司 | Model adaptation system and method for speaker verification |
US6609093B1 (en) * | 2000-06-01 | 2003-08-19 | International Business Machines Corporation | Methods and apparatus for performing heteroscedastic discriminant analysis in pattern recognition systems |
US20030225719A1 (en) * | 2002-05-31 | 2003-12-04 | Lucent Technologies, Inc. | Methods and apparatus for fast and robust model training for object classification |
US9113001B2 (en) * | 2005-04-21 | 2015-08-18 | Verint Americas Inc. | Systems, methods, and media for disambiguating call data to determine fraud |
TWI297487B (en) * | 2005-11-18 | 2008-06-01 | Tze Fen Li | A method for speech recognition |
US9247056B2 (en) * | 2007-02-28 | 2016-01-26 | International Business Machines Corporation | Identifying contact center agents based upon biometric characteristics of an agent's speech |
US7958068B2 (en) * | 2007-12-12 | 2011-06-07 | International Business Machines Corporation | Method and apparatus for model-shared subspace boosting for multi-label classification |
EP2189976B1 (en) * | 2008-11-21 | 2012-10-24 | Nuance Communications, Inc. | Method for adapting a codebook for speech recognition |
FR2940498B1 (en) * | 2008-12-23 | 2011-04-15 | Thales Sa | METHOD AND SYSTEM FOR AUTHENTICATING A USER AND / OR CRYPTOGRAPHIC DATA |
EP2574216B1 (en) * | 2010-09-28 | 2015-11-25 | Max-Planck-Gesellschaft zur Förderung der Wissenschaften e.V. | Method and device for recovering a digital image from a sequence of observed digital images |
US8442823B2 (en) * | 2010-10-19 | 2013-05-14 | Motorola Solutions, Inc. | Methods for creating and searching a database of speakers |
US9679561B2 (en) * | 2011-03-28 | 2017-06-13 | Nuance Communications, Inc. | System and method for rapid customization of speech recognition models |
US9967218B2 (en) * | 2011-10-26 | 2018-05-08 | Oath Inc. | Online active learning in user-generated content streams |
US9042867B2 (en) * | 2012-02-24 | 2015-05-26 | Agnitio S.L. | System and method for speaker recognition on mobile devices |
US8527276B1 (en) * | 2012-10-25 | 2013-09-03 | Google Inc. | Speech synthesis using deep neural networks |
US9406298B2 (en) * | 2013-02-07 | 2016-08-02 | Nuance Communications, Inc. | Method and apparatus for efficient i-vector extraction |
US20140222423A1 (en) * | 2013-02-07 | 2014-08-07 | Nuance Communications, Inc. | Method and Apparatus for Efficient I-Vector Extraction |
CN103310788B (en) * | 2013-05-23 | 2016-03-16 | 北京云知声信息技术有限公司 | A kind of voice information identification method and system |
US9514753B2 (en) * | 2013-11-04 | 2016-12-06 | Google Inc. | Speaker identification using hash-based indexing |
US9311932B2 (en) * | 2014-01-23 | 2016-04-12 | International Business Machines Corporation | Adaptive pause detection in speech recognition |
US9542948B2 (en) * | 2014-04-09 | 2017-01-10 | Google Inc. | Text-dependent speaker identification |
US10073985B2 (en) * | 2015-02-27 | 2018-09-11 | Samsung Electronics Co., Ltd. | Apparatus and method for trusted execution environment file protection |
US9687208B2 (en) * | 2015-06-03 | 2017-06-27 | iMEDI PLUS Inc. | Method and system for recognizing physiological sound |
US9978374B2 (en) * | 2015-09-04 | 2018-05-22 | Google Llc | Neural networks for speaker verification |
US10262654B2 (en) * | 2015-09-24 | 2019-04-16 | Microsoft Technology Licensing, Llc | Detecting actionable items in a conversation among participants |
CN107274904A (en) * | 2016-04-07 | 2017-10-20 | 富士通株式会社 | Method for distinguishing speek person and Speaker Identification equipment |
CN105869630B (en) * | 2016-06-27 | 2019-08-02 | 上海交通大学 | Speaker voice spoofing attack detection method and system based on deep learning |
US10535000B2 (en) | 2016-08-08 | 2020-01-14 | Interactive Intelligence Group, Inc. | System and method for speaker change detection |
US9824692B1 (en) * | 2016-09-12 | 2017-11-21 | Pindrop Security, Inc. | End-to-end speaker recognition using deep neural network |
US10347256B2 (en) * | 2016-09-19 | 2019-07-09 | Pindrop Security, Inc. | Channel-compensated low-level features for speaker recognition |
US10553218B2 (en) | 2016-09-19 | 2020-02-04 | Pindrop Security, Inc. | Dimensionality reduction of baum-welch statistics for speaker recognition |
US10755718B2 (en) * | 2016-12-07 | 2020-08-25 | Interactive Intelligence Group, Inc. | System and method for neural network based speaker classification |
US10140980B2 (en) * | 2016-12-21 | 2018-11-27 | Google LCC | Complex linear projection for acoustic modeling |
CN108288470B (en) * | 2017-01-10 | 2021-12-21 | 富士通株式会社 | Voiceprint-based identity verification method and device |
CN106991312B (en) * | 2017-04-05 | 2020-01-10 | 百融云创科技股份有限公司 | Internet anti-fraud authentication method based on voiceprint recognition |
US11556794B2 (en) * | 2017-08-31 | 2023-01-17 | International Business Machines Corporation | Facilitating neural networks |
US10679129B2 (en) * | 2017-09-28 | 2020-06-09 | D5Ai Llc | Stochastic categorical autoencoder network |
JP6879433B2 (en) * | 2017-09-29 | 2021-06-02 | 日本電気株式会社 | Regression device, regression method, and program |
US20190213705A1 (en) * | 2017-12-08 | 2019-07-11 | Digimarc Corporation | Artwork generated to convey digital messages, and methods/apparatuses for generating such artwork |
CN108417217B (en) * | 2018-01-11 | 2021-07-13 | 思必驰科技股份有限公司 | Speaker recognition network model training method, speaker recognition method and system |
WO2019161011A1 (en) * | 2018-02-16 | 2019-08-22 | Dolby Laboratories Licensing Corporation | Speech style transfer |
US11468316B2 (en) * | 2018-03-13 | 2022-10-11 | Recogni Inc. | Cluster compression for compressing weights in neural networks |
US10347241B1 (en) * | 2018-03-23 | 2019-07-09 | Microsoft Technology Licensing, Llc | Speaker-invariant training via adversarial learning |
CN109065022B (en) * | 2018-06-06 | 2022-08-09 | 平安科技(深圳)有限公司 | Method for extracting i-vector, method, device, equipment and medium for speaker recognition |
CN109256139A (en) * | 2018-07-26 | 2019-01-22 | 广东工业大学 | A kind of method for distinguishing speek person based on Triplet-Loss |
CN110289003B (en) * | 2018-10-10 | 2021-10-29 | 腾讯科技(深圳)有限公司 | Voiceprint recognition method, model training method and server |
CN110288979B (en) * | 2018-10-25 | 2022-07-05 | 腾讯科技(深圳)有限公司 | Voice recognition method and device |
US10510002B1 (en) * | 2019-02-14 | 2019-12-17 | Capital One Services, Llc | Stochastic gradient boosting for deep neural networks |
CN110136729B (en) * | 2019-03-27 | 2021-08-20 | 北京奇艺世纪科技有限公司 | Model generation method, audio processing method, device and computer-readable storage medium |
CN109903774A (en) * | 2019-04-12 | 2019-06-18 | 南京大学 | A Voiceprint Recognition Method Based on Angular Separation Loss Function |
US10878575B2 (en) * | 2019-04-15 | 2020-12-29 | Adobe Inc. | Foreground-aware image inpainting |
CN110223699B (en) * | 2019-05-15 | 2021-04-13 | 桂林电子科技大学 | Speaker identity confirmation method, device and storage medium |
SG11202010803VA (en) * | 2019-10-31 | 2020-11-27 | Alipay Hangzhou Inf Tech Co Ltd | System and method for determining voice characteristics |
-
2019
- 2019-10-31 SG SG11202010803VA patent/SG11202010803VA/en unknown
- 2019-10-31 WO PCT/CN2019/114812 patent/WO2020035085A2/en active Application Filing
- 2019-10-31 CN CN201980011206.5A patent/CN111712874B/en active Active
-
2020
- 2020-01-09 SG SG11202013135XA patent/SG11202013135XA/en unknown
- 2020-01-09 WO PCT/CN2020/071194 patent/WO2020098828A2/en active Application Filing
- 2020-01-09 CN CN202080000759.3A patent/CN111418009B/en active Active
- 2020-08-25 TW TW109128922A patent/TWI737462B/en active
- 2020-10-27 US US17/081,956 patent/US10997980B2/en active Active
- 2020-12-22 US US17/131,182 patent/US11031018B2/en active Active
-
2021
- 2021-03-22 US US17/208,294 patent/US11244689B2/en active Active
Also Published As
Publication number | Publication date |
---|---|
US20210110833A1 (en) | 2021-04-15 |
WO2020035085A2 (en) | 2020-02-20 |
CN111418009B (en) | 2023-09-05 |
CN111712874B (en) | 2023-07-14 |
WO2020098828A3 (en) | 2020-09-03 |
CN111418009A (en) | 2020-07-14 |
CN111712874A (en) | 2020-09-25 |
TW202119393A (en) | 2021-05-16 |
US10997980B2 (en) | 2021-05-04 |
US20210043216A1 (en) | 2021-02-11 |
US11244689B2 (en) | 2022-02-08 |
SG11202013135XA (en) | 2021-01-28 |
US20210210101A1 (en) | 2021-07-08 |
TWI737462B (en) | 2021-08-21 |
WO2020035085A3 (en) | 2020-08-20 |
WO2020098828A2 (en) | 2020-05-22 |
US11031018B2 (en) | 2021-06-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
SG11202010803VA (en) | System and method for determining voice characteristics | |
SG11202006772QA (en) | System and method for decentralized-identifier creation | |
EP3736684A4 (en) | Method and system for performing voice command | |
EP3917082A4 (en) | Method and apparatus for determining configuration resource | |
GB2596770B (en) | Carrier-resolved photo-hall system and method | |
EP3873157A4 (en) | Method and apparatus for determining uplink | |
EP3709249A4 (en) | System for providing user-customized last and method therefor | |
SG11202012810TA (en) | System and method for storage | |
EP3874580A4 (en) | System and method for determining q factor | |
GB2582952B (en) | Audio contribution identification system and method | |
EP3976825A4 (en) | Systems and methods for determining sequence | |
SG11202110703TA (en) | Method for determining reference value and terminal | |
EP3951311A4 (en) | Measurement system and measurement method | |
GB2585087B (en) | Positioning system and method | |
GB201901644D0 (en) | Testing system and method | |
EP3815028A4 (en) | Method and system for risk determination | |
SG10201805515QA (en) | Method and system for crediting account | |
GB2590126B (en) | Navigation system and method | |
GB202013413D0 (en) | Method and system for evaluating project based on blockchain | |
GB2600580B (en) | System and method for preparing MRNA | |
GB2588760B (en) | Interface system and corresponding method | |
GB2586451B (en) | Sound prioritisation system and method | |
SI3966585T1 (en) | Calibration system and method | |
EP4044476A4 (en) | Method and apparatus for determining resources | |
GB201817593D0 (en) | An ATM-requesting-and-accessing system and method |