[go: up one dir, main page]

SG11202010803VA - System and method for determining voice characteristics - Google Patents

System and method for determining voice characteristics

Info

Publication number
SG11202010803VA
SG11202010803VA SG11202010803VA SG11202010803VA SG11202010803VA SG 11202010803V A SG11202010803V A SG 11202010803VA SG 11202010803V A SG11202010803V A SG 11202010803VA SG 11202010803V A SG11202010803V A SG 11202010803VA SG 11202010803V A SG11202010803V A SG 11202010803VA
Authority
SG
Singapore
Prior art keywords
voice characteristics
determining voice
determining
voice
Prior art date
Application number
SG11202010803VA
Inventor
Zhiming Wang
Kaisheng Yao
Xiaolong Li
Original Assignee
Alipay Hangzhou Inf Tech Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alipay Hangzhou Inf Tech Co Ltd filed Critical Alipay Hangzhou Inf Tech Co Ltd
Publication of SG11202010803VA publication Critical patent/SG11202010803VA/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/04Training, enrolment or model building
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/02Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/06Decision making techniques; Pattern matching strategies
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/18Artificial neural networks; Connectionist approaches
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/20Pattern transformations or operations aimed at increasing system robustness, e.g. against channel noise or different working conditions

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Business, Economics & Management (AREA)
  • Game Theory and Decision Science (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Machine Translation (AREA)
  • Image Analysis (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • Testing, Inspecting, Measuring Of Stereoscopic Televisions And Televisions (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Telephonic Communication Services (AREA)
SG11202010803VA 2019-10-31 2019-10-31 System and method for determining voice characteristics SG11202010803VA (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2019/114812 WO2020035085A2 (en) 2019-10-31 2019-10-31 System and method for determining voice characteristics

Publications (1)

Publication Number Publication Date
SG11202010803VA true SG11202010803VA (en) 2020-11-27

Family

ID=69525955

Family Applications (2)

Application Number Title Priority Date Filing Date
SG11202010803VA SG11202010803VA (en) 2019-10-31 2019-10-31 System and method for determining voice characteristics
SG11202013135XA SG11202013135XA (en) 2019-10-31 2020-01-09 System and method for personalized speaker verification

Family Applications After (1)

Application Number Title Priority Date Filing Date
SG11202013135XA SG11202013135XA (en) 2019-10-31 2020-01-09 System and method for personalized speaker verification

Country Status (5)

Country Link
US (3) US10997980B2 (en)
CN (2) CN111712874B (en)
SG (2) SG11202010803VA (en)
TW (1) TWI737462B (en)
WO (2) WO2020035085A2 (en)

Families Citing this family (32)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108806696B (en) * 2018-05-08 2020-06-05 平安科技(深圳)有限公司 Method and device for establishing voiceprint model, computer equipment and storage medium
US11556848B2 (en) * 2019-10-21 2023-01-17 International Business Machines Corporation Resolving conflicts between experts' intuition and data-driven artificial intelligence models
SG11202010803VA (en) * 2019-10-31 2020-11-27 Alipay Hangzhou Inf Tech Co Ltd System and method for determining voice characteristics
US11651767B2 (en) 2020-03-03 2023-05-16 International Business Machines Corporation Metric learning of speaker diarization
US11443748B2 (en) * 2020-03-03 2022-09-13 International Business Machines Corporation Metric learning of speaker diarization
CN111833855B (en) * 2020-03-16 2024-02-23 南京邮电大学 Multi-to-multi speaker conversion method based on DenseNet STARGAN
CN111540367B (en) * 2020-04-17 2023-03-31 合肥讯飞数码科技有限公司 Voice feature extraction method and device, electronic equipment and storage medium
CN111524525B (en) * 2020-04-28 2023-06-16 平安科技(深圳)有限公司 Voiceprint recognition method, device, equipment and storage medium of original voice
US20220067279A1 (en) * 2020-08-31 2022-03-03 Recruit Co., Ltd., Systems and methods for multilingual sentence embeddings
US12165311B2 (en) * 2020-11-04 2024-12-10 Samsung Sds America, Inc. Unsupervised representation learning and active learning to improve data efficiency
CN112487384B (en) * 2020-11-25 2024-12-03 华为技术有限公司 Identity verification method and system
CN112418173A (en) * 2020-12-08 2021-02-26 北京声智科技有限公司 Abnormal sound recognition method, device and electronic device
CN113555032B (en) * 2020-12-22 2024-03-12 腾讯科技(深圳)有限公司 Multi-speaker scene recognition and network training method and device
US11605369B2 (en) * 2021-03-10 2023-03-14 Spotify Ab Audio translator
US11689868B2 (en) * 2021-04-26 2023-06-27 Mun Hoong Leong Machine learning based hearing assistance system
CN113345454B (en) * 2021-06-01 2024-02-09 平安科技(深圳)有限公司 Training and application methods, devices, equipment and storage medium of voice conversion model
CN114067803B (en) * 2021-10-21 2025-01-03 北京工业大学 A speaker verification method based on distance correlation metric learning
CN114023343B (en) * 2021-10-30 2024-04-30 西北工业大学 Voice conversion method based on semi-supervised feature learning
TWI795173B (en) * 2022-01-17 2023-03-01 中華電信股份有限公司 Multilingual speech recognition system, method and computer readable medium
CN114512128B (en) * 2022-02-08 2025-07-04 招商银行股份有限公司 Speech recognition method, device, equipment and computer-readable storage medium
CN114529191B (en) * 2022-02-16 2024-10-22 支付宝(杭州)信息技术有限公司 Method and device for risk identification
CN114566170B (en) * 2022-03-01 2024-12-10 北京邮电大学 A lightweight voice spoofing detection algorithm based on one-class classification
CN114639372B (en) * 2022-03-07 2024-10-25 哈尔滨理工大学 Language Identification Method Based on Adjusted Cosine Mutual Information Estimation
CN114694658B (en) * 2022-03-15 2025-05-02 青岛海尔科技有限公司 Speaker recognition model training, speaker recognition method and device
US12277939B2 (en) 2022-05-02 2025-04-15 Tencent America LLC Progressive contrastive learning framework for self-supervised speaker verification
US20230386479A1 (en) * 2022-05-27 2023-11-30 Tencent America LLC Techniques for improved zero-shot voice conversion with a conditional disentangled sequential variational auto-encoder
US20230402041A1 (en) * 2022-06-10 2023-12-14 International Business Machines Corporation Individual recognition using voice detection
CN115035890B (en) * 2022-06-23 2023-12-05 北京百度网讯科技有限公司 Training method and device of voice recognition model, electronic equipment and storage medium
TWI878975B (en) * 2023-07-13 2025-04-01 新唐科技股份有限公司 Speech recognition device and method
CN116863961A (en) * 2023-08-07 2023-10-10 中国信息通信研究院 A method of forgery detection based on language processing
CN117495571B (en) * 2023-12-28 2024-04-05 北京芯盾时代科技有限公司 Data processing method and device, electronic equipment and storage medium
CN118053432B (en) * 2024-03-21 2024-10-11 中科南京智能技术研究院 Method for obtaining long and short speech universal speaker embedded layer model and speaker identification method

Family Cites Families (55)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE69322894T2 (en) * 1992-03-02 1999-07-29 At & T Corp., New York, N.Y. Learning method and device for speech recognition
US5640429A (en) * 1995-01-20 1997-06-17 The United States Of America As Represented By The Secretary Of The Air Force Multichannel non-gaussian receiver and method
CN1302427A (en) 1997-11-03 2001-07-04 T-内提克斯公司 Model adaptation system and method for speaker verification
US6609093B1 (en) * 2000-06-01 2003-08-19 International Business Machines Corporation Methods and apparatus for performing heteroscedastic discriminant analysis in pattern recognition systems
US20030225719A1 (en) * 2002-05-31 2003-12-04 Lucent Technologies, Inc. Methods and apparatus for fast and robust model training for object classification
US9113001B2 (en) * 2005-04-21 2015-08-18 Verint Americas Inc. Systems, methods, and media for disambiguating call data to determine fraud
TWI297487B (en) * 2005-11-18 2008-06-01 Tze Fen Li A method for speech recognition
US9247056B2 (en) * 2007-02-28 2016-01-26 International Business Machines Corporation Identifying contact center agents based upon biometric characteristics of an agent's speech
US7958068B2 (en) * 2007-12-12 2011-06-07 International Business Machines Corporation Method and apparatus for model-shared subspace boosting for multi-label classification
EP2189976B1 (en) * 2008-11-21 2012-10-24 Nuance Communications, Inc. Method for adapting a codebook for speech recognition
FR2940498B1 (en) * 2008-12-23 2011-04-15 Thales Sa METHOD AND SYSTEM FOR AUTHENTICATING A USER AND / OR CRYPTOGRAPHIC DATA
EP2574216B1 (en) * 2010-09-28 2015-11-25 Max-Planck-Gesellschaft zur Förderung der Wissenschaften e.V. Method and device for recovering a digital image from a sequence of observed digital images
US8442823B2 (en) * 2010-10-19 2013-05-14 Motorola Solutions, Inc. Methods for creating and searching a database of speakers
US9679561B2 (en) * 2011-03-28 2017-06-13 Nuance Communications, Inc. System and method for rapid customization of speech recognition models
US9967218B2 (en) * 2011-10-26 2018-05-08 Oath Inc. Online active learning in user-generated content streams
US9042867B2 (en) * 2012-02-24 2015-05-26 Agnitio S.L. System and method for speaker recognition on mobile devices
US8527276B1 (en) * 2012-10-25 2013-09-03 Google Inc. Speech synthesis using deep neural networks
US9406298B2 (en) * 2013-02-07 2016-08-02 Nuance Communications, Inc. Method and apparatus for efficient i-vector extraction
US20140222423A1 (en) * 2013-02-07 2014-08-07 Nuance Communications, Inc. Method and Apparatus for Efficient I-Vector Extraction
CN103310788B (en) * 2013-05-23 2016-03-16 北京云知声信息技术有限公司 A kind of voice information identification method and system
US9514753B2 (en) * 2013-11-04 2016-12-06 Google Inc. Speaker identification using hash-based indexing
US9311932B2 (en) * 2014-01-23 2016-04-12 International Business Machines Corporation Adaptive pause detection in speech recognition
US9542948B2 (en) * 2014-04-09 2017-01-10 Google Inc. Text-dependent speaker identification
US10073985B2 (en) * 2015-02-27 2018-09-11 Samsung Electronics Co., Ltd. Apparatus and method for trusted execution environment file protection
US9687208B2 (en) * 2015-06-03 2017-06-27 iMEDI PLUS Inc. Method and system for recognizing physiological sound
US9978374B2 (en) * 2015-09-04 2018-05-22 Google Llc Neural networks for speaker verification
US10262654B2 (en) * 2015-09-24 2019-04-16 Microsoft Technology Licensing, Llc Detecting actionable items in a conversation among participants
CN107274904A (en) * 2016-04-07 2017-10-20 富士通株式会社 Method for distinguishing speek person and Speaker Identification equipment
CN105869630B (en) * 2016-06-27 2019-08-02 上海交通大学 Speaker voice spoofing attack detection method and system based on deep learning
US10535000B2 (en) 2016-08-08 2020-01-14 Interactive Intelligence Group, Inc. System and method for speaker change detection
US9824692B1 (en) * 2016-09-12 2017-11-21 Pindrop Security, Inc. End-to-end speaker recognition using deep neural network
US10347256B2 (en) * 2016-09-19 2019-07-09 Pindrop Security, Inc. Channel-compensated low-level features for speaker recognition
US10553218B2 (en) 2016-09-19 2020-02-04 Pindrop Security, Inc. Dimensionality reduction of baum-welch statistics for speaker recognition
US10755718B2 (en) * 2016-12-07 2020-08-25 Interactive Intelligence Group, Inc. System and method for neural network based speaker classification
US10140980B2 (en) * 2016-12-21 2018-11-27 Google LCC Complex linear projection for acoustic modeling
CN108288470B (en) * 2017-01-10 2021-12-21 富士通株式会社 Voiceprint-based identity verification method and device
CN106991312B (en) * 2017-04-05 2020-01-10 百融云创科技股份有限公司 Internet anti-fraud authentication method based on voiceprint recognition
US11556794B2 (en) * 2017-08-31 2023-01-17 International Business Machines Corporation Facilitating neural networks
US10679129B2 (en) * 2017-09-28 2020-06-09 D5Ai Llc Stochastic categorical autoencoder network
JP6879433B2 (en) * 2017-09-29 2021-06-02 日本電気株式会社 Regression device, regression method, and program
US20190213705A1 (en) * 2017-12-08 2019-07-11 Digimarc Corporation Artwork generated to convey digital messages, and methods/apparatuses for generating such artwork
CN108417217B (en) * 2018-01-11 2021-07-13 思必驰科技股份有限公司 Speaker recognition network model training method, speaker recognition method and system
WO2019161011A1 (en) * 2018-02-16 2019-08-22 Dolby Laboratories Licensing Corporation Speech style transfer
US11468316B2 (en) * 2018-03-13 2022-10-11 Recogni Inc. Cluster compression for compressing weights in neural networks
US10347241B1 (en) * 2018-03-23 2019-07-09 Microsoft Technology Licensing, Llc Speaker-invariant training via adversarial learning
CN109065022B (en) * 2018-06-06 2022-08-09 平安科技(深圳)有限公司 Method for extracting i-vector, method, device, equipment and medium for speaker recognition
CN109256139A (en) * 2018-07-26 2019-01-22 广东工业大学 A kind of method for distinguishing speek person based on Triplet-Loss
CN110289003B (en) * 2018-10-10 2021-10-29 腾讯科技(深圳)有限公司 Voiceprint recognition method, model training method and server
CN110288979B (en) * 2018-10-25 2022-07-05 腾讯科技(深圳)有限公司 Voice recognition method and device
US10510002B1 (en) * 2019-02-14 2019-12-17 Capital One Services, Llc Stochastic gradient boosting for deep neural networks
CN110136729B (en) * 2019-03-27 2021-08-20 北京奇艺世纪科技有限公司 Model generation method, audio processing method, device and computer-readable storage medium
CN109903774A (en) * 2019-04-12 2019-06-18 南京大学 A Voiceprint Recognition Method Based on Angular Separation Loss Function
US10878575B2 (en) * 2019-04-15 2020-12-29 Adobe Inc. Foreground-aware image inpainting
CN110223699B (en) * 2019-05-15 2021-04-13 桂林电子科技大学 Speaker identity confirmation method, device and storage medium
SG11202010803VA (en) * 2019-10-31 2020-11-27 Alipay Hangzhou Inf Tech Co Ltd System and method for determining voice characteristics

Also Published As

Publication number Publication date
US20210110833A1 (en) 2021-04-15
WO2020035085A2 (en) 2020-02-20
CN111418009B (en) 2023-09-05
CN111712874B (en) 2023-07-14
WO2020098828A3 (en) 2020-09-03
CN111418009A (en) 2020-07-14
CN111712874A (en) 2020-09-25
TW202119393A (en) 2021-05-16
US10997980B2 (en) 2021-05-04
US20210043216A1 (en) 2021-02-11
US11244689B2 (en) 2022-02-08
SG11202013135XA (en) 2021-01-28
US20210210101A1 (en) 2021-07-08
TWI737462B (en) 2021-08-21
WO2020035085A3 (en) 2020-08-20
WO2020098828A2 (en) 2020-05-22
US11031018B2 (en) 2021-06-08

Similar Documents

Publication Publication Date Title
SG11202010803VA (en) System and method for determining voice characteristics
SG11202006772QA (en) System and method for decentralized-identifier creation
EP3736684A4 (en) Method and system for performing voice command
EP3917082A4 (en) Method and apparatus for determining configuration resource
GB2596770B (en) Carrier-resolved photo-hall system and method
EP3873157A4 (en) Method and apparatus for determining uplink
EP3709249A4 (en) System for providing user-customized last and method therefor
SG11202012810TA (en) System and method for storage
EP3874580A4 (en) System and method for determining q factor
GB2582952B (en) Audio contribution identification system and method
EP3976825A4 (en) Systems and methods for determining sequence
SG11202110703TA (en) Method for determining reference value and terminal
EP3951311A4 (en) Measurement system and measurement method
GB2585087B (en) Positioning system and method
GB201901644D0 (en) Testing system and method
EP3815028A4 (en) Method and system for risk determination
SG10201805515QA (en) Method and system for crediting account
GB2590126B (en) Navigation system and method
GB202013413D0 (en) Method and system for evaluating project based on blockchain
GB2600580B (en) System and method for preparing MRNA
GB2588760B (en) Interface system and corresponding method
GB2586451B (en) Sound prioritisation system and method
SI3966585T1 (en) Calibration system and method
EP4044476A4 (en) Method and apparatus for determining resources
GB201817593D0 (en) An ATM-requesting-and-accessing system and method