Asemi et al., 2019 - Google Patents

Adaptive neuro-fuzzy inference system for evaluating dysarthric automatic speech recognition (ASR) systems: a case study on MVML-based ASR

Document ID: 10107235515850670868
Author: Asemi A; Salim S; Shahamiri S; Asemi A; Houshangi N
Publication year: 2019
Publication venue: Soft Computing

Snippet

Due to improvements in dysarthric automatic speech recognition (ASR) over the last few decades, the demand for assessment and evaluation of such technologies has increased significantly. Evaluation methods for ASR systems are now required to consider multiple qualitative …

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1822—Parsing for meaning understanding
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
- G06N3/082—Learning methods modifying the architecture, e.g. adding or deleting nodes or connections, pruning
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6268—Classification techniques relating to the classification paradigm, e.g. parametric or non-parametric approaches
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/66—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination for extracting parameters related to health condition

Similar Documents

Publication	Publication Date	Title
Asemi et al.	2019	Adaptive neuro-fuzzy inference system for evaluating dysarthric automatic speech recognition (ASR) systems: a case study on MVML-based ASR
Huang et al.	2019	Feature fusion methods research based on deep belief networks for speech emotion recognition under noise condition
US11755912B2 (en)	2023-09-12	Controlling distribution of training data to members of an ensemble
Kakuba et al.	2022	Deep learning-based speech emotion recognition using multi-level fusion of concurrent features
Gharavian et al.	2017	Audio-visual emotion recognition using FCBF feature selection method and particle swarm optimization for fuzzy ARTMAP neural networks
Ganchev et al.	2007	Generalized locally recurrent probabilistic neural networks with application to text-independent speaker verification
KR100306848B1 (en)	2001-09-24	A selective attention method using neural networks
Lee et al.	2021	Deep representation learning for affective speech signal analysis and processing: Preventing unwanted signal disparities
CN116244474A (en)	2023-06-09	Learner learning state acquisition method based on multi-mode emotion feature fusion
CN118918883B (en)	2024-12-13	Scene-based voice recognition method and device
Sadeghi et al.	2017	Optimal MFCC features extraction by differential evolution algorithm for speaker recognition
Thakur et al.	2019	Speech emotion recognition: A review
Gupta et al.	2015	Speech emotion recognition using svm with thresholding fusion
Gudmalwar et al.	2019	Improving the performance of the speaker emotion recognition based on low dimension prosody features vector
Shah et al.	2019	Articulation constrained learning with application to speech emotion recognition
Banerjee et al.	2022	Intelligent stuttering speech recognition: A succinct review
Young et al.	2016	Evaluation of statistical pomdp-based dialogue systems in noisy environments
CN110363074A (en)	2019-10-22	A human-like recognition and interaction method for complex abstract things
Zoughi et al.	2019	A gender-aware deep neural network structure for speech recognition
Aly et al.	2015	An online fuzzy-based approach for human emotions detection: an overview on the human cognitive model of understanding and generating multimodal actions
Jolad et al.	2021	ANNs for automatic speech recognition—a survey
Singh et al.	2015	Human perception based criminal identification through human robot interaction
Patnaik et al.	2017	Recent Developments in Intelligent Computing, Communication and Devices: Proceedings of ICCD 2017
Wendemuth et al.	2017	Emotion recognition from speech
Du et al.	2019	Bag-of-acoustic-words for mental health assessment: A deep autoencoding approach