Study of Speech Syllables using LORENZ Model for Nonlinear Analysis
Arrieta et al.
- Document ID
- 7773798454362206383
- Author
- Arrieta V
- Licona F
- Licona A
Snippet
Nonlinear methods for signal analysis have been a research area in which techniques try to overcome the limitations of linear techniques. Speech signals have characteristics considered nonlinear, for example high frequency and short duration …
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/10—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a multipulse excitation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/93—Discriminating between voiced and unvoiced parts of speech signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/27—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the analysis technique
- G10L25/30—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the analysis technique using neural networks
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/90—Pitch determination of speech signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
Similar Documents
Publication | Title | Publication Date |
---|---|---|
EP4177882B1 (en) | Methods and systems for synthesising speech from text | |
Wang et al. | Playing technique recognition by joint time–frequency scattering | |
US20030130846A1 (en) | Speech processing with hmm trained on tespar parameters | |
Theodorou et al. | Automatic sound recognition of urban environment events | |
Arrieta et al. | Study of Speech Syllables using LORENZ Model for Nonlinear Analysis | |
Sundar et al. | A mixture model approach for formant tracking and the robustness of student's-t distribution | |
Lim et al. | Sound event detection in domestic environments using ensemble of convolutional recurrent neural networks | |
Ferroudj | Detection of rain in acoustic recordings of the environment using machine learning techniques | |
Kubin et al. | Identification of nonlinear oscillator models for speech analysis and synthesis | |
Staroletov | A hierarchical temporal memory model in the sense of Hawkins | |
KR102767013B1 (en) | Encoding method and decoding method for audio signal, and encoder and decoder | |
Niranjan et al. | Temporal decomposition: a framework for enhanced speech recognition | |
Rajesh et al. | Preventing Illegal Deforestation using Acoustic Surveillance | |
Prajith | Investigations on the applications of dynamical instabilities and deterministic chaos for speech signal processing | |
Vargas et al. | Cascade prediction filters with adaptive zeros to track the time-varying resonances of the vocal tract | |
Bollapragada | Towards Efficient Deep Learning Based Siren Detection | |
Gao et al. | Modeling of speech signals using an optimal neural network structure based on the PMDL principle | |
Heynderickx | Deep Learning for Security Applications: the Sound | |
Talukdar et al. | Recognition of Assamese Spoken Words using a Hybrid Neural Framework and Clustering Aided Apriori Knowledge | |
Досбаев et al. | AUDIOSIGNAL BASED EVENT DETECTION USING DEEP LEARNING TECHNIQUES | |
De Luca | Explaining black-box models in the context of audio classification | |
Love | A speech recognition system using a neural network model for vocal shaping | |
Carreira-Perpinán | One-to-many mappings, continuity constraints and latent variable models | |
Shehab et al. | Classifying Bird Songs Based on Chroma and Spectrogram Feature Extraction | |
Pompe | A tool to measure dependencies in data sequences |