Kadambi et al., 2023
Wav2DDK: analytical and clinical validation of an automated diadochokinetic rate estimation algorithm on remotely collected speech
- Document ID
- 7278981733314488921
- Author
- Kadambi P
- Stegmann G
- Liss J
- Berisha V
- Hahn S
- Publication year
- 2023
- Publication venue
- Journal of Speech, Language, and Hearing Research
Snippet
Purpose: Oral diadochokinesis is a useful task in assessment of speech motor function in the context of neurological disease. Remote collection of speech tasks provides a convenient alternative to in-clinic visits, but scoring these assessments can be a laborious process for …
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/66—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination for extracting parameters related to health condition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/90—Pitch determination of speech signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
Similar Documents
Publication | Title |
---|---|
US11545173B2 (en) | Automatic speech-based longitudinal emotion and mood recognition for mental health treatment |
Jeancolas et al. | X-vectors: new quantitative biomarkers for early Parkinson's disease detection from speech |
US10010288B2 (en) | Screening for neurological disease using speech articulation characteristics |
Le et al. | Automatic quantitative analysis of spontaneous aphasic speech |
Weiner et al. | Manual and Automatic Transcriptions in Dementia Detection from Speech |
Baghai-Ravary et al. | Automatic speech signal analysis for clinical diagnosis and assessment of speech disorders |
Wang et al. | Automatic prediction of intelligible speaking rate for individuals with ALS from speech acoustic and articulatory samples |
Orozco-Arroyave | Analysis of speech of people with Parkinson's disease |
McKechnie et al. | Automated speech analysis tools for children’s speech production: A systematic literature review |
JP2025506076A (en) | A multimodal system for voice-based mental health assessment with emotional stimuli and its uses |
Matton et al. | Into the wild: Transitioning from recognizing mood in clinical interactions to personal conversations for individuals with bipolar disorder |
Gosztolya et al. | Cross-lingual detection of mild cognitive impairment based on temporal parameters of spontaneous speech |
Romana et al. | Automatically detecting errors and disfluencies in read speech to predict cognitive impairment in people with Parkinson’s disease |
Kadambi et al. | Wav2DDK: analytical and clinical validation of an automated diadochokinetic rate estimation algorithm on remotely collected speech |
Tanchip et al. | Validating automatic diadochokinesis analysis methods across dysarthria severity and syllable task in amyotrophic lateral sclerosis |
Wang et al. | Automatic detection of putative mild cognitive impairment from speech acoustic features in Mandarin-speaking elders |
Lv et al. | Leveraging multimodal deep learning framework and a comprehensive audio-visual dataset to advance Parkinson’s detection |
Liss et al. | Operationalizing clinical speech analytics: Moving from features to measures for real-world clinical impact |
Vojtech et al. | Acoustic identification of the voicing boundary during intervocalic offsets and onsets based on vocal fold vibratory measures |
Lustyk et al. | Evaluation of disfluent speech by means of automatic acoustic measurements |
Cummins et al. | Quantitative assessment of interutterance stability: Application to dysarthria |
Al-Hammadi et al. | Machine Learning Approaches for Dementia Detection Through Speech and Gait Analysis: A Systematic Literature Review |
Hitczenko et al. | Speech characteristics yield important clues about motor function: Speech variability in individuals at clinical high-risk for psychosis |
Koniaris et al. | On mispronunciation analysis of individual foreign speakers using auditory periphery models |
Santos et al. | A domain-agnostic approach for opinion prediction on speech |