Dialogue-act tagging using smart feature selection; results on multiple corpora
Verbree et al., 2006
- Document ID
- 2215828925874010278
- Author
- Verbree D
- Rienks R
- Heylen D
- Publication year
- 2006
- Publication venue
- 2006 IEEE Spoken Language Technology Workshop
Snippet
This paper presents an overview of our ongoing work on dialogue-act classification. Results are presented on the ICSI and Switchboard corpora and on a selection of the AMI corpus, setting a baseline for forthcoming research. For these corpora the best accuracy scores obtained are …
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2765—Recognition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/68—Methods or arrangements for recognition using electronic means using sequential comparisons of the image signals with a plurality of references in which the sequence of the image signals or the references is relevant, e.g. addressable memory
- G06K9/6807—Dividing the references in groups prior to recognition, the recognition taking place in steps; Selecting relevant dictionaries
- G06K9/6842—Dividing the references in groups prior to recognition, the recognition taking place in steps; Selecting relevant dictionaries according to the linguistic properties, e.g. English, German
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
Similar Documents
Publication | Title | Publication Date |
---|---|---|
Verbree et al. | Dialogue-act tagging using smart feature selection; results on multiple corpora | |
Makhoul et al. | Speech and language technologies for audio indexing and retrieval | |
Christensen et al. | Punctuation annotation using statistical prosody models. | |
Bulyko et al. | A bootstrapping approach to automating prosodic annotation for limited-domain synthesis | |
Shafran et al. | Voice signatures | |
Zue | Toward systems that understand spoken language | |
Byrne et al. | Automatic recognition of spontaneous speech for access to multilingual oral history archives | |
Devillers et al. | Emotion detection in task-oriented spoken dialogues | |
Murray et al. | Evaluating automatic summaries of meeting recordings | |
Howell et al. | Development of a two-stage procedure for the automatic recognition of dysfluencies in the speech of children who stutter: I. Psychometric procedures appropriate for selection of training material for lexical dysfluency classifiers | |
Levitan et al. | Combining Acoustic-Prosodic, Lexical, and Phonotactic Features for Automatic Deception Detection. | |
Vaudable et al. | Negative emotions detection as an indicator of dialogs quality in call centers | |
Koumpis et al. | Automatic summarization of voicemail messages using lexical and prosodic features | |
Parlikar et al. | Data-driven phrasing for speech synthesis in low-resource languages | |
Dalva et al. | Effective semi-supervised learning strategies for automatic sentence segmentation | |
CN110675292A (en) | Child language ability evaluation method based on artificial intelligence | |
Hazen et al. | Topic modeling for spoken documents using only phonetic information | |
Boakye et al. | Any questions? Automatic question detection in meetings | |
Faria | Accent classification for speech recognition | |
Zhang et al. | Automatic parliamentary meeting minute generation using rhetorical structure modeling | |
Lagus et al. | Topic identification in natural language dialogues using neural networks | |
Hillard et al. | Impact of automatic comma prediction on POS/name tagging of speech | |
Jonson | Dialogue context-based re-ranking of ASR hypotheses | |
Wu et al. | Using a knowledge base to automatically annotate speech corpora and to identify sociolinguistic variation | |
CN1198260C (en) | Method for speech recognition system recognizing multiple languages |