Müller et al., 2016 - Google Patents
Lecture translator-speech translation framework for simultaneous lecture translationMüller et al., 2016
View PDF- Document ID
- 17753603074673636302
- Author
- Müller M
- Nguyen T
- Niehues J
- Cho E
- Krüger B
- Ha T
- Kilgour K
- Sperber M
- Mediani M
- Stüker S
- Waibel A
- Publication year
- Publication venue
- Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Demonstrations
External Links
Snippet
Foreign students at German universities often have difficulties following lectures as they are often held in German. Since human interpreters are too expensive for universities we are addressing this problem via speech translation technology deployed in KIT's lecture halls …
- 230000011218 segmentation 0 abstract description 6
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/289—Use of machine translation, e.g. multi-lingual retrieval, server side translation for client devices, real-time translation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2809—Data driven translation
- G06F17/2827—Example based machine translation; Alignment
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2809—Data driven translation
- G06F17/2845—Using very large corpora, e.g. the world wide web [WWW]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2872—Rule based translation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2863—Processing of non-latin text
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2705—Parsing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30861—Retrieval from the Internet, e.g. browsers
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/005—Language recognition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
Similar Documents
Publication | Publication Date | Title |
---|---|---|
TWI684881B (en) | Method, system and non-transitory machine-readable medium for generating a conversational agentby automatic paraphrase generation based on machine translation | |
US9646001B2 (en) | Machine translation (MT) based spoken dialog systems customer/machine dialog | |
AU2018260853B2 (en) | Customized virtual reality learning environment | |
US9864745B2 (en) | Universal language translator | |
Müller et al. | Lecture translator-speech translation framework for simultaneous lecture translation | |
Nishimura et al. | Multi-source neural machine translation with missing data | |
Cho et al. | A real-world system for simultaneous translation of German lectures. | |
Jabaian et al. | Comparison and combination of lightly supervised approaches for language portability of a spoken language understanding system | |
KR20230088434A (en) | Improving cross-lingual speech synthesis using speech recognition | |
WO2022238881A1 (en) | Method and system for processing user inputs using natural language processing | |
Hu et al. | Data Augmentation for Code-Switch Language Modeling by Fusing Multiple Text Generation Methods. | |
Punjabi et al. | Language model bootstrapping using neural machine translation for conversational speech recognition | |
Prasad et al. | BBN TransTalk: Robust multilingual two-way speech-to-speech translation for mobile platforms | |
CN117094329B (en) | Voice translation method and device for solving voice ambiguity | |
Stepanov et al. | Language style and domain adaptation for cross-language SLU porting | |
Fügen et al. | Open domain speech translation: from seminars and speeches to lectures | |
US11984125B2 (en) | Speech recognition using on-the-fly-constrained language model per utterance | |
KR20190052924A (en) | Apparatus and method of an automatic simultaneous interpretation using presentation scripts analysis | |
US11238844B1 (en) | Automatic turn-level language identification for code-switched dialog | |
Bach et al. | Incremental adaptation of speech-to-speech translation | |
Thakkar et al. | Deep Learning Based Sign Language Gesture Recognition with Multilingual Translation | |
Tsourakis et al. | Building Mobile Spoken Dialogue Applications Using Regulus. | |
Chettiar et al. | Multilingual Video Translation and Speech Synthesis: A Deep Learning Approach for Seamless Language Adaptation | |
Hanzlíček et al. | Speakers Talking Foreign Languages in a Multi-lingual TTS System | |
Cámara et al. | Open-Source System for Multilingual Translation and Cloned Speech Synthesis |