Müller et al., 2016 - Google Patents

Lecture translator-speech translation framework for simultaneous lecture translation

Müller et al., 2016

Document ID: 17753603074673636302
Author: Müller M; Nguyen T; Niehues J; Cho E; Krüger B; Ha T; Kilgour K; Sperber M; Mediani M; Stüker S; Waibel A
Publication year: 2016
Publication venue: Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Demonstrations

External Links

Cited by

Snippet

Foreign students at German universities often have difficulties following lectures as they are often held in German. Since human interpreters are too expensive for universities we are addressing this problem via speech translation technology deployed in KIT's lecture halls …

Continue reading at aclanthology.org (PDF) (other versions)

230000011218 segmentation 0 abstract description 6

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/289—Use of machine translation, e.g. multi-lingual retrieval, server side translation for client devices, real-time translation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2809—Data driven translation
- G06F17/2827—Example based machine translation; Alignment
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2809—Data driven translation
- G06F17/2845—Using very large corpora, e.g. the world wide web [WWW]
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2872—Rule based translation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2863—Processing of non-latin text
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2705—Parsing
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30861—Retrieval from the Internet, e.g. browsers
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/005—Language recognition
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility

Similar Documents

Publication	Publication Date	Title
TWI684881B (en)	2020-02-11	Method, system and non-transitory machine-readable medium for generating a conversational agentby automatic paraphrase generation based on machine translation
US9646001B2 (en)	2017-05-09	Machine translation (MT) based spoken dialog systems customer/machine dialog
AU2018260853B2 (en)	2020-01-16	Customized virtual reality learning environment
US9864745B2 (en)	2018-01-09	Universal language translator
Müller et al.	2016	Lecture translator-speech translation framework for simultaneous lecture translation
Nishimura et al.	2019	Multi-source neural machine translation with missing data
Cho et al.	2013	A real-world system for simultaneous translation of German lectures.
Jabaian et al.	2012	Comparison and combination of lightly supervised approaches for language portability of a spoken language understanding system
KR20230088434A (en)	2023-06-19	Improving cross-lingual speech synthesis using speech recognition
WO2022238881A1 (en)	2022-11-17	Method and system for processing user inputs using natural language processing
Hu et al.	2020	Data Augmentation for Code-Switch Language Modeling by Fusing Multiple Text Generation Methods.
Punjabi et al.	2019	Language model bootstrapping using neural machine translation for conversational speech recognition
Prasad et al.	2013	BBN TransTalk: Robust multilingual two-way speech-to-speech translation for mobile platforms
CN117094329B (en)	2024-02-02	Voice translation method and device for solving voice ambiguity
Stepanov et al.	2013	Language style and domain adaptation for cross-language SLU porting
Fügen et al.	2006	Open domain speech translation: from seminars and speeches to lectures
US11984125B2 (en)	2024-05-14	Speech recognition using on-the-fly-constrained language model per utterance
KR20190052924A (en)	2019-05-17	Apparatus and method of an automatic simultaneous interpretation using presentation scripts analysis
US11238844B1 (en)	2022-02-01	Automatic turn-level language identification for code-switched dialog
Bach et al.	2009	Incremental adaptation of speech-to-speech translation
Thakkar et al.	2024	Deep Learning Based Sign Language Gesture Recognition with Multilingual Translation
Tsourakis et al.	2008	Building Mobile Spoken Dialogue Applications Using Regulus.
Chettiar et al.	2025	Multilingual Video Translation and Speech Synthesis: A Deep Learning Approach for Seamless Language Adaptation
Hanzlíček et al.	2021	Speakers Talking Foreign Languages in a Multi-lingual TTS System
Cámara et al.	2025	Open-Source System for Multilingual Translation and Cloned Speech Synthesis