Bhatnagar et al., 2022 - Google Patents
DeepCon: An end-to-end multilingual toolkit for automatic minuting of multi-party dialoguesBhatnagar et al., 2022
View PDF- Document ID
- 3034260904724223929
- Author
- Bhatnagar A
- Bhavsar N
- Singh M
- Publication year
- Publication venue
- Proceedings of the 23rd Annual Meeting of the Special Interest Group on Discourse and Dialogue
External Links
Snippet
In this paper, we present our minuting tool DeepCon, an end-to-end toolkit for minuting the multiparty dialogues of meetings. It provides technological support for (multilingual) communication and collaboration, with a specific focus on Natural Language Processing …
- 238000003058 natural language processing 0 abstract description 3
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2872—Rule based translation
- G06F17/2881—Natural language generation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/289—Use of machine translation, e.g. multi-lingual retrieval, server side translation for client devices, real-time translation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2809—Data driven translation
- G06F17/2827—Example based machine translation; Alignment
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2765—Recognition
- G06F17/277—Lexical analysis, e.g. tokenisation, collocates
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2785—Semantic analysis
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Barrault et al. | Seamless: Multilingual Expressive and Streaming Speech Translation | |
AU2021202694B2 (en) | Facilitating end-to-end communications with automated assistants in multiple languages | |
US11354521B2 (en) | Facilitating communications with automated assistants in multiple languages | |
EP3353776B1 (en) | Detecting actionable items in a conversation among participants | |
Tur et al. | The CALO meeting assistant system | |
US10394963B2 (en) | Natural language processor for providing natural language signals in a natural language output | |
Ghosal et al. | Overview of the first shared task on automatic minuting (automin) at interspeech 2021 | |
Roddy et al. | Neural generation of dialogue response timings | |
KR20240024960A (en) | Robust direct speech-to-speech translation | |
Schneider et al. | Team zoom@ automin 2023: Utilizing topic segmentation and llm data augmentation for long-form meeting summarization | |
Franceschini et al. | Removing European language barriers with innovative machine translation technology | |
Avila et al. | Towards cross-language prosody transfer for dialog | |
Pieraccini et al. | Spoken language communication with machines: The long and winding road from research to business | |
Bhatnagar et al. | DeepCon: An end-to-end multilingual toolkit for automatic minuting of multi-party dialogues | |
Yusufali et al. | Refining Text Input For Augmentative and Alternative Communication (AAC) Devices: Analysing Language Model Layers For Optimisation | |
Joseph et al. | Conversational agents and chatbots: current trends | |
Rousseau et al. | Darbarer@ AutoMin2023: Transcription simplification for concise minute generation from multi-party conversations | |
Balaskas et al. | An end-to-end system for transcription, translation, and summarization to support the co-creation process. A Health CASCADE Study. | |
Vasilateanu et al. | Call-center virtual assistant using natural language processing and speech recognition | |
Le-Duc et al. | Real-time Speech Summarization for Medical Conversations | |
KR20180121120A (en) | A machine learning based voice ordering system that can combine voice, text, visual interfaces to purchase products through mobile divices | |
Obaidah et al. | A New Benchmark for Evaluating Automatic Speech Recognition in the Arabic Call Domain | |
Sakti et al. | Distributed speech translation technologies for multiparty multilingual communication | |
Papi et al. | How" Real" is Your Real-Time Simultaneous Speech-to-Text Translation System? | |
Bhatnagar et al. | An End-to-End Multilingual System for Automatic Minuting of Multi-Party Dialogues |