Ponting et al., 2016 - Google Patents
At least” operator for combining audio search hitsPonting et al., 2016
- Document ID
- 2689927738268779021
- Author
- Ponting K
- Aurix Limited (Worcestershire, GB)
- BAKER M
- Publication year
External Links
Snippet
System and method to search audio data, including: receiving audio data representing speech; receiving a search query related to the audio data; compiling, by use of a processor, the search query into a hierarchy of scored speech recognition sub-searches; searching, by …
- 239000000203 mixture 0 abstract description 60
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G06F17/30386—Retrieval requests
- G06F17/30424—Query processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G06F17/30289—Database design, administration or maintenance
- G06F17/30303—Improving data quality; Data cleansing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30781—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F17/30784—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2022095368A1 (en) | Question-answer corpus generation method and device based on text generation model | |
CN109840287B (en) | Cross-modal information retrieval method and device based on neural network | |
US9535987B2 (en) | “At least” operator for combining audio search hits | |
US11520971B2 (en) | System and method for artificial intelligence story generation allowing content introduction | |
JP7162648B2 (en) | Systems and methods for intent discovery from multimedia conversations | |
US11604926B2 (en) | Method and system of creating and summarizing unstructured natural language sentence clusters for efficient tagging | |
US8793130B2 (en) | Confidence measure generation for speech related searching | |
US9373086B1 (en) | Crowdsource reasoning process to facilitate question answering | |
US10430405B2 (en) | Apply corrections to an ingested corpus | |
US12229108B2 (en) | Efficient embedding table storage and lookup | |
US9646260B1 (en) | Using existing relationships in a knowledge base to identify types of knowledge for addition to the knowledge base | |
Huang et al. | Adapting pretrained transformer to lattices for spoken language understanding | |
US10970488B2 (en) | Finding of asymmetric relation between words | |
US11553085B2 (en) | Method and apparatus for predicting customer satisfaction from a conversation | |
US7949651B2 (en) | Disambiguating residential listing search results | |
US20230297778A1 (en) | Identifying high effort statements for call center summaries | |
US20240062012A1 (en) | Automated system and method to prioritize language model and ontology expansion and pruning | |
Yang et al. | Extracting commonsense properties from embeddings with limited human guidance | |
JP2007219947A (en) | Causal relationship knowledge extraction apparatus and program | |
US11990131B2 (en) | Method for processing a video file comprising audio content and visual content comprising text content | |
US20220309413A1 (en) | Method and apparatus for automated workflow guidance to an agent in a call center environment | |
CN114091447A (en) | Text recognition method, device and equipment | |
Ponting et al. | At least” operator for combining audio search hits | |
Karakos et al. | Estimating document frequencies in a speech corpus | |
Im et al. | Multilayer CARU model for text summarization |