Audio-Indexing For Broadcast News. Dharanipragada et al., 1998
- Document ID
- 17206234363049587075
- Author
- Dharanipragada S
- Franz M
- Roukos S
- Publication year
- 1998
- Publication venue
- TREC
Snippet
In this paper we describe the IBM Audio-Indexing System, which combines a large-vocabulary speech recognizer with a text-based information retrieval system. Our speech recognizer was used to produce the baseline transcripts for the NIST SDR97 evaluation. We …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
- G06F17/30657—Query processing
- G06F17/30675—Query execution
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30613—Indexing
- G06F17/30619—Indexing indexing structures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30781—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F17/30784—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
- G06F17/30796—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre using original textual content or text extracted from visual content or transcript of audio data
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30861—Retrieval from the Internet, e.g. browsers
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
Similar Documents
Publication | Title | Publication Date |
---|---|---|
Allan et al. | Inquery and TREC-7 | |
US6345253B1 (en) | Method and apparatus for retrieving audio information using primary and supplemental indexes | |
KR100388344B1 (en) | Method and apparatus for retrieving audio information using content and speaker information | |
Dowman et al. | Web-assisted annotation, semantic indexing and search of television and radio news | |
US7953751B2 (en) | System and method for audio hot spotting | |
US7761298B1 (en) | Document expansion in speech retrieval | |
Wechsler et al. | New techniques for open-vocabulary spoken document retrieval | |
Kubala et al. | Integrated technologies for indexing spoken language | |
James | A system for unrestricted topic retrieval from radio news broadcasts | |
Dharanipragada et al. | Audio-Indexing For Broadcast News. | |
Ng | Information fusion for spoken document retrieval | |
Chen et al. | Using information retrieval methods for language model adaptation. | |
Witbrock et al. | Speech recognition for a digital video library | |
Wechsler et al. | Speech retrieval based on automatic indexing | |
Audhkhasi et al. | Keyword search using modified minimum edit distance measure | |
Viswanathan et al. | Retrieval from spoken documents using content and speaker information | |
Martins et al. | Dynamic language modeling for a daily broadcast news transcription system | |
Dharanipragada et al. | Experimental results in audio indexing | |
Chen et al. | Improved spoken document retrieval by exploring extra acoustic and linguistic cues. | |
Suzuki et al. | Unsupervised language model adaptation based on automatic text collection from WWW. | |
Wechsler et al. | New approaches to spoken document retrieval | |
Wang | Mandarin spoken document retrieval based on syllable lattice matching | |
Illina et al. | Proper name retrieval from diachronic documents for automatic speech transcription using lexical and temporal context | |
Choi et al. | SCAN-speech content based audio navigator: a systems overview | |
Chen et al. | Language model adaptation for broadcast news transcription |