Freitag et al., 2007 - Google Patents

A sequence alignment model based on the averaged perceptron

Freitag et al., 2007

Document ID: 2676116871816395276
Author: Freitag D; Khadivi S
Publication year: 2007
Publication venue: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL)

External Links

Cited by

Snippet

We describe a discriminatively trained sequence alignment model based on the averaged perceptron. In common with other approaches to sequence modeling using perceptrons, and in contrast with comparable generative models, this model permits and transparently …

Continue reading at aclanthology.org (PDF) (other versions)

238000002864 sequence alignment 0 title abstract description 8

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
- G06F17/30657—Query processing
- G06F17/30675—Query execution
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2765—Recognition
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/187—Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2705—Parsing
- G06F17/2715—Statistical methods
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2809—Data driven translation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30613—Indexing
- G06F17/30619—Indexing indexing structures
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. hidden Markov models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
- G10L15/144—Training of HMMs
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/21—Text processing
- G06F17/22—Manipulating or registering by use of codes, e.g. in sequence of text characters
- G06F17/2217—Character encodings
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/085—Methods for reducing search complexity, pruning
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems

Similar Documents

Publication	Publication Date	Title
Abandah et al.	2015	Automatic diacritization of Arabic text using recurrent neural networks
Azmi et al.	2015	A survey of automatic Arabic diacritization techniques
Ghazvininejad et al.	2016	Generating topical poetry
Yadav et al.	2018	Deep affix features improve neural named entity recognizers
Alkanhal et al.	2012	Automatic stochastic arabic spelling correction with emphasis on space insertions and deletions
KR20230009564A (en)	2023-01-17	Learning data correction method and apparatus thereof using ensemble score
Khorsheed	2018	Diacritizing Arabic text using a single hidden Markov model
Freitag et al.	2007	A sequence alignment model based on the averaged perceptron
Nguyen et al.	2016	Text normalization for named entity recognition in Vietnamese tweets
JP5436307B2 (en)	2014-03-05	Similar document search device
Mabokela	2020	Phone clustering methods for multilingual language identification
Somsap et al.	2020	Isarn Dharma word segmentation using a statistical approach with named entity recognition
Göker et al.	2018	Neural text normalization for turkish social media
Elhadj	2009	Statistical part-of-speech tagger for traditional Arabic texts
Krantz et al.	2018	Syllabification by phone categorization
Banisakher et al.	2020	Improving the identification of the discourse function of news article paragraphs
Eger	2015	Multiple many-to-many sequence alignment for combining string-valued variables: A G2P experiment
Kang et al.	2000	Two approaches for the resolution of word mismatch problem caused by English words and foreign words in Korean information retrieval
Marchand et al.	2007	Evaluating automatic syllabification algorithms for English
Winata	2021	Multilingual transfer learning for code-switched language and speech neural modeling
Damper et al.	2002	A pronunciation-by-analogy module for the festival text-to-speech synthesiser
Eun et al.	2004	An information extraction approach for spoken language understanding.
Alfiansyah	2018	Partial greedy algorithm to extract a minimum phonetically-and-prosodically rich sentence set
Chubarian et al.	2021	Grouping Words with Semantic Diversity
Yamashita et al.	2018	A comparison of entity matching methods between English and Japanese katakana