[go: up one dir, main page]

Joshi et al., 2003 - Google Patents

A phonemic code based scheme for effective processing of Indian Languages

Joshi et al., 2003

View PDF
Document ID
16927774813943940060
Author
Joshi R
Shoff K
Mudur S
Publication year
Publication venue
National Centre for Software Technology, Mumbai, 23rd Internationalization and Unicode Conference, Prague, Czech Republic

External Links

Snippet

The multitude of Indian languages and dialects are written using 9 scripts. While each of these scripts has been encoded separately in the Unicode scheme, applications supporting Indian languages are yet to be found on a number of standard platforms. One primary …
Continue reading at citeseerx.ist.psu.edu (PDF) (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/21Text processing
    • G06F17/22Manipulating or registering by use of codes, e.g. in sequence of text characters
    • G06F17/2217Character encodings
    • G06F17/2223Handling non-latin characters, e.g. kana-to-kanji conversion
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/21Text processing
    • G06F17/22Manipulating or registering by use of codes, e.g. in sequence of text characters
    • G06F17/2264Transformation
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/21Text processing
    • G06F17/211Formatting, i.e. changing of presentation of document
    • G06F17/214Font handling; Temporal and kinetic typography
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/28Processing or translating of natural language
    • G06F17/2872Rule based translation
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/28Processing or translating of natural language
    • G06F17/2863Processing of non-latin text
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/27Automatic analysis, e.g. parsing
    • G06F17/2705Parsing
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/28Processing or translating of natural language
    • G06F17/2809Data driven translation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/08Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
    • G10L13/10Prosody rules derived from text; Stress or intonation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/06Elementary speech units used in speech synthesisers; Concatenation rules

Similar Documents

Publication Publication Date Title
Sproat A computational theory of writing systems
CN102902660B (en) Chinese phonetics codes spelling and Mixed Pinyin Chinese holographic information processing method
CN100568225C (en) Text symbolization processing method and system for numbers and special symbol strings in text
CA2650614A1 (en) System and method for generating a pronunciation dictionary
Alves What’s so Chinese about Vietnamese
CN1558341A (en) Chinese character / pin yin / english translator
Samudravijaya Indian language speech label (ILSL): A de facto national standard
CN102479078A (en) Chinese Phonetic Code Computer Chinese Programming Method
Lagally ArabTEX—Typesetting Arabic with vowels and ligatures
Joshi et al. A phonemic code based scheme for effective processing of Indian Languages
Cherifi et al. Arabic grapheme-to-phoneme conversion based on joint multi-gram model
CN103853705A (en) Real-time voice subtitle translation method of Chinese voice and foreign language voice of computer
Nair et al. English to Indian Language and Back Transliteration with Phonetic Transcription for Computational Linguistics Tools based on Conventional Transliteration Schemes
Ngugi et al. Swahili text-to-speech system
Ganjavi et al. ASCII based transcription systems for languages with the Arabic script: The case of Persian
Gutkin et al. Extensions to Brahmic script processing within the Nisaba library: new scripts, languages and utilities
CN101515207A (en) General voice input method for global languages on keyboard
CN1257444C (en) Complete pronunciation Chinese input method for computer
Soiffer A flexible design for accessible spoken math
Poupard Between the Oral and the Literary: The Case of the Naxi Dongba Texts
Bradley et al. Mansi et al. in Print before and under Unicode
Scharf Linguistic issues and intelligent technological solutions in encoding Sanskrit
Vikas Issues in Representation of Indic Scripts in Unicode
CN1388430A (en) Modern Chinese pronunciation input method
CN100517190C (en) Chinese character input method of specific Latin alphabet tone Chinese character pinyin