Joshi et al., 2003 - Google Patents
A phonemic code based scheme for effective processing of Indian LanguagesJoshi et al., 2003
View PDF- Document ID
- 16927774813943940060
- Author
- Joshi R
- Shoff K
- Mudur S
- Publication year
- Publication venue
- National Centre for Software Technology, Mumbai, 23rd Internationalization and Unicode Conference, Prague, Czech Republic
External Links
Snippet
The multitude of Indian languages and dialects are written using 9 scripts. While each of these scripts has been encoded separately in the Unicode scheme, applications supporting Indian languages are yet to be found on a number of standard platforms. One primary …
- 238000009877 rendering 0 abstract description 27
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/21—Text processing
- G06F17/22—Manipulating or registering by use of codes, e.g. in sequence of text characters
- G06F17/2217—Character encodings
- G06F17/2223—Handling non-latin characters, e.g. kana-to-kanji conversion
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/21—Text processing
- G06F17/22—Manipulating or registering by use of codes, e.g. in sequence of text characters
- G06F17/2264—Transformation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/21—Text processing
- G06F17/211—Formatting, i.e. changing of presentation of document
- G06F17/214—Font handling; Temporal and kinetic typography
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2872—Rule based translation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2863—Processing of non-latin text
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2705—Parsing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2809—Data driven translation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
- G10L13/10—Prosody rules derived from text; Stress or intonation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/06—Elementary speech units used in speech synthesisers; Concatenation rules
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Sproat | A computational theory of writing systems | |
CN102902660B (en) | Chinese phonetics codes spelling and Mixed Pinyin Chinese holographic information processing method | |
CN100568225C (en) | Text symbolization processing method and system for numbers and special symbol strings in text | |
CA2650614A1 (en) | System and method for generating a pronunciation dictionary | |
Alves | What’s so Chinese about Vietnamese | |
CN1558341A (en) | Chinese character / pin yin / english translator | |
Samudravijaya | Indian language speech label (ILSL): A de facto national standard | |
CN102479078A (en) | Chinese Phonetic Code Computer Chinese Programming Method | |
Lagally | ArabTEX—Typesetting Arabic with vowels and ligatures | |
Joshi et al. | A phonemic code based scheme for effective processing of Indian Languages | |
Cherifi et al. | Arabic grapheme-to-phoneme conversion based on joint multi-gram model | |
CN103853705A (en) | Real-time voice subtitle translation method of Chinese voice and foreign language voice of computer | |
Nair et al. | English to Indian Language and Back Transliteration with Phonetic Transcription for Computational Linguistics Tools based on Conventional Transliteration Schemes | |
Ngugi et al. | Swahili text-to-speech system | |
Ganjavi et al. | ASCII based transcription systems for languages with the Arabic script: The case of Persian | |
Gutkin et al. | Extensions to Brahmic script processing within the Nisaba library: new scripts, languages and utilities | |
CN101515207A (en) | General voice input method for global languages on keyboard | |
CN1257444C (en) | Complete pronunciation Chinese input method for computer | |
Soiffer | A flexible design for accessible spoken math | |
Poupard | Between the Oral and the Literary: The Case of the Naxi Dongba Texts | |
Bradley et al. | Mansi et al. in Print before and under Unicode | |
Scharf | Linguistic issues and intelligent technological solutions in encoding Sanskrit | |
Vikas | Issues in Representation of Indic Scripts in Unicode | |
CN1388430A (en) | Modern Chinese pronunciation input method | |
CN100517190C (en) | Chinese character input method of specific Latin alphabet tone Chinese character pinyin |