The present work deals with the phonological rules in Urdu language. All these rules have been re... more The present work deals with the phonological rules in Urdu language. All these rules have been reported by considering the multiple pronunciations of a word, which has same spellings and parts of speech (POS). For the confirmation of multiple pronunciations, firstly a word list of 13717 words has been extracted from 10 hours speech corpus of a female native Urdu speaker. Secondly, in order to confirm whether these multiple pronunciations are speaker dependent or language dependent, data from 9 more native speakers have been collected for the confirmation of multiple pronunciations. In this paper, phonological rules related to the segment alternation, segment deletion and segment insertion have been investigated. Analysis reports that (i) segment alternation occurs due to stress, (ii) unstressed articulation causes segment deletion and (iii) segment insertion emerges to break consonant cluster at coda position.
2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2015
This paper describes the multi-level annotation process of Urdu speech corpus and its quality ass... more This paper describes the multi-level annotation process of Urdu speech corpus and its quality assessment using PRAAT. The annotation of speech corpus has been done at phoneme, word, syllable and break index levels. Phoneme, word and break index level annotation has been done manually by trained linguists whereas syllable-tier annotation has been done automatically using template matching algorithm. The mean accuracy achieved at phoneme and break index label and boundary identification is 79.07% and 89.67% respectively. The quality assessment of word and syllable tiers is still under investigation.
The present work deals with the phonological rules in Urdu language. All these rules have been re... more The present work deals with the phonological rules in Urdu language. All these rules have been reported by considering the multiple pronunciations of a word, which has same spellings and parts of speech (POS). For the confirmation of multiple pronunciations, firstly a word list of 13717 words has been extracted from 10 hours speech corpus of a female native Urdu speaker. Secondly, in order to confirm whether these multiple pronunciations are speaker dependent or language dependent, data from 9 more native speakers have been collected for the confirmation of multiple pronunciations. In this paper, phonological rules related to the segment alternation, segment deletion and segment insertion have been investigated. Analysis reports that (i) segment alternation occurs due to stress, (ii) unstressed articulation causes segment deletion and (iii) segment insertion emerges to break consonant cluster at coda position.
2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2015
This paper describes the multi-level annotation process of Urdu speech corpus and its quality ass... more This paper describes the multi-level annotation process of Urdu speech corpus and its quality assessment using PRAAT. The annotation of speech corpus has been done at phoneme, word, syllable and break index levels. Phoneme, word and break index level annotation has been done manually by trained linguists whereas syllable-tier annotation has been done automatically using template matching algorithm. The mean accuracy achieved at phoneme and break index label and boundary identification is 79.07% and 89.67% respectively. The quality assessment of word and syllable tiers is still under investigation.
Uploads
Papers