In this paper, we propose an approach based on HMM and linguistics for the Vietnamese recognition... more In this paper, we propose an approach based on HMM and linguistics for the Vietnamese recognition problem, including handwritten and speech recognition. The main contribution is that our method could be used to model all Vietnamese isolated words by a small number ...
In this paper we proposed and developed a system to integrate the bibliographical data of publica... more In this paper we proposed and developed a system to integrate the bibliographical data of publications in the computer science domain from various online sources into a unified database based on the focused crawling approach. In order to build this system, there are two phases to carry on. The first phase deals with importing bibliographic data from DBLP (Digital Bibliography and Library Project) into our database. The second phase the system will automatically crawl new publications from online digital libraries such as Microsoft Academic Search, ACM, IEEEXplore, CiteSeer and extract bibliographical information (one kind of publication metadata) to update, enrich the existing database, which have been built at the first phase. This system serves effectively in services relating to academic activities such as searching literatures, ranking publications, ranking experts, ranking conferences or journals, reviewing articles, identifying the research trends, mining the linking of articles, stating of the art for a specified research domain, and other related works base on these bibliographical data.
... 490 Nazim uddin Mohammed, Trong Hai Duong, and Geun Sik Jo Rough Sets Based Association Rules... more ... 490 Nazim uddin Mohammed, Trong Hai Duong, and Geun Sik Jo Rough Sets Based Association Rules Application for Knowledge-Based System Design ..... 501 Shu-Hsien Liao and Yin-Ju Chen Author Index ..... 511 Page 16. ...
In this paper, we present a system for recognizing Vietnamese document images and propose a metho... more In this paper, we present a system for recognizing Vietnamese document images and propose a method to increase the accuracy for this system. Based on features of Vietnamese language, we can minimize the number of characters and integrate spell-checking in the ...
2012 International Conference on Collaboration Technologies and Systems (CTS), 2012
ABSTRACT To learn about the state of the art for a research project, researchers must conduct a l... more ABSTRACT To learn about the state of the art for a research project, researchers must conduct a literature survey by searching for, collecting, and reading related scientific articles. Popular search systems, online digital libraries, and Web of Science (WoS) sources such as IEEE Explorer, ACM, SpringerLink, and Google Scholar typically return results or articles that are similar to keywords in the user's query. Some digital libraries also include content-based recommenders that suggest papers similar to one the user likes based on the contents of paper, i.e., the keywords it contains. In this work, we present a recommender module that suggests papers to users based on the seed paper's Citation Network. This work takes into account the combination of the co-citation and co-reference factors to improve algorithm's effectiveness. We applied and improved the the CCIDF (Common Citation Inverse Document Frequency) algorithm used by the CiteSeer digital library. This improved algorithm, called CCIDF+, was evaluated using data collected from Microsoft Academic Search (MAS). Experimental results show that CCIDF+ outperforms CCIDF.
2014 28th International Conference on Advanced Information Networking and Applications Workshops, 2014
ABSTRACT Successful research collaborations may facilitate major outcomes in science and their ap... more ABSTRACT Successful research collaborations may facilitate major outcomes in science and their applications. Thus, identifying effective collaborators may be a key factor that affects success. However, it is very difficult to identify potential collaborators and it is particularly difficult for young researchers who have less knowledge about other researchers and experts in their research domain. This study introduces and defines the problem of collaborator recommendation for 'isolated' researchers who have no links with others in co author networks. Existing approaches such as link-based and content-based methods may not be suitable for isolated researchers because of their lack of links and content information. Thus, we propose a new approach that uses additional information as new features to make recommendations, i.e., the strength of the relationship between organizations, the importance rating, and the activity scores of researchers. We also propose a new method for evaluating the quality of collaborator recommendations. We performed experiments by crawling publications from the Microsoft Academic Search Web site. The metadata were extracted from these publications, including the year, authors, organizational affiliations of authors, citations, and references. The metadata from publications between 2001 and 2005 were used as the training data while those from 2006 to 2011 were used for validation. The experimental results demonstrated the effectiveness and efficiency of our proposed approach.
International Journal of Software Innovation, 2015
In this paper, improving naturalness HMM-based speech synthesis for Vietnamese language is descri... more In this paper, improving naturalness HMM-based speech synthesis for Vietnamese language is described. By this synthesis method, trajectories of speech parameters are generated from the trained Hidden Markov models. A final speech waveform is synthesized from those speech parameters. The main objective for the development is to achieve maximum naturalness in output speech through key points. Firstly, system uses a high quality recorded Vietnamese speech database appropriate for training, especially in statistical parametric model approach. Secondly, prosodic informations such as tone, POS (part of speech) and features based on characteristics of Vietnamese language are added to ensure the quality of synthetic speech. Third, system uses STRAIGHT which showed its ability to produce high-quality voice manipulation and was successfully incorporated into HMM-based speech synthesis. The results collected show that the speech produced by our system has the best result when being compared wi...
... Hiep Luong1, Tin Huynh2, Susan Gauch1, Phuc Do2, Kiem Hoang2 ... main, or senior researchers ... more ... Hiep Luong1, Tin Huynh2, Susan Gauch1, Phuc Do2, Kiem Hoang2 ... main, or senior researchers who have strong publication records, selecting a con-ference might be a trivial task since they know well which conferences, journals or scientific forums are the best places in ...
2010 International Conference on Education and Management Technology, 2010
In this paper we propose a method to extract automatically metadata (title, authors, affiliation,... more In this paper we propose a method to extract automatically metadata (title, authors, affiliation, email, references, etc) from science papers by combining the layout information of papers with rules which are defined by using JAPE Grammar rules of GATE. After metadata extracted automatically from digital documents, user can interact and correct them before they are exported to XML files. Developing
In this paper, we propose an approach based on HMM and linguistics for the Vietnamese recognition... more In this paper, we propose an approach based on HMM and linguistics for the Vietnamese recognition problem, including handwritten and speech recognition. The main contribution is that our method could be used to model all Vietnamese isolated words by a small number ...
In this paper we proposed and developed a system to integrate the bibliographical data of publica... more In this paper we proposed and developed a system to integrate the bibliographical data of publications in the computer science domain from various online sources into a unified database based on the focused crawling approach. In order to build this system, there are two phases to carry on. The first phase deals with importing bibliographic data from DBLP (Digital Bibliography and Library Project) into our database. The second phase the system will automatically crawl new publications from online digital libraries such as Microsoft Academic Search, ACM, IEEEXplore, CiteSeer and extract bibliographical information (one kind of publication metadata) to update, enrich the existing database, which have been built at the first phase. This system serves effectively in services relating to academic activities such as searching literatures, ranking publications, ranking experts, ranking conferences or journals, reviewing articles, identifying the research trends, mining the linking of articles, stating of the art for a specified research domain, and other related works base on these bibliographical data.
... 490 Nazim uddin Mohammed, Trong Hai Duong, and Geun Sik Jo Rough Sets Based Association Rules... more ... 490 Nazim uddin Mohammed, Trong Hai Duong, and Geun Sik Jo Rough Sets Based Association Rules Application for Knowledge-Based System Design ..... 501 Shu-Hsien Liao and Yin-Ju Chen Author Index ..... 511 Page 16. ...
In this paper, we present a system for recognizing Vietnamese document images and propose a metho... more In this paper, we present a system for recognizing Vietnamese document images and propose a method to increase the accuracy for this system. Based on features of Vietnamese language, we can minimize the number of characters and integrate spell-checking in the ...
2012 International Conference on Collaboration Technologies and Systems (CTS), 2012
ABSTRACT To learn about the state of the art for a research project, researchers must conduct a l... more ABSTRACT To learn about the state of the art for a research project, researchers must conduct a literature survey by searching for, collecting, and reading related scientific articles. Popular search systems, online digital libraries, and Web of Science (WoS) sources such as IEEE Explorer, ACM, SpringerLink, and Google Scholar typically return results or articles that are similar to keywords in the user's query. Some digital libraries also include content-based recommenders that suggest papers similar to one the user likes based on the contents of paper, i.e., the keywords it contains. In this work, we present a recommender module that suggests papers to users based on the seed paper's Citation Network. This work takes into account the combination of the co-citation and co-reference factors to improve algorithm's effectiveness. We applied and improved the the CCIDF (Common Citation Inverse Document Frequency) algorithm used by the CiteSeer digital library. This improved algorithm, called CCIDF+, was evaluated using data collected from Microsoft Academic Search (MAS). Experimental results show that CCIDF+ outperforms CCIDF.
2014 28th International Conference on Advanced Information Networking and Applications Workshops, 2014
ABSTRACT Successful research collaborations may facilitate major outcomes in science and their ap... more ABSTRACT Successful research collaborations may facilitate major outcomes in science and their applications. Thus, identifying effective collaborators may be a key factor that affects success. However, it is very difficult to identify potential collaborators and it is particularly difficult for young researchers who have less knowledge about other researchers and experts in their research domain. This study introduces and defines the problem of collaborator recommendation for 'isolated' researchers who have no links with others in co author networks. Existing approaches such as link-based and content-based methods may not be suitable for isolated researchers because of their lack of links and content information. Thus, we propose a new approach that uses additional information as new features to make recommendations, i.e., the strength of the relationship between organizations, the importance rating, and the activity scores of researchers. We also propose a new method for evaluating the quality of collaborator recommendations. We performed experiments by crawling publications from the Microsoft Academic Search Web site. The metadata were extracted from these publications, including the year, authors, organizational affiliations of authors, citations, and references. The metadata from publications between 2001 and 2005 were used as the training data while those from 2006 to 2011 were used for validation. The experimental results demonstrated the effectiveness and efficiency of our proposed approach.
International Journal of Software Innovation, 2015
In this paper, improving naturalness HMM-based speech synthesis for Vietnamese language is descri... more In this paper, improving naturalness HMM-based speech synthesis for Vietnamese language is described. By this synthesis method, trajectories of speech parameters are generated from the trained Hidden Markov models. A final speech waveform is synthesized from those speech parameters. The main objective for the development is to achieve maximum naturalness in output speech through key points. Firstly, system uses a high quality recorded Vietnamese speech database appropriate for training, especially in statistical parametric model approach. Secondly, prosodic informations such as tone, POS (part of speech) and features based on characteristics of Vietnamese language are added to ensure the quality of synthetic speech. Third, system uses STRAIGHT which showed its ability to produce high-quality voice manipulation and was successfully incorporated into HMM-based speech synthesis. The results collected show that the speech produced by our system has the best result when being compared wi...
... Hiep Luong1, Tin Huynh2, Susan Gauch1, Phuc Do2, Kiem Hoang2 ... main, or senior researchers ... more ... Hiep Luong1, Tin Huynh2, Susan Gauch1, Phuc Do2, Kiem Hoang2 ... main, or senior researchers who have strong publication records, selecting a con-ference might be a trivial task since they know well which conferences, journals or scientific forums are the best places in ...
2010 International Conference on Education and Management Technology, 2010
In this paper we propose a method to extract automatically metadata (title, authors, affiliation,... more In this paper we propose a method to extract automatically metadata (title, authors, affiliation, email, references, etc) from science papers by combining the layout information of papers with rules which are defined by using JAPE Grammar rules of GATE. After metadata extracted automatically from digital documents, user can interact and correct them before they are exported to XML files. Developing
Uploads
Papers