Rishabh Srivastava

Papers

PurposeNet is an ontology based on the principle that all artifacts (man-made objects) exist for a purpose and all its features and relations with other entities are goverened by its purpose. We provide instances of ontology creation for... more

PurposeNet is an ontology based on the principle that all artifacts (man-made objects) exist for a purpose and all its features and relations with other entities are goverened by its purpose. We provide instances of ontology creation for two
varied domains from scratch in the PurposeNet architecture. These domains include MMTS domain and recipe domain. The methodology of creation was totally different for the two domains. MMTS domain was more computationally oriented
ontology while recipe domain required a post-processing after manually entering the data. The post-processing step uses hierarchical clustering to cluster very close actions. MMTS ontology is further used to create a simple template based QA system and the results are compared with a database system for the same domain.

Research Interests:
Ontology and Ontology (Computer Science)

Download (.pdf)

Recent studies in machine translation support the fact that multi-model systems perform better than the individual models. In this paper, we describe a Hindi to English statistical machine translation system and improve over the baseline... more

Recent studies in machine translation support the fact that multi-model systems perform better than the individual models. In this paper, we describe a Hindi to English statistical machine translation system and improve over the baseline using multiple translation models. We have considered phrase based as well as hierarchical models and enhanced over both these baselines using a regression model. The system is trained over textual as well as syntactic features extracted from source and target of the aforementioned translations. Our system shows significant improvement over the baseline systems for both automatic as well as human evaluations. The proposed methodology is quite generic and can easily be extended to other language pairs as well.

Location: Reykjavik, Iceland

Publication Date: May 26, 2014

Publication Name: Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)

Conference End Date: May 31, 2014

Conference Start Date: May 26, 2014

Research Interests:
Machine Learning, Machine Translation, Statistical Machine Translation, and Regression Models

Download (.pdf)

Hindi is the lingua-franca of India. Although all non-native speakers can communicate well in Hindi, there are only a few who can read and write in it. In this work, we aim to bridge this gap by building transliteration systems that could... more

Hindi is the lingua-franca of India. Although all non-native speakers can communicate well in Hindi, there are only a few who can read and write in it. In this work, we aim to bridge this gap by building transliteration systems that could transliterate Hindi into at-least 7 other Indian languages. The transliteration systems are developed as a reading aid for non-Hindi readers. The systems are trained on the transliteration pairs extracted automatically from a parallel corpora. All the transliteration systems perform satisfactorily for a non-Hindi reader to understand a Hindi text.

Location: Taipei, Taiwan

Organization: Department of English, National Chengchi

Publication Date: Nov 21, 2013

Publication Name: The 27th Pacific Asia Conference on Language, Information, and Computation

Conference End Date: Nov 24, 2013

Conference Start Date: Nov 22, 2013

Research Interests:
Machine Transliteration, Soundex Algorithm, and Text Extraction

Download (.pdf)

Artifacts are man-made objects taken as a whole. This paper presents various kinds of artifacts based on different division criteria and methods to create a list of artifacts. Different methods have been discussed and then we show how... more

Artifacts are man-made objects taken as a whole. This paper presents various kinds of artifacts based on different division criteria and methods to create a list of artifacts. Different methods have been discussed and then we show how some of them can be used on specific kinds of text to create an exhaustive list of artifacts.

Journal Name: International Journal of Computer Technology and Electronics Engineering

Publication Date: Dec 2011

Research Interests:
Computational Linguistics and Knowledge Representation

Download (.pdf)

Project Report

I propose to work on the Petfinder.my Adoption Prediction challenge on kaggle (https://www.kaggle.com/c/petfinder-adoption-prediction). PetFinder.my has been Malaysia’s leading animal welfare platform since 2008, with a database of... more

I propose to work on the Petfinder.my Adoption Prediction challenge on kaggle (https://www.kaggle.com/c/petfinder-adoption-prediction). PetFinder.my has been Malaysia’s leading animal welfare platform since 2008, with a database of more than 150,000 animals.

Publication Date: 2019

Publication Name: Udacity

Research Interests:
Machine Learning and Deep Learning

Download (.pdf)

PurposeNet is a semantic knowledgebase of artifacts, developed with purpose as the underlying principle of design. The principle is based on the observation that human beings tend to not only organize and categorize physical entities... more

PurposeNet is a semantic knowledgebase of artifacts, developed with purpose as the underlying principle of design. The principle is based on the observation that human beings tend to not only organize and categorize physical entities around them intuitively based on purpose, but, the morphology, anatomy and physiology of an artifact as well as its relations with the other artifacts around it are purpose-based. We aim at extracting different semantic relations (descriptive properties) from the Wikipedia for artifacts and then develop ontologies automatically using the web ontology language (OWL). Next we devised methods to populate the list of artifacts which are the basic unit of this system. We also improvised the PurposeNet architecture including various new concepts into it.

More Info: Report for Project-Based Internship

Publication Date: Dec 2011

Publication Name: IIIT DM, Jabalpur

Research Interests:
Computer Science, Information Extraction, and Knowledge Representation and Reasoning

Download (.pdf)

Posters

Publication Date: Mar 1, 2014

Research Interests:
Ontology and Question Answering System

Download (.pdf)

Publication Date: Nov 21, 2013

Research Interests:
Machine Transliteration

Download (.pdf)

Publication Date: Apr 16, 2013

Research Interests:
Information Retrieval and Computational Advertising

Download (.pdf)

Publication Date: Feb 7, 2013

Research Interests:
Natural Language Processing, Computational Linguistics, and Machine Transliteration

Download (.pdf)

Publication Date: Feb 7, 2013

Research Interests:
Ontology and Crowdsourcing

Download (.pdf)

Publication Date: Nov 2012

Research Interests:
Cloud Computing and Hadoop

Download (.pdf)

Previous Works

Hypergraphs are important data structures used to repre- sent and model the concepts in various areas of Computer Science and Discrete Mathematics. As of now an adjacency matrix representation and a bipartite incidence representation... more

Hypergraphs are important data structures used to repre-
sent and model the concepts in various areas of Computer Science and Discrete Mathematics. As of now an adjacency matrix representation and a bipartite incidence representation have been given for its implementation. The present paper proposes two novel methods for hypergraph representation using adjacency list. A comparison has been made with the existing representations to show that the proposed approach is better in terms of time complexity. Various graph algorithms such as Breadth- first search, Depth-first search, Strongly connected components, Dijkstras shortest path algorithm are implemented and studied in detail using the proposed representation for hypergraphs.

Publication Date: Feb 2013

Research Interests:
Hypergraphs, Graphs, Data Mining

Download (.pdf)

This paper presents the concept of using surface text patterns along with POS tagger for automatically extracting three properties, color, state and shape, of artifacts, any man-made entity, from the corpus. The approach has been... more

This paper presents the concept of using surface text patterns
along with POS tagger for automatically extracting three properties, color, state and shape, of artifacts, any man-made entity, from the corpus. The approach has been compared with the approach of using dependency parser for extraction of relation. The efficiency of both the approaches is also examined. The paper presents an insightful discussion on the issues that we have come across while using STPs for the purpose of information extraction.

Publication Date: Aug 2012

Research Interests:
Information Extraction, Artifacts, Surface Text Patterns, and Descriptive Properties

Download (.pdf)

Research Interests: Ontology and Ontology (Computer Science)<div>()</div>

Location: Reykjavik, Iceland

Publication Date: May 26, 2014

Publication Name: Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)

Conference End Date: May 31, 2014

Conference Start Date: May 26, 2014

Research Interests: Machine Learning, Machine Translation, Statistical Machine Translation, and Regression Models<div>()</div>

Location: Taipei, Taiwan

Organization: Department of English, National Chengchi

Publication Date: Nov 21, 2013

Publication Name: The 27th Pacific Asia Conference on Language, Information, and Computation

Conference End Date: Nov 24, 2013

Conference Start Date: Nov 22, 2013

Research Interests: Machine Transliteration, Soundex Algorithm, and Text Extraction<div>()</div>

Journal Name: International Journal of Computer Technology and Electronics Engineering

Publication Date: Dec 2011

Research Interests: Computational Linguistics and Knowledge Representation<div>()</div>

Publication Date: 2019

Publication Name: Udacity

Research Interests: Machine Learning and Deep Learning<div>()</div>

More Info: Report for Project-Based Internship

Publication Date: Dec 2011

Publication Name: IIIT DM, Jabalpur

Research Interests: Computer Science, Information Extraction, and Knowledge Representation and Reasoning<div>()</div>

Publication Date: Mar 1, 2014

Research Interests: Ontology and Question Answering System<div>()</div>

Publication Date: Nov 21, 2013

Research Interests: Machine Transliteration<div>()</div>

Publication Date: Apr 16, 2013

Research Interests: Information Retrieval and Computational Advertising<div>()</div>

Publication Date: Feb 7, 2013

Research Interests: Natural Language Processing, Computational Linguistics, and Machine Transliteration<div>()</div>

Publication Date: Feb 7, 2013

Research Interests: Ontology and Crowdsourcing<div>()</div>

Publication Date: Nov 2012

Research Interests: Cloud Computing and Hadoop<div>()</div>

Publication Date: Feb 2013

Research Interests: Hypergraphs, Graphs, Data Mining<div>()</div>

Publication Date: Aug 2012

Research Interests: Information Extraction, Artifacts, Surface Text Patterns, and Descriptive Properties<div>()</div>

Log In

Research Interests:
Ontology and Ontology (Computer Science)

Research Interests:
Machine Learning, Machine Translation, Statistical Machine Translation, and Regression Models

Research Interests:
Machine Transliteration, Soundex Algorithm, and Text Extraction

Research Interests:
Computational Linguistics and Knowledge Representation

Research Interests:
Machine Learning and Deep Learning

Research Interests:
Computer Science, Information Extraction, and Knowledge Representation and Reasoning

Research Interests:
Ontology and Question Answering System

Research Interests:
Machine Transliteration

Research Interests:
Information Retrieval and Computational Advertising

Research Interests:
Natural Language Processing, Computational Linguistics, and Machine Transliteration

Research Interests:
Ontology and Crowdsourcing

Research Interests:
Cloud Computing and Hadoop

Research Interests:
Hypergraphs, Graphs, Data Mining

Research Interests:
Information Extraction, Artifacts, Surface Text Patterns, and Descriptive Properties