A functional toolkit for morphological and phonological processing, application to a Sanskrit tagger
Huet, 2005
- Document ID
- 14816375219030737427
- Author
- Huet G
- Publication year
- 2005
- Publication venue
- Journal of Functional Programming
Snippet
We present the Zen toolkit for morphological and phonological processing of natural languages. This toolkit is presented in literate programming style, in the Pidgin ML subset of the Objective Caml functional programming language. This toolkit is based on a systematic …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2705—Parsing
- G06F17/271—Syntactic parsing, e.g. based on context-free grammar [CFG], unification grammars
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2785—Semantic analysis
- G06F17/279—Discourse representation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2765—Recognition
- G06F17/277—Lexical analysis, e.g. tokenisation, collocates
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
- G06F17/30657—Query processing
- G06F17/30675—Query execution
- G06F17/30684—Query execution using natural language analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2705—Parsing
- G06F17/2715—Statistical methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/21—Text processing
- G06F17/22—Manipulating or registering by use of codes, e.g. in sequence of text characters
- G06F17/2247—Tree structured documents; Markup, e.g. Standard Generalized Markup Language [SGML], Document Type Definition [DTD]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/274—Grammatical analysis; Style critique
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2872—Rule based translation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F8/00—Arrangements for software engineering
- G06F8/40—Transformations of program code
- G06F8/41—Compilation
- G06F8/42—Syntactic analysis
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Huet | | A functional toolkit for morphological and phonological processing, application to a Sanskrit tagger |
Eisner | | Bilexical grammars and their cubic-time parsing algorithms |
Klavans et al. | | The balancing act: combining symbolic and statistical approaches to language |
Oflazer | | Dependency parsing with an extended finite-state approach |
US20020042707A1 (en) | | Grammar-packaged parsing |
EP0583083A2 (en) | | Finite-state transduction of related word forms for text indexing and retrieval |
Corro | | Span-based discontinuous constituency parsing: a family of exact chart-based algorithms with time complexities from O(n^6) down to O(n^3) |
de La Clergerie | | DyALog: a tabular logic programming based environment for NLP |
Márquez | | Part-of-speech Tagging: A Machine Learning Approach based on Decision Trees |
Gérard | | Lexicon-directed segmentation and tagging of Sanskrit |
Velldal | | Empirical realization ranking |
Stroppa et al. | | Formal models of analogical proportions |
Huet | | Formal structure of Sanskrit text: Requirements analysis for a mechanical Sanskrit processor |
Forsberg | | Three tools for language processing: BNF converter, Functional Morphology, and Extract |
Petrunina | | Adjectivization in Russian: Analyzing participles by means of lexical frequency and constraint grammar |
Wang | | Statistical parsing and language modeling based on constraint dependency grammar |
Schiehlen | | Semantic Construction |
Prins | | Finite-state pre-processing for natural language analysis |
Dahl | | Natural language processing and logic programming |
Janicki | | Statistical and Computational Models for Whole Word Morphology |
Dongilli | | Natural language rendering of a conjunctive query |
Jakubíček | | Rule-based parsing of morphologically rich languages |
Erbach | | A flexible parser for a linguistic development environment |
White | | Rapid grammar development and parsing: Constraint dependency grammars with abstract role values |
Suarez et al. | | Automatic generation of semantic patterns using techniques of natural language processing |