Sojka, 2000 - Google Patents
Competing patterns for language engineering: Methods to handle and store empirical dataSojka, 2000
View PDF- Document ID
- 875527862770974898
- Author
- Sojka P
- Publication year
- Publication venue
- International Workshop on Text, Speech and Dialogue
External Links
Snippet
In this paper we describe a method of effective handling of linguistic data by means of covering and inhibiting patterns-patterns that “compete” each other. A methodology of developing such patterns is outlined. Applications in the areas of morphology, hyphenation …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
- G06F17/30657—Query processing
- G06F17/30675—Query execution
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2765—Recognition
- G06F17/277—Lexical analysis, e.g. tokenisation, collocates
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2705—Parsing
- G06F17/271—Syntactic parsing, e.g. based on context-free grammar [CFG], unification grammars
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30613—Indexing
- G06F17/30619—Indexing indexing structures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/274—Grammatical analysis; Style critique
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2795—Thesaurus; Synonyms
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/21—Text processing
- G06F17/22—Manipulating or registering by use of codes, e.g. in sequence of text characters
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/21—Text processing
- G06F17/211—Formatting, i.e. changing of presentation of document
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP0583083B1 (en) | Finite-state transduction of related word forms for text indexing and retrieval | |
Ruiz-Casado et al. | Automatic extraction of semantic relationships for wordnet by means of pattern learning from wikipedia | |
Klein et al. | Accurate unlexicalized parsing | |
US6023760A (en) | Modifying an input string partitioned in accordance with directionality and length constraints | |
Bilisoly | Practical text mining with Perl | |
Sedláček et al. | A new Czech morphological analyser ajka | |
CN101261623A (en) | Word splitting method and device for word border-free mark language based on search | |
Neumann et al. | A shallow text processing core engine | |
Abate et al. | Development of Amharic morphological analyzer using memory-based learning | |
Abd et al. | Arabic light stemmer based on ISRI stemmer | |
Schaback et al. | Multi-level feature extraction for spelling correction | |
Elshafei | Machine generation of Arabic diacritical marks | |
HIRPSSA et al. | POS Tagging for Amharic Text: A Machine Learning Approach. | |
Koskenniemi | Finite state morphology and information retrieval | |
Sojka | Competing patterns for language engineering: Methods to handle and store empirical data | |
Sekine | Corpus-based parsing and sublanguage studies | |
Hirpassa | Information extraction system for Amharic text | |
Trippel | The Lexicon Graph Model: A generic model for multimodal lexicon development | |
Doumi et al. | A semi-automatic and low cost approach to build scalable lemma-based lexical resources for Arabic verbs | |
Asker et al. | Applying machine learning to Amharic text classification | |
Gebremeskel | Ge’ez POS Tagger Using Hybrid Approach | |
Gebremeskel et al. | Unlock Tigrigna NLP: Design and Development of Morphological Analyzer for Tigrigna Verbs Using Hybrid Approach | |
Delić et al. | Transformation-based part-of-speech tagging for Serbian language | |
Novák | A model of computational morphology and its application to Uralic languages | |
Berri et al. | Web-based Arabic morphological analyzer |