[go: up one dir, main page]

Sojka, 2000 - Google Patents

Competing patterns for language engineering: Methods to handle and store empirical data

Sojka, 2000

View PDF
Document ID
875527862770974898
Author
Sojka P
Publication year
Publication venue
International Workshop on Text, Speech and Dialogue

External Links

Snippet

In this paper we describe a method of effective handling of linguistic data by means of covering and inhibiting patterns-patterns that “compete” each other. A methodology of developing such patterns is outlined. Applications in the areas of morphology, hyphenation …
Continue reading at www.academia.edu (PDF) (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/3061Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F17/30634Querying
    • G06F17/30657Query processing
    • G06F17/30675Query execution
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/27Automatic analysis, e.g. parsing
    • G06F17/2765Recognition
    • G06F17/277Lexical analysis, e.g. tokenisation, collocates
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/27Automatic analysis, e.g. parsing
    • G06F17/2705Parsing
    • G06F17/271Syntactic parsing, e.g. based on context-free grammar [CFG], unification grammars
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/3061Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F17/30613Indexing
    • G06F17/30619Indexing indexing structures
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/27Automatic analysis, e.g. parsing
    • G06F17/274Grammatical analysis; Style critique
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/27Automatic analysis, e.g. parsing
    • G06F17/2795Thesaurus; Synonyms
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/21Text processing
    • G06F17/22Manipulating or registering by use of codes, e.g. in sequence of text characters
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/21Text processing
    • G06F17/211Formatting, i.e. changing of presentation of document
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computer systems utilising knowledge based models

Similar Documents

Publication Publication Date Title
EP0583083B1 (en) Finite-state transduction of related word forms for text indexing and retrieval
Ruiz-Casado et al. Automatic extraction of semantic relationships for wordnet by means of pattern learning from wikipedia
Klein et al. Accurate unlexicalized parsing
US6023760A (en) Modifying an input string partitioned in accordance with directionality and length constraints
Bilisoly Practical text mining with Perl
Sedláček et al. A new Czech morphological analyser ajka
CN101261623A (en) Word splitting method and device for word border-free mark language based on search
Neumann et al. A shallow text processing core engine
Abate et al. Development of Amharic morphological analyzer using memory-based learning
Abd et al. Arabic light stemmer based on ISRI stemmer
Schaback et al. Multi-level feature extraction for spelling correction
Elshafei Machine generation of Arabic diacritical marks
HIRPSSA et al. POS Tagging for Amharic Text: A Machine Learning Approach.
Koskenniemi Finite state morphology and information retrieval
Sojka Competing patterns for language engineering: Methods to handle and store empirical data
Sekine Corpus-based parsing and sublanguage studies
Hirpassa Information extraction system for Amharic text
Trippel The Lexicon Graph Model: A generic model for multimodal lexicon development
Doumi et al. A semi-automatic and low cost approach to build scalable lemma-based lexical resources for Arabic verbs
Asker et al. Applying machine learning to Amharic text classification
Gebremeskel Ge’ez POS Tagger Using Hybrid Approach
Gebremeskel et al. Unlock Tigrigna NLP: Design and Development of Morphological Analyzer for Tigrigna Verbs Using Hybrid Approach
Delić et al. Transformation-based part-of-speech tagging for Serbian language
Novák A model of computational morphology and its application to Uralic languages
Berri et al. Web-based Arabic morphological analyzer