Volk, 1999 - Google Patents

Choosing the right lemma when analysing German nouns

Volk, 1999

Document ID: 12812578208147423444
Author: Volk M
Publication year: 1999

Snippet

1. Introduction. When processing large corpora, it is often necessary to lemmatise the wordforms. This is usually done by a morphological analyser, which always undoes inflection and sometimes also derivation and compounding. The latter is especially useful for …
Continue reading at www.zora.uzh.ch (PDF)

Classifications

    • G PHYSICS
        • G06 COMPUTING; CALCULATING; COUNTING
            • G06F ELECTRICAL DIGITAL DATA PROCESSING
                • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
                    • G06F17/20 Handling natural language data
                        • G06F17/27 Automatic analysis, e.g. parsing
                            • G06F17/2765 Recognition
                                • G06F17/277 Lexical analysis, e.g. tokenisation, collocates
                                • G06F17/2775 Phrasal analysis, e.g. finite state techniques, chunking
                            • G06F17/274 Grammatical analysis; Style critique
                            • G06F17/2795 Thesaurus; Synonyms
                            • G06F17/2705 Parsing
                                • G06F17/2715 Statistical methods
                            • G06F17/273 Orthographic correction, e.g. spelling checkers, vowelisation
                        • G06F17/28 Processing or translating of natural language
                            • G06F17/2863 Processing of non-latin text
                            • G06F17/2809 Data driven translation
                        • G06F17/21 Text processing
                            • G06F17/22 Manipulating or registering by use of codes, e.g. in sequence of text characters
                    • G06F17/30 Information retrieval; Database structures therefor; File system structures therefor
                        • G06F17/3061 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
                            • G06F17/30634 Querying
                                • G06F17/30657 Query processing

Similar Documents

Publication | Title
Wacholder et al. | Disambiguation of proper names in text
Wu et al. | Large-scale automatic extraction of an English-Chinese translation lexicon
KR940022316A (en) | Keyword Extractor for Japanese Documents
US8170867B2 (en) | System for extracting information from a natural language text
Dasgupta et al. | Unsupervised morphological parsing of Bengali
Wermter et al. | Collocation extraction based on modifiability statistics
Chanod et al. | Creating a tagset, lexicon and guesser for a French tagger
Mima et al. | An application and evaluation of the C/NC-value approach for the automatic term recognition of multi-word units in Japanese
Uchimoto et al. | The unknown word problem: a morphological analysis of Japanese using maximum entropy aided by a dictionary
US7376551B2 (en) | Definition extraction
Nagy et al. | Detecting light verb constructions across languages
Volk | Choosing the right lemma when analysing German nouns
Fung | Extracting key terms from Chinese and Japanese texts
Wechsler et al. | Multi-language text indexing for internet retrieval
Stolz et al. | When some dots turn a different color…: Thoughts on how (not) to determine whether or not reduplication is universal
Evert et al. | Identifying Morphosyntactic Preferences in Collocations
Weller et al. | Extraction of German Multiword Expressions from Parsed Corpora Using Context Features
Laporte | Reduction of lexical ambiguity
Martin | Choosing the right lemma when analysing German nouns
Borst et al. | Lexically-based distinction of readability levels of health documents
Dimitrov et al. | A lightweight approach to coreference resolution for named entities in text
Golcher | Statistical text segmentation with partial structure analysis
Nakov | Building an inflectional stemmer for Bulgarian
Litkowski | Use of machine readable dictionaries for word-sense disambiguation in senseval-2
Tufiş et al. | Lexical token alignment: Experiments, results and applications