Džeroski et al., 2006 - Google Patents
Towards a Slovene dependency treebankDžeroski et al., 2006
View PDF- Document ID
- 8783777716998192204
- Author
- Džeroski S
- Erjavec T
- Ledinek N
- Pajas P
- Žabokrtsky Z
- Žele A
- Publication year
- Publication venue
- Proc. of the Fifth Intern. Conf. on Language Resources and Evaluation (LREC)
External Links
Snippet
The paper presents the initial release of the Slovene Dependency Treebank, currently containing 2000 sentences or 30.000 words. Our approach to annotation is based on the Prague Dependency Treebank, which serves as an excellent model due to the similarity of …
- 238000011156 evaluation 0 abstract description 9
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/21—Text processing
- G06F17/22—Manipulating or registering by use of codes, e.g. in sequence of text characters
- G06F17/2247—Tree structured documents; Markup, e.g. Standard Generalized Markup Language [SGML], Document Type Definition [DTD]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/21—Text processing
- G06F17/22—Manipulating or registering by use of codes, e.g. in sequence of text characters
- G06F17/2264—Transformation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2705—Parsing
- G06F17/271—Syntactic parsing, e.g. based on context-free grammar [CFG], unification grammars
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/21—Text processing
- G06F17/22—Manipulating or registering by use of codes, e.g. in sequence of text characters
- G06F17/2217—Character encodings
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2809—Data driven translation
- G06F17/2827—Example based machine translation; Alignment
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2872—Rule based translation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2765—Recognition
- G06F17/277—Lexical analysis, e.g. tokenisation, collocates
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/289—Use of machine translation, e.g. multi-lingual retrieval, server side translation for client devices, real-time translation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/21—Text processing
- G06F17/211—Formatting, i.e. changing of presentation of document
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/21—Text processing
- G06F17/24—Editing, e.g. insert/delete
- G06F17/248—Templates
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/274—Grammatical analysis; Style critique
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30908—Information retrieval; Database structures therefor; File system structures therefor of semistructured data, the undelying structure being taken into account, e.g. mark-up language structure data
- G06F17/30914—Mapping or conversion
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F8/00—Arrangements for software engineering
- G06F8/40—Transformations of program code
- G06F8/41—Compilation
- G06F8/42—Syntactic analysis
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Džeroski et al. | Towards a Slovene dependency treebank | |
| CN101673260A (en) | System and method for training machine translator | |
| Ambati et al. | Hindi CCGbank: A CCG treebank from the Hindi dependency treebank | |
| Simov et al. | HPSG-based syntactic treebank of Bulgarian (BulTreeBank) | |
| Burns | Latincy: Synthetic trained pipelines for latin nlp | |
| Erjavec | The goo300k corpus of historical Slovene. | |
| Wax | Automated grammar engineering for verbal morphology | |
| Sawalha et al. | Morphologically-analyzed and syntactically-annotated Quran dataset | |
| Finlayson | Collecting Semantics in the Wild: The Story Workbench. | |
| Fashwan et al. | A morphologically annotated corpus and a morphological analyzer for Egyptian Arabic | |
| Ćavar et al. | Riznica: the Croatian language corpus | |
| Czaykowska-Higgins et al. | Using TEI for an endangered language lexical resource: The Nxaʔamxcín Database-Dictionary Project | |
| Gerstenberger et al. | Instant annotations–Applying NLP methods to the annotation of spoken language documentation corpora | |
| Khemakhem | Standard-based lexical models for automatically structured dictionnaries | |
| Butler et al. | Review of fieldworks language explorer (flex) | |
| Hellan et al. | Creating Norwegian Valence Resources from a Deep Grammar | |
| Moshagen et al. | The GiellaLT infrastructure: A multilingual infrastructure for rule-based NLP | |
| Moshagen et al. | The GiellaLT infrastructure: A multilingual infrastructure for rule-based NLP1 | |
| Schmirler | Syntactic features and text types in 20th century plains cree: A constraint grammar approach | |
| Ledinek | Towards a Slovene dependency treebank | |
| Aili et al. | Building Uyghur dependency treebank: Design principles, annotation schema and tools | |
| Butler et al. | Fieldworks language explorer (FLEx) | |
| Fransen | Past, present and future: Computational approaches to mapping historical Irish cognate verb forms | |
| Pozzo et al. | Aligning Immanuel Kant’s work and its translations | |
| Čmejrek | Using Dependency Tree Structure for Czech-English Machine Translation |