Search Results (1)

Search Parameters:
Keywords = maintain morpheme units

9 pages, 676 KiB  
Article
The Multi-Hot Representation-Based Language Model to Maintain Morpheme Units
by Ju-Sang Lee, Joon-Choul Shin and Choel-Young Ock
Appl. Sci. 2022, 12(20), 10612; https://doi.org/10.3390/app122010612 - 20 Oct 2022
Viewed by 1417
Abstract
Natural language models brought rapid developments to Natural Language Processing (NLP) performance following the emergence of large-scale deep learning models. Language models have previously used token units to represent natural language while reducing the proportion of unknown tokens. However, tokenization in language models raises language-specific issues. One of the key issues is that separating words by morphemes may cause distortion to the original meaning; also, it can prove challenging to apply the information surrounding a word, such as its semantic network. We propose a multi-hot representation language model to maintain Korean morpheme units. This method represents a single morpheme as a group of syllable-based tokens for cases where no matching tokens exist. This model has demonstrated similar performance to existing models in various natural language processing applications. The proposed model retains the minimum unit of meaning by maintaining the morpheme units and can easily accommodate the extension of semantic information.
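The idea of representing an out-of-vocabulary morpheme as a group of syllable-based tokens can be illustrated with a minimal sketch. This is not the authors' implementation: the toy vocabulary, the `[UNK]` fallback, the function name `multi_hot`, and the example morphemes are all assumptions made for illustration, and the abstract does not specify how the actual model builds or consumes these vectors.

```python
import unicodedata
from typing import Dict, List

# Hypothetical toy vocabulary: one whole-morpheme token plus a few
# single-syllable tokens. A real vocabulary would be far larger.
VOCAB: Dict[str, int] = {
    tok: i for i, tok in enumerate(["[UNK]", "학교", "호", "캉", "스"])
}


def multi_hot(morpheme: str, vocab: Dict[str, int]) -> List[int]:
    """Return a 0/1 vector over the vocabulary for a single morpheme.

    If the morpheme has its own token, the vector is one-hot.
    Otherwise (assumption) every syllable of the morpheme switches on
    its own token, so the morpheme stays a single unit in the input.
    """
    # NFC normalization so that each Hangul syllable is one code point.
    morpheme = unicodedata.normalize("NFC", morpheme)
    vec = [0] * len(vocab)

    if morpheme in vocab:
        vec[vocab[morpheme]] = 1
        return vec

    for syllable in morpheme:
        # Syllables without a token fall back to [UNK] (assumption).
        vec[vocab.get(syllable, vocab["[UNK]"])] = 1
    return vec


in_vocab = multi_hot("학교", VOCAB)      # morpheme with its own token
fallback = multi_hot("호캉스", VOCAB)    # no token: three syllable tokens
partial = multi_hot("호텔", VOCAB)       # one known syllable, one unknown
```

A plain 0/1 vector like this discards syllable order and repeated syllables, so a practical model would need some additional mechanism to recover them; the sketch only shows how one morpheme can map to several active tokens while remaining a single input position.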
(This article belongs to the Special Issue Natural Language Processing (NLP) and Applications)
Figure 1. The multi-hot representation language model acting on an input value (example: “Staycation is trending”).

Figure 2. Experimental area (KorQuAD v1.0, NER, SRL, NSMC) usage model.