Computer Science > Computation and Language

arXiv:2310.12751 (cs)

[Submitted on 19 Oct 2023]

Title:Character-level Chinese Backpack Language Models

View PDF

Abstract:The Backpack is a Transformer alternative shown to improve interpretability in English language modeling by decomposing predictions into a weighted sum of token sense components. However, Backpacks' reliance on token-defined meaning raises questions as to their potential for languages other than English, a language for which subword tokenization provides a reasonable approximation for lexical items. In this work, we train, evaluate, interpret, and control Backpack language models in character-tokenized Chinese, in which words are often composed of many characters. We find that our (134M parameter) Chinese Backpack language model performs comparably to a (104M parameter) Transformer, and learns rich character-level meanings that log-additively compose to form word meanings. In SimLex-style lexical semantic evaluations, simple averages of Backpack character senses outperform input embeddings from a Transformer. We find that complex multi-character meanings are often formed by using the same per-character sense weights consistently across context. Exploring interpretability-through control, we show that we can localize a source of gender bias in our Backpacks to specific character senses and intervene to reduce the bias.

Comments:	BlackboxNLP 2023 Camera-Ready
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2310.12751 [cs.CL]
	(or arXiv:2310.12751v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2310.12751

Submission history

From: Hao Sun [view email]
[v1] Thu, 19 Oct 2023 13:54:57 UTC (3,945 KB)

Computer Science > Computation and Language

Title:Character-level Chinese Backpack Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Character-level Chinese Backpack Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators