Computer Science > Computation and Language

arXiv:2403.17760 (cs)

[Submitted on 26 Mar 2024 (v1), last revised 29 May 2024 (this version, v2)]

Title:Constructions Are So Difficult That Even Large Language Models Get Them Right for the Wrong Reasons

Authors:Shijia Zhou, Leonie Weissweiler, Taiqi He, Hinrich Schütze, David R. Mortensen, Lori Levin

Abstract:In this paper, we make a contribution that can be understood from two perspectives: from an NLP perspective, we introduce a small challenge dataset for NLI with large lexical overlap, which minimises the possibility of models discerning entailment solely based on token distinctions, and show that GPT-4 and Llama 2 fail it with strong bias. We then create further challenging sub-tasks in an effort to explain this failure. From a Computational Linguistics perspective, we identify a group of constructions with three classes of adjectives which cannot be distinguished by surface features. This enables us to probe for LLM's understanding of these constructions in various ways, and we find that they fail in a variety of ways to distinguish between them, suggesting that they don't adequately represent their meaning or capture the lexical properties of phrasal heads.

Comments:	LREC-COLING 2024
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2403.17760 [cs.CL]
	(or arXiv:2403.17760v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2403.17760

Submission history

From: Shijia Zhou [view email]
[v1] Tue, 26 Mar 2024 14:51:12 UTC (9,009 KB)
[v2] Wed, 29 May 2024 23:41:37 UTC (9,009 KB)

Computer Science > Computation and Language

Title:Constructions Are So Difficult That Even Large Language Models Get Them Right for the Wrong Reasons

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Constructions Are So Difficult That Even Large Language Models Get Them Right for the Wrong Reasons

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators