Computer Science > Information Retrieval

arXiv:2402.09543 (cs)

[Submitted on 14 Feb 2024]

Title:Rethinking Large Language Model Architectures for Sequential Recommendations

Authors:Hanbing Wang, Xiaorui Liu, Wenqi Fan, Xiangyu Zhao, Venkataramana Kini, Devendra Yadav, Fei Wang, Zhen Wen, Jiliang Tang, Hui Liu

View PDF HTML (experimental)

Abstract:Recently, sequential recommendation has been adapted to the LLM paradigm to enjoy the power of LLMs. LLM-based methods usually formulate recommendation information into natural language and the model is trained to predict the next item in an auto-regressive manner. Despite their notable success, the substantial computational overhead of inference poses a significant obstacle to their real-world applicability. In this work, we endeavor to streamline existing LLM-based recommendation models and propose a simple yet highly effective model Lite-LLM4Rec. The primary goal of Lite-LLM4Rec is to achieve efficient inference for the sequential recommendation task. Lite-LLM4Rec circumvents the beam search decoding by using a straight item projection head for ranking scores generation. This design stems from our empirical observation that beam search decoding is ultimately unnecessary for sequential recommendations. Additionally, Lite-LLM4Rec introduces a hierarchical LLM structure tailored to efficiently handle the extensive contextual information associated with items, thereby reducing computational overhead while enjoying the capabilities of LLMs. Experiments on three publicly available datasets corroborate the effectiveness of Lite-LLM4Rec in both performance and inference efficiency (notably 46.8% performance improvement and 97.28% efficiency improvement on ML-1m) over existing LLM-based methods. Our implementations will be open sourced.

Comments:	8 pages, 5 figures, conference
Subjects:	Information Retrieval (cs.IR)
Cite as:	arXiv:2402.09543 [cs.IR]
	(or arXiv:2402.09543v1 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.2402.09543

Submission history

From: Hanbing Wang [view email]
[v1] Wed, 14 Feb 2024 19:37:53 UTC (1,553 KB)

Computer Science > Information Retrieval

Title:Rethinking Large Language Model Architectures for Sequential Recommendations

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Information Retrieval

Title:Rethinking Large Language Model Architectures for Sequential Recommendations

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators