[go: up one dir, main page]

Skip to content
View DRSY's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report DRSY

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
DRSY/README.md

Hi there 👋

😉 I am Siyu Ren.

🎓 I got my Ph.D degree at Shanghai Jiao Tong University.

🔎 Currently, my research interest includes Efficient Methods for NLP/Large Language Models and techniques around mechanistic understanding of LLMs pretraining, instrution-tuning, and alignment.

📚 For my academic publications, please refer to https://drsy.github.io/.

DRSY's github stats主要使用语言

profile

Pinned Loading

  1. MoTIS MoTIS Public

    [NAACL 2022]Mobile Text-to-Image search powered by multimodal semantic representation models(e.g., OpenAI's CLIP)

    Swift 120 10

  2. EMO EMO Public

    [ICLR 2024]EMO: Earth Mover Distance Optimization for Auto-Regressive Language Modeling(https://arxiv.org/abs/2310.04691)

    Python 114 14

  3. EasyKV EasyKV Public

    Easy control for Key-Value Constrained Generative LLM Inference(https://arxiv.org/abs/2402.06262)

    Python 56 4

  4. DGen DGen Public

    [AAAI 2021]Knowledge-Driven Distractor Generation for Cloze-Style Multiple Choice Questions

    Python 21 2

  5. KV_Compression KV_Compression Public

    [EMNLP 2023]Context Compression for Auto-regressive Transformers with Sentinel Tokens

    Python 21

  6. LAMP LAMP Public

    [NAACL 2022 Findings]Specializing Pre-trained Language Models for Better Relational Reasoning via Network Pruning

    Python 11