NICE Data Selection for Instruction Tuning in LLMs with Non-differentiable Evaluation Metric [ICML 2025]

This is the official implementation of the ICML 2025 paper "NICE Data Selection for Instruction Tuning in LLMs with Non-differentiable Evaluation Metric".

Our code is based on the LESS codebase.

Install Requirements

To get started with this repository, install the environment specified in environment.yml.
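
A minimal sketch of the install step, assuming environment.yml is a standard conda environment file (the README does not name the tool, so conda is an assumption here). The guard only avoids a confusing error when conda or the file is missing.

```shell
# Assumption: environment.yml is a conda environment specification.
ENV_FILE="environment.yml"

if command -v conda >/dev/null 2>&1 && [ -f "$ENV_FILE" ]; then
    # Create the environment described in the file.
    conda env create -f "$ENV_FILE"
    # Afterwards, activate it with `conda activate <name>`, where <name> is
    # the value of the `name:` field inside environment.yml.
else
    echo "conda or $ENV_FILE not found; run this from the repository root" >&2
fi
```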

Data Preparation

For the task-agnostic setting, we use four datasets: Flan v2, COT, Dolly, and Open Assistant.

For the task-aware setting, we use two datasets: RLHF and Code-alpaca-20k.

We evaluate on four datasets: AlpacaEval, TLDR, RLHF, and HumanEval.

The datasets can be downloaded from link.

Data Selection Commands

The selection commands are in running_commands.sh. Run them in the order listed to perform data selection.
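
Since the commands are meant to be run in sequence, it may help to list them with line numbers before running anything. The snippet below is only a sketch and assumes it is run from the repository root, where running_commands.sh lives.

```shell
# Assumption: the current directory is the repository root.
SCRIPT="running_commands.sh"

if [ -f "$SCRIPT" ]; then
    # Show the commands with line numbers so the intended order is visible.
    cat -n "$SCRIPT"
else
    echo "$SCRIPT not found; run this from the repository root" >&2
fi
```

Each command can then be copied and run in turn, so a failure in an early step is noticed before the later ones start.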

BibTeX

@inproceedings{
wang2025nice,
title={{NICE} Data Selection for Instruction Tuning in {LLM}s with Non-differentiable Evaluation Metric},
author={Jingtan Wang and Xiaoqiang Lin and Rui Qiao and Pang Wei Koh and Chuan-Sheng Foo and Bryan Kian Hsiang Low},
booktitle={Forty-second International Conference on Machine Learning},
year={2025}
}
