default search action

combined dblp search
author search
venue search
publication search

ask others

Tomasz Korbak

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[j4]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - journals/tmlr/ChenSCKCBCP24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/ChenSCKCBCP24
Angelica Chen, Jérémy Scheurer, Jon Ander Campos, Tomasz Korbak, Jun Shern Chan, Samuel R. Bowman, Kyunghyun Cho, Ethan Perez:
Learning from Natural Language Feedback. Trans. Mach. Learn. Res. 2024 (2024)
[c11]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/BerglundTKBSKE24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/BerglundTKBSKE24
Lukas Berglund, Meg Tong, Maximilian Kaufmann, Mikita Balesni, Asa Cooper Stickland, Tomasz Korbak, Owain Evans:
The Reversal Curse: LLMs trained on "A is B" fail to learn "B is A". ICLR 2024
[c10]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/GoKKRD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/GoKKRD24
Dongyoung Go, Tomasz Korbak, Germán Kruszewski, Jos Rozen, Marc Dymetman:
Compositional Preference Models for Aligning LMs. ICLR 2024
[c9]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/SharmaTKDABDHJK24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/SharmaTKDABDHJK24
Mrinank Sharma, Meg Tong, Tomasz Korbak, David Duvenaud, Amanda Askell, Samuel R. Bowman, Esin Durmus, Zac Hatfield-Dodds, Scott R. Johnston, Shauna Kravec, Timothy Maxwell, Sam McCandlish, Kamal Ndousse, Oliver Rausch, Nicholas Schiefer, Da Yan, Miranda Zhang, Ethan Perez:
Towards Understanding Sycophancy in Language Models. ICLR 2024
[i23]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-01413
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-01413
Matthias Gerstgrasser, Rylan Schaeffer, Apratim Dey, Rafael Rafailov, Henry Sleight, John Hughes, Tomasz Korbak, Rajashree Agrawal, Dhruv Pai, Andrey Gromov, Daniel A. Roberts, Diyi Yang, David L. Donoho, Sanmi Koyejo:
Is Model Collapse Inevitable? Breaking the Curse of Recursion by Accumulating Real and Synthetic Data. CoRR abs/2404.01413 (2024)
[i22]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-09932
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-09932
Usman Anwar, Abulhair Saparov, Javier Rando, Daniel Paleka, Miles Turpin, Peter Hase, Ekdeep Singh Lubana, Erik Jenner, Stephen Casper, Oliver Sourbut, Benjamin L. Edelman, Zhaowei Zhang, Mario Günther, Anton Korinek, José Hernández-Orallo, Lewis Hammond, Eric J. Bigelow, Alexander Pan, Lauro Langosco, Tomasz Korbak, Heidi Zhang, Ruiqi Zhong, Seán Ó hÉigeartaigh, Gabriel Recchia, Giulio Corsi, Alan Chan, Markus Anderljung, Lilian Edwards, Yoshua Bengio, Danqi Chen, Samuel Albanie, Tegan Maharaj, Jakob N. Foerster, Florian Tramèr, He He, Atoosa Kasirzadeh, Yejin Choi, David Krueger:
Foundational Challenges in Assuring Alignment and Safety of Large Language Models. CoRR abs/2404.09932 (2024)
[i21]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-12150
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-12150
Tomasz Korbak:
Aligning language models with human preferences. CoRR abs/2404.12150 (2024)
2023
[j3]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/adb/Korbak23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/adb/Korbak23
Tomasz Korbak:
Self-organisation, (M, R)-systems and enactive cognitive science. Adapt. Behav. 31(1): 35-49 (2023)
[j2]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - journals/tmlr/CasperDSGSRFKLF23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/CasperDSGSRFKLF23
Stephen Casper, Xander Davies, Claudia Shi, Thomas Krendl Gilbert, Jérémy Scheurer, Javier Rando, Rachel Freedman, Tomasz Korbak, David Lindner, Pedro Freire, Tony Tong Wang, Samuel Marks, Charbel-Raphaël Ségerie, Micah Carroll, Andi Peng, Phillip J. K. Christoffersen, Mehul Damani, Stewart Slocum, Usman Anwar, Anand Siththaranjan, Max Nadeau, Eric J. Michaud, Jacob Pfau, Dmitrii Krasheninnikov, Xin Chen, Lauro Langosco, Peter Hase, Erdem Biyik, Anca D. Dragan, David Krueger, Dorsa Sadigh, Dylan Hadfield-Menell:
Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback. Trans. Mach. Learn. Res. 2023 (2023)
[j1]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - journals/tmlr/McKenzieLPPMPMK23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/McKenzieLPPMPMK23
Ian R. McKenzie, Alexander Lyzhov, Michael Pieler, Alicia Parrish, Aaron Mueller, Ameya Prabhu, Euan McLean, Aaron Kirtland, Alexis Ross, Alisa Liu, Andrew Gritsevskiy, Daniel Wurgaft, Derik Kauffman, Gabriel Recchia, Jiacheng Liu, Joe Cavanagh, Max Weiss, Sicong Huang, The Floating Droid, Tom Tseng, Tomasz Korbak, Xudong Shen, Yuhui Zhang, Zhengping Zhou, Najoung Kim, Samuel R. Bowman, Ethan Perez:
Inverse Scaling: When Bigger Isn't Better. Trans. Mach. Learn. Res. 2023 (2023)
[c8]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/GoKKRRD23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/GoKKRRD23
Dongyoung Go, Tomasz Korbak, Germán Kruszewski, Jos Rozen, Nahyeon Ryu, Marc Dymetman:
Aligning Language Models with Preferences through f-divergence Minimization. ICML 2023: 11546-11583
[c7]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/KorbakSCBBPBP23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/KorbakSCBBPBP23
Tomasz Korbak, Kejian Shi, Angelica Chen, Rasika Vinayak Bhalerao, Christopher L. Buckley, Jason Phang, Samuel R. Bowman, Ethan Perez:
Pretraining Language Models with Human Preferences. ICML 2023: 17506-17533
[i20]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-08215
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-08215
Dongyoung Go, Tomasz Korbak, Germán Kruszewski, Jos Rozen, Nahyeon Ryu, Marc Dymetman:
Aligning Language Models with Preferences through f-divergence Minimization. CoRR abs/2302.08215 (2023)
[i19]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-08582
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-08582
Tomasz Korbak, Kejian Shi, Angelica Chen, Rasika Bhalerao, Christopher L. Buckley, Jason Phang, Samuel R. Bowman, Ethan Perez:
Pretraining Language Models with Human Preferences. CoRR abs/2302.08582 (2023)
[i18]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-04544
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-04544
Julian Zubek, Tomasz Korbak, Joanna Raczaszek-Leonardi:
Models of symbol emergence in communication: a conceptual review and a guide for avoiding local minima. CoRR abs/2303.04544 (2023)
[i17]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-16749
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-16749
Angelica Chen, Jérémy Scheurer, Tomasz Korbak, Jon Ander Campos, Jun Shern Chan, Samuel R. Bowman, Kyunghyun Cho, Ethan Perez:
Improving Code Generation by Training with Natural Language Feedback. CoRR abs/2303.16749 (2023)
[i16]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-16755
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-16755
Jérémy Scheurer, Jon Ander Campos, Tomasz Korbak, Jun Shern Chan, Angelica Chen, Kyunghyun Cho, Ethan Perez:
Training Language Models with Language Feedback at Scale. CoRR abs/2303.16755 (2023)
[i15]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-09479
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-09479
Ian R. McKenzie, Alexander Lyzhov, Michael Pieler, Alicia Parrish, Aaron Mueller, Ameya Prabhu, Euan McLean, Aaron Kirtland, Alexis Ross, Alisa Liu, Andrew Gritsevskiy, Daniel Wurgaft, Derik Kauffman, Gabriel Recchia, Jiacheng Liu, Joe Cavanagh, Max Weiss, Sicong Huang, The Floating Droid, Tom Tseng, Tomasz Korbak, Xudong Shen, Yuhui Zhang, Zhengping Zhou, Najoung Kim, Samuel R. Bowman, Ethan Perez:
Inverse Scaling: When Bigger Isn't Better. CoRR abs/2306.09479 (2023)
[i14]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-15217
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-15217
Stephen Casper, Xander Davies, Claudia Shi, Thomas Krendl Gilbert, Jérémy Scheurer, Javier Rando, Rachel Freedman, Tomasz Korbak, David Lindner, Pedro Freire, Tony Tong Wang, Samuel Marks, Charbel-Raphaël Ségerie, Micah Carroll, Andi Peng, Phillip J. K. Christoffersen, Mehul Damani, Stewart Slocum, Usman Anwar, Anand Siththaranjan, Max Nadeau, Eric J. Michaud, Jacob Pfau, Dmitrii Krasheninnikov, Xin Chen, Lauro Langosco, Peter Hase, Erdem Biyik, Anca D. Dragan, David Krueger, Dorsa Sadigh, Dylan Hadfield-Menell:
Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback. CoRR abs/2307.15217 (2023)
[i13]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-00667
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-00667
Lukas Berglund, Asa Cooper Stickland, Mikita Balesni, Maximilian Kaufmann, Meg Tong, Tomasz Korbak, Daniel Kokotajlo, Owain Evans:
Taken out of context: On measuring situational awareness in LLMs. CoRR abs/2309.00667 (2023)
[i12]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-12288
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-12288
Lukas Berglund, Meg Tong, Maximilian Kaufmann, Mikita Balesni, Asa Cooper Stickland, Tomasz Korbak, Owain Evans:
The Reversal Curse: LLMs trained on "A is B" fail to learn "B is A". CoRR abs/2309.12288 (2023)
[i11]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-13011
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-13011
Dongyoung Go, Tomasz Korbak, Germán Kruszewski, Jos Rozen, Marc Dymetman:
Compositional preference models for aligning LMs. CoRR abs/2310.13011 (2023)
[i10]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-13548
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-13548
Mrinank Sharma, Meg Tong, Tomasz Korbak, David Duvenaud, Amanda Askell, Samuel R. Bowman, Newton Cheng, Esin Durmus, Zac Hatfield-Dodds, Scott R. Johnston, Shauna Kravec, Timothy Maxwell, Sam McCandlish, Kamal Ndousse, Oliver Rausch, Nicholas Schiefer, Da Yan, Miranda Zhang, Ethan Perez:
Towards Understanding Sycophancy in Language Models. CoRR abs/2310.13548 (2023)
2022
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/KorbakPB22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/KorbakPB22
Tomasz Korbak, Ethan Perez, Christopher L. Buckley:
RL with KL penalties is better viewed as Bayesian inference. EMNLP (Findings) 2022: 1083-1091
[c5]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/KorbakEKD22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/KorbakEKD22
Tomasz Korbak, Hady Elsahar, Germán Kruszewski, Marc Dymetman:
Controlling Conditional Language Models without Catastrophic Forgetting. ICML 2022: 11499-11528
[c4]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/KorbakEKD22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/KorbakEKD22
Tomasz Korbak, Hady Elsahar, Germán Kruszewski, Marc Dymetman:
On Reinforcement Learning and Distribution Matching for Fine-Tuning Language Models with no Catastrophic Forgetting. NeurIPS 2022
[i9]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-11275
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-11275
Tomasz Korbak, Ethan Perez, Christopher L. Buckley:
RL with KL penalties is better viewed as Bayesian inference. CoRR abs/2205.11275 (2022)
[i8]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-00761
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-00761
Tomasz Korbak, Hady Elsahar, Germán Kruszewski, Marc Dymetman:
On Reinforcement Learning and Distribution Matching for Fine-Tuning Language Models with no Catastrophic Forgetting. CoRR abs/2206.00761 (2022)
2021
[c3]
- view
  - electronic edition @ neurips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/KucinskiKKM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/KucinskiKKM21
Lukasz Kucinski, Tomasz Korbak, Pawel Kolodziej, Piotr Milos:
Catalytic Role Of Noise And Necessity Of Inductive Biases In The Emergence Of Compositional Communication. NeurIPS 2021: 23075-23088
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2106-04985
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-04985
Tomasz Korbak, Hady Elsahar, Marc Dymetman, Germán Kruszewski:
Energy-Based Models for Code Generation under Compilability Constraints. CoRR abs/2106.04985 (2021)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2111-06464
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-06464
Lukasz Kucinski, Tomasz Korbak, Pawel Kolodziej, Piotr Milos:
Catalytic Role Of Noise And Necessity Of Inductive Biases In The Emergence Of Compositional Communication. CoRR abs/2111.06464 (2021)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2112-00791
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2112-00791
Tomasz Korbak, Hady Elsahar, Germán Kruszewski, Marc Dymetman:
Controlling Conditional Language Models with Distributional Policy Gradients. CoRR abs/2112.00791 (2021)
2020
[c2]
- view
  - electronic edition @ mindmodeling.org (archived)
  - no references & citations available
- export record
  dblp key:
  - conf/cogsci/GlowkaNWKRZ20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cogsci/GlowkaNWKRZ20
Krzysztof Glówka, Michal Niklewski, Joanna Wiszowata, Tomasz Korbak, Joanna Raczaszek-Leonardi, Julian Zubek:
The Emergence of Action-grounded Compositional Communication. CogSci 2020
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2010-15058
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-15058
Tomasz Korbak, Julian Zubek, Joanna Raczaszek-Leonardi:
Measuring non-trivial compositionality in emergent communication. CoRR abs/2010.15058 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1906-09325
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-09325
Renard Korzeniowski, Rafal Rolczynski, Przemyslaw Sadownik, Tomasz Korbak, Marcin Mozejko:
Exploiting Unsupervised Pre-training and Automated Feature Engineering for Low-resource Hate Speech Detection in Polish. CoRR abs/1906.09325 (2019)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1910-06079
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-06079
Tomasz Korbak, Julian Zubek, Lukasz Kucinski, Piotr Milos, Joanna Raczaszek-Leonardi:
Developmentally motivated emergence of compositional communication via template transfer. CoRR abs/1910.06079 (2019)
2017
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/ltconf/KorbakZ17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ltconf/KorbakZ17
Tomasz Korbak, Paulina Zak:
Fine-Tuning Tree-LSTM for Phrase-Level Sentiment Classification on a Polish Dependency Treebank. LCT 2017: 31-42
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1711-01985
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1711-01985
Tomasz Korbak, Paulina Zak:
Fine-tuning Tree-LSTM for phrase-level sentiment classification on a Polish dependency treebank. Submission to PolEval task 2. CoRR abs/1711.01985 (2017)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.