Marc G. Bellemare

2020 – today

2024
[j7]
Mark Rowland, Rémi Munos, Mohammad Gheshlaghi Azar, Yunhao Tang, Georg Ostrovski, Anna Harutyunyan, Karl Tuyls, Marc G. Bellemare, Will Dabney:
An Analysis of Quantile Temporal-Difference Learning. J. Mach. Learn. Res. 25: 163:1-163:47 (2024)
[c56]
Harley Wiltzer, Jesse Farebrother, Arthur Gretton, Yunhao Tang, André Barreto, Will Dabney, Marc G. Bellemare, Mark Rowland:
A Distributional Analogue to the Successor Representation. ICML 2024
[i57]
Harley Wiltzer, Jesse Farebrother, Arthur Gretton, Yunhao Tang, André Barreto, Will Dabney, Marc G. Bellemare, Mark Rowland:
A Distributional Analogue to the Successor Representation. CoRR abs/2402.08530 (2024)
[i56]
Nate Rahn, Pierluca D'Oro, Marc G. Bellemare:
Controlling Large Language Model Agents with Entropic Activation Steering. CoRR abs/2406.00244 (2024)
2023
[c55]
Charline Le Lan, Joshua Greaves, Jesse Farebrother, Mark Rowland, Fabian Pedregosa, Rishabh Agarwal, Marc G. Bellemare:
A Novel Stochastic Gradient Descent Algorithm for Learning Principal Subspaces. AISTATS 2023: 1703-1718
[c54]
Pierluca D'Oro, Max Schwarzer, Evgenii Nikishin, Pierre-Luc Bacon, Marc G. Bellemare, Aaron C. Courville:
Sample-Efficient Reinforcement Learning by Breaking the Replay Ratio Barrier. ICLR 2023
[c53]
Jesse Farebrother, Joshua Greaves, Rishabh Agarwal, Charline Le Lan, Ross Goroshin, Pablo Samuel Castro, Marc G. Bellemare:
Proto-Value Networks: Scaling Representation Learning with Auxiliary Tasks. ICLR 2023
[c52]
Johan Samir Obando-Ceron, Marc G. Bellemare, Pablo Samuel Castro:
The Small Batch Size Anomaly in Multistep Deep Reinforcement Learning. Tiny Papers @ ICLR 2023
[c51]
Adrien Ali Taïga, Rishabh Agarwal, Jesse Farebrother, Aaron C. Courville, Marc G. Bellemare:
Investigating Multi-task Pretraining and Generalization in Reinforcement Learning. ICLR 2023
[c50]
Charline Le Lan, Stephen Tu, Mark Rowland, Anna Harutyunyan, Rishabh Agarwal, Marc G. Bellemare, Will Dabney:
Bootstrapped Representations in Reinforcement Learning. ICML 2023: 18686-18713
[c49]
Mark Rowland, Yunhao Tang, Clare Lyle, Rémi Munos, Marc G. Bellemare, Will Dabney:
The Statistical Benefits of Quantile Temporal-Difference Learning for Value Estimation. ICML 2023: 29210-29231
[c48]
Max Schwarzer, Johan Samir Obando-Ceron, Aaron C. Courville, Marc G. Bellemare, Rishabh Agarwal, Pablo Samuel Castro:
Bigger, Better, Faster: Human-level Atari with human-level efficiency. ICML 2023: 30365-30380
[c47]
Johan S. Obando-Ceron, Marc G. Bellemare, Pablo Samuel Castro:
Small batch deep reinforcement learning. NeurIPS 2023
[c46]
Nate Rahn, Pierluca D'Oro, Harley Wiltzer, Pierre-Luc Bacon, Marc G. Bellemare:
Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control. NeurIPS 2023
[i55]
Mark Rowland, Rémi Munos, Mohammad Gheshlaghi Azar, Yunhao Tang, Georg Ostrovski, Anna Harutyunyan, Karl Tuyls, Marc G. Bellemare, Will Dabney:
An Analysis of Quantile Temporal-Difference Learning. CoRR abs/2301.04462 (2023)
[i54]
Jesse Farebrother, Joshua Greaves, Rishabh Agarwal, Charline Le Lan, Ross Goroshin, Pablo Samuel Castro, Marc G. Bellemare:
Proto-Value Networks: Scaling Representation Learning with Auxiliary Tasks. CoRR abs/2304.12567 (2023)
[i53]
Mark Rowland, Yunhao Tang, Clare Lyle, Rémi Munos, Marc G. Bellemare, Will Dabney:
The Statistical Benefits of Quantile Temporal-Difference Learning for Value Estimation. CoRR abs/2305.18388 (2023)
[i52]
Max Schwarzer, Johan S. Obando-Ceron, Aaron C. Courville, Marc G. Bellemare, Rishabh Agarwal, Pablo Samuel Castro:
Bigger, Better, Faster: Human-level Atari with human-level efficiency. CoRR abs/2305.19452 (2023)
[i51]
Charline Le Lan, Stephen Tu, Mark Rowland, Anna Harutyunyan, Rishabh Agarwal, Marc G. Bellemare, Will Dabney:
Bootstrapped Representations in Reinforcement Learning. CoRR abs/2306.10171 (2023)
[i50]
Nate Rahn, Pierluca D'Oro, Harley Wiltzer, Pierre-Luc Bacon, Marc G. Bellemare:
Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control. CoRR abs/2309.14597 (2023)
[i49]
Johan S. Obando-Ceron, Marc G. Bellemare, Pablo Samuel Castro:
Small batch deep reinforcement learning. CoRR abs/2310.03882 (2023)
[i48]
Max Schwarzer, Jesse Farebrother, Joshua Greaves, Ekin Dogus Cubuk, Rishabh Agarwal, Aaron C. Courville, Marc G. Bellemare, Sergei V. Kalinin, Igor Mordatch, Pablo Samuel Castro, Kevin M. Roccapriore:
Learning and Controlling Silicon Dopant Transitions in Graphene using Scanning Transmission Electron Microscopy. CoRR abs/2311.17894 (2023)
2022
[c45]
Charline Le Lan, Stephen Tu, Adam Oberman, Rishabh Agarwal, Marc G. Bellemare:
On the Generalization of Representations in Reinforcement Learning. AISTATS 2022: 4132-4157
[c44]
Harley E. Wiltzer, David Meger, Marc G. Bellemare:
Distributional Hamilton-Jacobi-Bellman Equations for Continuous-Time Reinforcement Learning. ICML 2022: 23832-23856
[c43]
Rishabh Agarwal, Max Schwarzer, Pablo Samuel Castro, Aaron C. Courville, Marc G. Bellemare:
Reincarnating Reinforcement Learning: Reusing Prior Computation to Accelerate Progress. NeurIPS 2022
[c42]
Yunhao Tang, Rémi Munos, Mark Rowland, Bernardo Ávila Pires, Will Dabney, Marc G. Bellemare:
The Nature of Temporal Difference Errors in Multi-step Distributional Reinforcement Learning. NeurIPS 2022
[i47]
Charline Le Lan, Stephen Tu, Adam Oberman, Rishabh Agarwal, Marc G. Bellemare:
On the Generalization of Representations in Reinforcement Learning. CoRR abs/2203.00543 (2022)
[i46]
Harley Wiltzer, David Meger, Marc G. Bellemare:
Distributional Hamilton-Jacobi-Bellman Equations for Continuous-Time Reinforcement Learning. CoRR abs/2205.12184 (2022)
[i45]
Rishabh Agarwal, Max Schwarzer, Pablo Samuel Castro, Aaron C. Courville, Marc G. Bellemare:
Beyond Tabula Rasa: Reincarnating Reinforcement Learning. CoRR abs/2206.01626 (2022)
[i44]
Yunhao Tang, Mark Rowland, Rémi Munos, Bernardo Ávila Pires, Will Dabney, Marc G. Bellemare:
The Nature of Temporal Difference Errors in Multi-step Distributional Reinforcement Learning. CoRR abs/2207.07570 (2022)
[i43]
Charline Le Lan, Joshua Greaves, Jesse Farebrother, Mark Rowland, Fabian Pedregosa, Rishabh Agarwal, Marc G. Bellemare:
A Novel Stochastic Gradient Descent Algorithm for Learning Principal Subspaces. CoRR abs/2212.04025 (2022)
2021
[c41]
Will Dabney, André Barreto, Mark Rowland, Robert Dadashi, John Quan, Marc G. Bellemare, David Silver:
The Value-Improvement Path: Towards Better Representations for Reinforcement Learning. AAAI 2021: 7160-7168
[c40]
Charline Le Lan, Marc G. Bellemare, Pablo Samuel Castro:
Metrics and Continuity in Reinforcement Learning. AAAI 2021: 8261-8269
[c39]
Rishabh Agarwal, Marlos C. Machado, Pablo Samuel Castro, Marc G. Bellemare:
Contrastive Behavioral Similarity Embeddings for Generalization in Reinforcement Learning. ICLR 2021
[c38]
Jacob Buckman, Carles Gelada, Marc G. Bellemare:
The Importance of Pessimism in Fixed-Dataset Policy Optimization. ICLR 2021
[c37]
Rishabh Agarwal, Max Schwarzer, Pablo Samuel Castro, Aaron C. Courville, Marc G. Bellemare:
Deep Reinforcement Learning at the Edge of the Statistical Precipice. NeurIPS 2021: 29304-29320
[i42]
Rishabh Agarwal, Marlos C. Machado, Pablo Samuel Castro, Marc G. Bellemare:
Contrastive Behavioral Similarity Embeddings for Generalization in Reinforcement Learning. CoRR abs/2101.05265 (2021)
[i41]
Charline Le Lan, Marc G. Bellemare, Pablo Samuel Castro:
Metrics and continuity in reinforcement learning. CoRR abs/2102.01514 (2021)
[i40]
Rishabh Agarwal, Max Schwarzer, Pablo Samuel Castro, Aaron C. Courville, Marc G. Bellemare:
Deep Reinforcement Learning at the Edge of the Statistical Precipice. CoRR abs/2108.13264 (2021)
[i39]
Adrien Ali Taïga, William Fedus, Marlos C. Machado, Aaron C. Courville, Marc G. Bellemare:
On Bonus-Based Exploration Methods in the Arcade Learning Environment. CoRR abs/2109.11052 (2021)
2020
[j6]
Nolan Bard, Jakob N. Foerster, Sarath Chandar, Neil Burch, Marc Lanctot, H. Francis Song, Emilio Parisotto, Vincent Dumoulin, Subhodeep Moitra, Edward Hughes, Iain Dunning, Shibl Mourad, Hugo Larochelle, Marc G. Bellemare, Michael Bowling:
The Hanabi challenge: A new frontier for AI research. Artif. Intell. 280: 103216 (2020)
[j5]
Marc G. Bellemare, Salvatore Candido, Pablo Samuel Castro, Jun Gong, Marlos C. Machado, Subhodeep Moitra, Sameera S. Ponda, Ziyu Wang:
Autonomous navigation of stratospheric balloons using reinforcement learning. Nat. 588(7836): 77-82 (2020)
[c36]
Vishal Jain, William Fedus, Hugo Larochelle, Doina Precup, Marc G. Bellemare:
Algorithmic Improvements for Deep Reinforcement Learning Applied to Interactive Fiction. AAAI 2020: 4328-4336
[c35]
Marlos C. Machado, Marc G. Bellemare, Michael Bowling:
Count-Based Exploration with the Successor Representation. AAAI 2020: 5125-5133
[c34]
Philip Amortila, Doina Precup, Prakash Panangaden, Marc G. Bellemare:
A Distributional Analysis of Sampling-Based Reinforcement Learning Algorithms. AISTATS 2020: 4357-4366
[c33]
Kory Wallace Mathewson, Pablo Samuel Castro, Colin Cherry, George F. Foster, Marc G. Bellemare:
Shaping the Narrative Arc: Information-Theoretic Collaborative Dialogue. ICCC 2020: 9-16
[c32]
Adrien Ali Taïga, William Fedus, Marlos C. Machado, Aaron C. Courville, Marc G. Bellemare:
On Bonus Based Exploration Methods In The Arcade Learning Environment. ICLR 2020
[c31]
Dibya Ghosh, Marc G. Bellemare:
Representations for Stable Off-Policy Reinforcement Learning. ICML 2020: 3556-3565
[i38]
William Fedus, Dibya Ghosh, John D. Martin, Marc G. Bellemare, Yoshua Bengio, Hugo Larochelle:
On Catastrophic Interference in Atari 2600 Games. CoRR abs/2002.12499 (2020)
[i37]
Ahmed Touati, Adrien Ali Taïga, Marc G. Bellemare:
Zooming for Efficient Model-Free Reinforcement Learning in Metric Spaces. CoRR abs/2003.04069 (2020)
[i36]
Philip Amortila, Doina Precup, Prakash Panangaden, Marc G. Bellemare:
A Distributional Analysis of Sampling-Based Reinforcement Learning Algorithms. CoRR abs/2003.12239 (2020)
[i35]
Will Dabney, André Barreto, Mark Rowland, Robert Dadashi, John Quan, Marc G. Bellemare, David Silver:
The Value-Improvement Path: Towards Better Representations for Reinforcement Learning. CoRR abs/2006.02243 (2020)
[i34]
Dibya Ghosh, Marc G. Bellemare:
Representations for Stable Off-Policy Reinforcement Learning. CoRR abs/2007.05520 (2020)
[i33]
Jacob Buckman, Carles Gelada, Marc G. Bellemare:
The Importance of Pessimism in Fixed-Dataset Policy Optimization. CoRR abs/2009.06799 (2020)

2010 – 2019

2019
[c30]
Philip Amortila, Marc G. Bellemare, Prakash Panangaden, Doina Precup:
Temporally Extended Metrics for Markov Decision Processes. SafeAI@AAAI 2019
[c29]
Carles Gelada, Marc G. Bellemare:
Off-Policy Deep Reinforcement Learning by Bootstrapping the Covariate Shift. AAAI 2019: 3647-3655
[c28]
Clare Lyle, Marc G. Bellemare, Pablo Samuel Castro:
A Comparative Analysis of Expected and Distributional Reinforcement Learning. AAAI 2019: 4504-4511
[c27]
Marc G. Bellemare, Nicolas Le Roux, Pablo Samuel Castro, Subhodeep Moitra:
Distributional reinforcement learning with linear function approximation. AISTATS 2019: 2203-2211
[c26]
Robert Dadashi, Marc G. Bellemare, Adrien Ali Taïga, Nicolas Le Roux, Dale Schuurmans:
The Value Function Polytope in Reinforcement Learning. ICML 2019: 1486-1495
[c25]
Carles Gelada, Saurabh Kumar, Jacob Buckman, Ofir Nachum, Marc G. Bellemare:
DeepMDP: Learning Continuous Latent Space Models for Representation Learning. ICML 2019: 2170-2179
[c24]
Mark Rowland, Robert Dadashi, Saurabh Kumar, Rémi Munos, Marc G. Bellemare, Will Dabney:
Statistics and Samples in Distributional Reinforcement Learning. ICML 2019: 5528-5536
[c23]
Felipe Petroski Such, Vashisht Madhavan, Rosanne Liu, Rui Wang, Pablo Samuel Castro, Yulun Li, Jiale Zhi, Ludwig Schubert, Marc G. Bellemare, Jeff Clune, Joel Lehman:
An Atari Model Zoo for Analyzing, Visualizing, and Comparing Deep Reinforcement Learning Agents. IJCAI 2019: 3260-3267
[c22]
Marc G. Bellemare, Will Dabney, Robert Dadashi, Adrien Ali Taïga, Pablo Samuel Castro, Nicolas Le Roux, Dale Schuurmans, Tor Lattimore, Clare Lyle:
A Geometric Perspective on Optimal Representations for Reinforcement Learning. NeurIPS 2019: 4360-4371
[i32]
Carles Gelada, Marc G. Bellemare:
Off-Policy Deep Reinforcement Learning by Bootstrapping the Covariate Shift. CoRR abs/1901.09455 (2019)
[i31]
Clare Lyle, Pablo Samuel Castro, Marc G. Bellemare:
A Comparative Analysis of Expected and Distributional Reinforcement Learning. CoRR abs/1901.11084 (2019)
[i30]
Robert Dadashi, Adrien Ali Taïga, Nicolas Le Roux, Dale Schuurmans, Marc G. Bellemare:
The Value Function Polytope in Reinforcement Learning. CoRR abs/1901.11524 (2019)
[i29]
Kory W. Mathewson, Pablo Samuel Castro, Colin Cherry, George F. Foster, Marc G. Bellemare:
Shaping the Narrative Arc: An Information-Theoretic Approach to Collaborative Dialogue. CoRR abs/1901.11528 (2019)
[i28]
Marc G. Bellemare, Will Dabney, Robert Dadashi, Adrien Ali Taïga, Pablo Samuel Castro, Nicolas Le Roux, Dale Schuurmans, Tor Lattimore, Clare Lyle:
A Geometric Perspective on Optimal Representations for Reinforcement Learning. CoRR abs/1901.11530 (2019)
[i27]
Nolan Bard, Jakob N. Foerster, Sarath Chandar, Neil Burch, Marc Lanctot, H. Francis Song, Emilio Parisotto, Vincent Dumoulin, Subhodeep Moitra, Edward Hughes, Iain Dunning, Shibl Mourad, Hugo Larochelle, Marc G. Bellemare, Michael Bowling:
The Hanabi Challenge: A New Frontier for AI Research. CoRR abs/1902.00506 (2019)
[i26]
Marc G. Bellemare, Nicolas Le Roux, Pablo Samuel Castro, Subhodeep Moitra:
Distributional reinforcement learning with linear function approximation. CoRR abs/1902.03149 (2019)
[i25]
William Fedus, Carles Gelada, Yoshua Bengio, Marc G. Bellemare, Hugo Larochelle:
Hyperbolic Discounting and Learning over Multiple Horizons. CoRR abs/1902.06865 (2019)
[i24]
Mark Rowland, Robert Dadashi, Saurabh Kumar, Rémi Munos, Marc G. Bellemare, Will Dabney:
Statistics and Samples in Distributional Reinforcement Learning. CoRR abs/1902.08102 (2019)
[i23]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-02736
Carles Gelada, Saurabh Kumar, Jacob Buckman, Ofir Nachum, Marc G. Bellemare:
DeepMDP: Learning Continuous Latent Space Models for Representation Learning. CoRR abs/1906.02736 (2019)
[i22]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1908-02388
Adrien Ali Taïga, William Fedus, Marlos C. Machado, Aaron C. Courville, Marc G. Bellemare:
Benchmarking Bonus-Based Exploration Methods on the Arcade Learning Environment. CoRR abs/1908.02388 (2019)
[i21]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1911-12511
Vishal Jain, William Fedus, Hugo Larochelle, Doina Precup, Marc G. Bellemare:
Algorithmic Improvements for Deep Reinforcement Learning applied to Interactive Fiction. CoRR abs/1911.12511 (2019)
2018
[j4]
  persistent URL:
  - https://dblp.org/rec/journals/ftml/Francois-LavetH18
Vincent François-Lavet, Peter Henderson, Riashat Islam, Marc G. Bellemare, Joelle Pineau:
An Introduction to Deep Reinforcement Learning. Found. Trends Mach. Learn. 11(3-4): 219-354 (2018)
[j3]
  persistent URL:
  - https://dblp.org/rec/journals/jair/MachadoBTVHB18
Marlos C. Machado, Marc G. Bellemare, Erik Talvitie, Joel Veness, Matthew J. Hausknecht, Michael Bowling:
Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents. J. Artif. Intell. Res. 61: 523-562 (2018)
[c21]
  persistent URL:
  - https://dblp.org/rec/conf/aaai/DabneyRBM18
Will Dabney, Mark Rowland, Marc G. Bellemare, Rémi Munos:
Distributional Reinforcement Learning With Quantile Regression. AAAI 2018: 2892-2901
[c20]
  persistent URL:
  - https://dblp.org/rec/conf/aistats/RowlandBDMT18
Mark Rowland, Marc G. Bellemare, Will Dabney, Rémi Munos, Yee Whye Teh:
An Analysis of Categorical Distributional Reinforcement Learning. AISTATS 2018: 29-37
[c19]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/GruslysDAPBM18
Audrunas Gruslys, Will Dabney, Mohammad Gheshlaghi Azar, Bilal Piot, Marc G. Bellemare, Rémi Munos:
The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning. ICLR (Poster) 2018
[c18]
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/MachadoBTVHB18
Marlos C. Machado, Marc G. Bellemare, Erik Talvitie, Joel Veness, Matthew J. Hausknecht, Michael Bowling:
Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents (Extended Abstract). IJCAI 2018: 5573-5577
[i20]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1807-11622
Marlos C. Machado, Marc G. Bellemare, Michael Bowling:
Count-Based Exploration with the Successor Representation. CoRR abs/1807.11622 (2018)
[i19]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1808-09819
Adrien Ali Taïga, Aaron C. Courville, Marc G. Bellemare:
Approximate Exploration through State Abstraction. CoRR abs/1808.09819 (2018)
[i18]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-07004
Tom Schaul, Hado van Hasselt, Joseph Modayil, Martha White, Adam White, Pierre-Luc Bacon, Jean Harb, Shibl Mourad, Marc G. Bellemare, Doina Precup:
The Barbados 2018 List of Open Issues in Continual Learning. CoRR abs/1811.07004 (2018)
[i17]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-12560
Vincent François-Lavet, Peter Henderson, Riashat Islam, Marc G. Bellemare, Joelle Pineau:
An Introduction to Deep Reinforcement Learning. CoRR abs/1811.12560 (2018)
[i16]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1812-06110
Pablo Samuel Castro, Subhodeep Moitra, Carles Gelada, Saurabh Kumar, Marc G. Bellemare:
Dopamine: A Research Framework for Deep Reinforcement Learning. CoRR abs/1812.06110 (2018)
[i15]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1812-07069
Felipe Petroski Such, Vashisht Madhavan, Rosanne Liu, Rui Wang, Pablo Samuel Castro, Yulun Li, Ludwig Schubert, Marc G. Bellemare, Jeff Clune, Joel Lehman:
An Atari Model Zoo for Analyzing, Visualizing, and Comparing Deep Reinforcement Learning Agents. CoRR abs/1812.07069 (2018)
2017
[c17]
  persistent URL:
  - https://dblp.org/rec/conf/icml/BellemareDM17
Marc G. Bellemare, Will Dabney, Rémi Munos:
A Distributional Perspective on Reinforcement Learning. ICML 2017: 449-458
[c16]
  persistent URL:
  - https://dblp.org/rec/conf/icml/GravesBMMK17
Alex Graves, Marc G. Bellemare, Jacob Menick, Rémi Munos, Koray Kavukcuoglu:
Automated Curriculum Learning for Neural Networks. ICML 2017: 1311-1320
[c15]
  persistent URL:
  - https://dblp.org/rec/conf/icml/MachadoBB17
Marlos C. Machado, Marc G. Bellemare, Michael H. Bowling:
A Laplacian Framework for Option Discovery in Reinforcement Learning. ICML 2017: 2295-2304
[c14]
  persistent URL:
  - https://dblp.org/rec/conf/icml/OstrovskiBOM17
Georg Ostrovski, Marc G. Bellemare, Aäron van den Oord, Rémi Munos:
Count-Based Exploration with Neural Density Models. ICML 2017: 2721-2730
[i14]
  persistent URL:
  - https://dblp.org/rec/journals/corr/MachadoBB17
Marlos C. Machado, Marc G. Bellemare, Michael H. Bowling:
A Laplacian Framework for Option Discovery in Reinforcement Learning. CoRR abs/1703.00956 (2017)
[i13]
  persistent URL:
  - https://dblp.org/rec/journals/corr/OstrovskiBOM17
Georg Ostrovski, Marc G. Bellemare, Aäron van den Oord, Rémi Munos:
Count-Based Exploration with Neural Density Models. CoRR abs/1703.01310 (2017)
[i12]
  persistent URL:
  - https://dblp.org/rec/journals/corr/GravesBMMK17
Alex Graves, Marc G. Bellemare, Jacob Menick, Rémi Munos, Koray Kavukcuoglu:
Automated Curriculum Learning for Neural Networks. CoRR abs/1704.03003 (2017)
[i11]
  persistent URL:
  - https://dblp.org/rec/journals/corr/GruslysABM17
Audrunas Gruslys, Mohammad Gheshlaghi Azar, Marc G. Bellemare, Rémi Munos:
The Reactor: A Sample-Efficient Actor-Critic Architecture. CoRR abs/1704.04651 (2017)
[i10]
  persistent URL:
  - https://dblp.org/rec/journals/corr/BellemareDDMLHM17
Marc G. Bellemare, Ivo Danihelka, Will Dabney, Shakir Mohamed, Balaji Lakshminarayanan, Stephan Hoyer, Rémi Munos:
The Cramer Distance as a Solution to Biased Wasserstein Gradients. CoRR abs/1705.10743 (2017)
[i9]
  persistent URL:
  - https://dblp.org/rec/journals/corr/BellemareDM17
Marc G. Bellemare, Will Dabney, Rémi Munos:
A Distributional Perspective on Reinforcement Learning. CoRR abs/1707.06887 (2017)
[i8]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1709-06009
Marlos C. Machado, Marc G. Bellemare, Erik Talvitie, Joel Veness, Matthew J. Hausknecht, Michael Bowling:
Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents. CoRR abs/1709.06009 (2017)
[i7]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1710-10044
Will Dabney, Mark Rowland, Marc G. Bellemare, Rémi Munos:
Distributional Reinforcement Learning with Quantile Regression. CoRR abs/1710.10044 (2017)
2016
[c13]
  persistent URL:
  - https://dblp.org/rec/conf/aaai/BellemareOGTM16
Marc G. Bellemare, Georg Ostrovski, Arthur Guez, Philip S. Thomas, Rémi Munos:
Increasing the Action Gap: New Operators for Reinforcement Learning. AAAI 2016: 1476-1483
[c12]
  persistent URL:
  - https://dblp.org/rec/conf/alt/HarutyunyanBSM16
Anna Harutyunyan, Marc G. Bellemare, Tom Stepleton, Rémi Munos:
Q(λ) with Off-Policy Corrections. ALT 2016: 305-320
[c11]
  persistent URL:
  - https://dblp.org/rec/conf/nips/MunosSHB16
Rémi Munos, Tom Stepleton, Anna Harutyunyan, Marc G. Bellemare:
Safe and Efficient Off-Policy Reinforcement Learning. NIPS 2016: 1046-1054
[c10]
  persistent URL:
  - https://dblp.org/rec/conf/nips/BellemareSOSSM16
Marc G. Bellemare, Sriram Srinivasan, Georg Ostrovski, Tom Schaul, David Saxton, Rémi Munos:
Unifying Count-Based Exploration and Intrinsic Motivation. NIPS 2016: 1471-1479
[i6]
  persistent URL:
  - https://dblp.org/rec/journals/corr/HarutyunyanBSM16
Anna Harutyunyan, Marc G. Bellemare, Tom Stepleton, Rémi Munos:
Q(λ) with Off-Policy Corrections. CoRR abs/1602.04951 (2016)
[i5]
  persistent URL:
  - https://dblp.org/rec/journals/corr/BellemareSOSSM16
Marc G. Bellemare, Sriram Srinivasan, Georg Ostrovski, Tom Schaul, David Saxton, Rémi Munos:
Unifying Count-Based Exploration and Intrinsic Motivation. CoRR abs/1606.01868 (2016)
[i4]
  persistent URL:
  - https://dblp.org/rec/journals/corr/MunosSHB16
Rémi Munos, Tom Stepleton, Anna Harutyunyan, Marc G. Bellemare:
Safe and Efficient Off-Policy Reinforcement Learning. CoRR abs/1606.02647 (2016)
2015
[j2]
  persistent URL:
  - https://dblp.org/rec/journals/nature/MnihKSRVBGRFOPB15
Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A. Rusu, Joel Veness, Marc G. Bellemare, Alex Graves, Martin A. Riedmiller, Andreas Fidjeland, Georg Ostrovski, Stig Petersen, Charles Beattie, Amir Sadik, Ioannis Antonoglou, Helen King, Dharshan Kumaran, Daan Wierstra, Shane Legg, Demis Hassabis:
Human-level control through deep reinforcement learning. Nat. 518(7540): 529-533 (2015)
[c9]
  persistent URL:
  - https://dblp.org/rec/conf/aaai/VenessBHCD15
Joel Veness, Marc G. Bellemare, Marcus Hutter, Alvin Chua, Guillaume Desjardins:
Compress and Control. AAAI 2015: 3016-3023
[c8]
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/Bellemare15
Marc G. Bellemare:
Count-Based Frequency Estimation with Bounded Memory. IJCAI 2015: 3337-3344
[c7]
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/VenessHOB15
Joel Veness, Marcus Hutter, Laurent Orseau, Marc G. Bellemare:
Online Learning of k-CNF Boolean Functions. IJCAI 2015: 3865-3873
[c6]
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/BellemareNVB15
Marc G. Bellemare, Yavar Naddaf, Joel Veness, Michael Bowling:
The Arcade Learning Environment: An Evaluation Platform for General Agents (Extended Abstract). IJCAI 2015: 4148-4152
[e1]
  persistent URL:
  - https://dblp.org/rec/conf/aaai/2015games
Michael Bowling, Marc G. Bellemare, Erik Talvitie, Joel Veness, Marlos C. Machado:
Learning for General Competency in Video Games, Papers from the 2015 AAAI Workshop, Austin, Texas, USA, January 26, 2015. AAAI Technical Report WS-15-10, AAAI Press 2015, ISBN 978-1-57735-721-6
[i3]
  persistent URL:
  - https://dblp.org/rec/journals/corr/BellemareOGTM15
Marc G. Bellemare, Georg Ostrovski, Arthur Guez, Philip S. Thomas, Rémi Munos:
Increasing the Action Gap: New Operators for Reinforcement Learning. CoRR abs/1512.04860 (2015)
2014
[c5]
  persistent URL:
  - https://dblp.org/rec/conf/icml/BellemareVT14
Marc G. Bellemare, Joel Veness, Erik Talvitie:
Skip Context Tree Switching. ICML 2014: 1458-1466
[i2]
  persistent URL:
  - https://dblp.org/rec/journals/corr/VenessBHCD14
Joel Veness, Marc G. Bellemare, Marcus Hutter, Alvin Chua, Guillaume Desjardins:
Compress and Control. CoRR abs/1411.5326 (2014)
2013
[j1]
  persistent URL:
  - https://dblp.org/rec/journals/jair/BellemareNVB13
Marc G. Bellemare, Yavar Naddaf, Joel Veness, Michael Bowling:
The Arcade Learning Environment: An Evaluation Platform for General Agents. J. Artif. Intell. Res. 47: 253-279 (2013)
[c4]
  persistent URL:
  - https://dblp.org/rec/conf/icml/BellemareVB13
Marc G. Bellemare, Joel Veness, Michael Bowling:
Bayesian Learning of Recursively Factored Environments. ICML (3) 2013: 1211-1219
2012
[c3]
  persistent URL:
  - https://dblp.org/rec/conf/aaai/BellemareVB12
Marc G. Bellemare, Joel Veness, Michael Bowling:
Investigating Contingency Awareness Using Atari 2600 Games. AAAI 2012: 864-871
[c2]
  persistent URL:
  - https://dblp.org/rec/conf/nips/BellemareVB12
Marc G. Bellemare, Joel Veness, Michael Bowling:
Sketch-Based Linear Value Function Approximation. NIPS 2012: 2222-2230
[i1]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1207-4708
Marc G. Bellemare, Yavar Naddaf, Joel Veness, Michael Bowling:
The Arcade Learning Environment: An Evaluation Platform for General Agents. CoRR abs/1207.4708 (2012)

2000 – 2009

2007
[c1]
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/BellemareP07
Marc G. Bellemare, Doina Precup:
Context-Driven Predictions. IJCAI 2007: 250-255
