Yilun Zhao 0001
Person information
- affiliation: Yale University, New Haven, CT, USA
- affiliation (former): Zhejiang University, Hangzhou, China
Other persons with the same name
- Yilun Zhao 0002 — Institute of Computing Technology, Chinese Academy of Sciences, China
- Yilun Zhao 0003 — National Key Laboratory of Scattering and Radiation, Beijing, China (and 1 more)
2020 – today
- 2024
- [j2] Ansong Ni, Pengcheng Yin, Yilun Zhao, Martin Riddell, Troy Feng, Rui Shen, Stephen Yin, Ye Liu, Semih Yavuz, Caiming Xiong, Shafiq Joty, Yingbo Zhou, Dragomir Radev, Arman Cohan:
  L2CEval: Evaluating Language-to-Code Generation Capabilities of Large Language Models. Trans. Assoc. Comput. Linguistics 12: 1311-1329 (2024)
- [c27] Xiangru Tang, Anni Zou, Zhuosheng Zhang, Ziming Li, Yilun Zhao, Xingyao Zhang, Arman Cohan, Mark Gerstein:
  MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning. ACL (Findings) 2024: 599-621
- [c26] Yilun Zhao, Lyuhao Chen, Arman Cohan, Chen Zhao:
  TaPERA: Enhancing Faithfulness and Interpretability in Long-Form Table QA by Content Planning and Execution-based Reasoning. ACL (1) 2024: 12824-12840
- [c25] Yilun Zhao, Hongjun Liu, Yitao Long, Rui Zhang, Chen Zhao, Arman Cohan:
  KnowledgeFMath: A Knowledge-Intensive Math Reasoning Dataset in Finance Domains. ACL (1) 2024: 12841-12858
- [c24] Chunyuan Deng, Yilun Zhao, Yuzhao Heng, Yitong Li, Jiannan Cao, Xiangru Tang, Arman Cohan:
  Unveiling the Spectrum of Data Contamination in Language Model: A Survey from Detection to Remediation. ACL (Findings) 2024: 16078-16092
- [c23] Yilun Zhao, Yitao Long, Hongjun Liu, Ryo Kamoi, Linyong Nan, Lyuhao Chen, Yixin Liu, Xiangru Tang, Rui Zhang, Arman Cohan:
  DocMath-Eval: Evaluating Math Reasoning Capabilities of LLMs in Understanding Financial Documents. ACL (1) 2024: 16103-16120
- [c22] Yuqi Wang, Lyuhao Chen, Songcheng Cai, Zhijian Xu, Yilun Zhao:
  Revisiting Automated Evaluation for Long-form Table Question Answering. EMNLP 2024: 14696-14706
- [c21] Yilun Zhao, Yitao Long, Tintin Jiang, Chengye Wang, Weiyuan Chen, Hongjun Liu, Xiangru Tang, Yiming Zhang, Chen Zhao, Arman Cohan:
  FinDVer: Explainable Claim Verification over Long and Hybrid-content Financial Documents. EMNLP 2024: 14739-14752
- [c20] Chuhan Li, Ziyao Shangguan, Yilun Zhao, Deyuan Li, Yixin Liu, Arman Cohan:
  M3SciQA: A Multi-Modal Multi-Document Scientific QA Benchmark for Evaluating Foundation Models. EMNLP (Findings) 2024: 15419-15446
- [c19] Simeng Han, Aaron Yu, Rui Shen, Zhenting Qi, Martin Riddell, Wenfei Zhou, Yujie Qiao, Yilun Zhao, Semih Yavuz, Ye Liu, Shafiq Joty, Yingbo Zhou, Caiming Xiong, Dragomir Radev, Rex Ying, Arman Cohan:
  P-FOLIO: Evaluating and Improving Logical Reasoning with Abundant Human-Written Reasoning Chains. EMNLP (Findings) 2024: 16553-16565
- [c18] Simeng Han, Hailey Schoelkopf, Yilun Zhao, Zhenting Qi, Martin Riddell, Wenfei Zhou, James Coady, David Peng, Yujie Qiao, Luke Benson, Lucy Sun, Alexander Wardle-Solano, Hannah Szabó, Ekaterina Zubova, Matthew Burtell, Jonathan Fan, Yixin Liu, Brian Wong, Malcolm Sailor, Ansong Ni, Linyong Nan, Jungo Kasai, Tao Yu, Rui Zhang, Alexander R. Fabbri, Wojciech Kryscinski, Semih Yavuz, Ye Liu, Xi Victoria Lin, Shafiq Joty, Yingbo Zhou, Caiming Xiong, Rex Ying, Arman Cohan, Dragomir Radev:
  FOLIO: Natural Language Reasoning with First-Order Logic. EMNLP 2024: 22017-22031
- [c17] Xiangru Tang, Yiming Zong, Jason Phang, Yilun Zhao, Wangchunshu Zhou, Arman Cohan, Mark Gerstein:
  Struc-Bench: Are Large Language Models Good at Generating Complex Structured Tabular Data? NAACL (Short Papers) 2024: 12-34
- [c16] Yixin Liu, Alexander R. Fabbri, Jiawen Chen, Yilun Zhao, Simeng Han, Shafiq Joty, Pengfei Liu, Dragomir Radev, Chien-Sheng Wu, Arman Cohan:
  Benchmarking Generation and Evaluation Capabilities of Large Language Models for Instruction Controllable Summarization. NAACL-HLT (Findings) 2024: 4481-4501
- [c15] Linyong Nan, Ellen Zhang, Weijin Zou, Yilun Zhao, Wenfei Zhou, Arman Cohan:
  On Evaluating the Integration of Reasoning and Action in LLM Agents with Database Question Answering. NAACL-HLT (Findings) 2024: 4556-4579
- [c14] Chunyuan Deng, Yilun Zhao, Xiangru Tang, Mark Gerstein, Arman Cohan:
  Investigating Data Contamination in Modern Benchmarks for Large Language Models. NAACL-HLT 2024: 8706-8719
- [i31] Zhiyuan Hu, Chumin Liu, Xidong Feng, Yilun Zhao, See-Kiong Ng, Anh Tuan Luu, Junxian He, Pang Wei Koh, Bryan Hooi:
  Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models. CoRR abs/2402.03271 (2024)
- [i30] Xiangru Tang, Qiao Jin, Kunlun Zhu, Tongxin Yuan, Yichi Zhang, Wangchunshu Zhou, Meng Qu, Yilun Zhao, Jian Tang, Zhuosheng Zhang, Arman Cohan, Zhiyong Lu, Mark Gerstein:
  Prioritizing Safeguarding Over Autonomy: Risks of LLM Agents for Science. CoRR abs/2402.04247 (2024)
- [i29] Ryo Kamoi, Sarkar Snigdha Sarathi Das, Renze Lou, Jihyun Janice Ahn, Yilun Zhao, Xiaoxin Lu, Nan Zhang, Yusen Zhang, Ranran Haoran Zhang, Sujeeth Reddy Vummanthala, Salika Dave, Shaobo Qin, Arman Cohan, Wenpeng Yin, Rui Zhang:
  Evaluating LLMs at Detecting Errors in LLM Responses. CoRR abs/2404.03602 (2024)
- [i28] Chunyuan Deng, Xiangru Tang, Yilun Zhao, Hanming Wang, Haoran Wang, Wangchunshu Zhou, Arman Cohan, Mark Gerstein:
  MIMIR: A Streamlined Platform for Personalized Agent Tuning in Domain Expertise. CoRR abs/2404.04285 (2024)
- [i27] Xiangru Tang, Xingyao Zhang, Yanjun Shao, Jie Wu, Yilun Zhao, Arman Cohan, Ming Gong, Dongmei Zhang, Mark Gerstein:
  Step-Back Profiling: Distilling User History for Personalized Scientific Writing. CoRR abs/2406.14275 (2024)
- [i26] Chunyuan Deng, Yilun Zhao, Yuzhao Heng, Yitong Li, Jiannan Cao, Xiangru Tang, Arman Cohan:
  Unveiling the Spectrum of Data Contamination in Language Models: A Survey from Detection to Remediation. CoRR abs/2406.14644 (2024)
- [i25] Qianqian Xie, Dong Li, Mengxi Xiao, Zihao Jiang, Ruoyu Xiang, Xiao Zhang, Zhengyu Chen, Yueru He, Weiguang Han, Yuzhe Yang, Shunian Chen, Yifei Zhang, Lihang Shen, Daniel Kim, Zhiwei Liu, Zheheng Luo, Yangyang Yu, Yupeng Cao, Zhiyang Deng, Zhiyuan Yao, Haohang Li, Duanyu Feng, Yongfu Dai, VijayaSai Somasundaram, Peng Lu, Yilun Zhao, Yitao Long, Guojun Xiong, Kaleb Smith, Honghai Yu, Yanzhao Lai, Min Peng, Jianyun Nie, Jordan W. Suchow, Xiao-Yang Liu, Benyou Wang, Alejandro Lopez-Lira, Jimin Huang, Sophia Ananiadou:
  Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications. CoRR abs/2408.11878 (2024)
- 2023
- [c13] Yilun Zhao, Boyu Mi, Zhenting Qi, Linyong Nan, Minghao Guo, Arman Cohan, Dragomir Radev:
  OpenRT: An Open-source Framework for Reasoning Over Tabular Data. ACL (demo) 2023: 336-347
- [c12] Yixin Liu, Alexander R. Fabbri, Pengfei Liu, Yilun Zhao, Linyong Nan, Ruilin Han, Simeng Han, Shafiq Joty, Chien-Sheng Wu, Caiming Xiong, Dragomir Radev:
  Revisiting the Gold Standard: Grounding Summarization Evaluation with Robust Human Evaluation. ACL (1) 2023: 4140-4170
- [c11] Yilun Zhao, Chen Zhao, Linyong Nan, Zhenting Qi, Wenlin Zhang, Xiangru Tang, Boyu Mi, Dragomir Radev:
  RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations. ACL (1) 2023: 6064-6081
- [c10] Yilun Zhao, Zhenting Qi, Linyong Nan, Lorenzo Jaime Yu Flores, Dragomir Radev:
  LoFT: Enhancing Faithfulness and Diversity for Table-to-Text Generation via Logic Form Control. EACL 2023: 554-561
- [c9] Yilun Zhao, Haowei Zhang, Shengyun Si, Linyong Nan, Xiangru Tang, Arman Cohan:
  Investigating Table-to-Text Generation Capabilities of Large Language Models in Real-World Information Seeking Scenarios. EMNLP (Industry Track) 2023: 160-175
- [c8] Yilun Zhao, Zhenting Qi, Linyong Nan, Boyu Mi, Yixin Liu, Weijin Zou, Simeng Han, Ruizhe Chen, Xiangru Tang, Yumo Xu, Dragomir Radev, Arman Cohan:
  QTSumm: Query-Focused Summarization over Tabular Data. EMNLP 2023: 1157-1172
- [c7] Linyong Nan, Yilun Zhao, Weijin Zou, Narutatsu Ri, Jaesung Tae, Ellen Zhang, Arman Cohan, Dragomir Radev:
  Enhancing Text-to-SQL Capabilities of Large Language Models: A Study on Prompt Design Strategies. EMNLP (Findings) 2023: 14935-14956
- [c6] Yixin Liu, Alexander R. Fabbri, Yilun Zhao, Pengfei Liu, Shafiq Joty, Chien-Sheng Wu, Caiming Xiong, Dragomir Radev:
  Towards Interpretable and Efficient Automatic Reference-Based Summarization Evaluation. EMNLP 2023: 16360-16368
- [i24] Yilun Zhao, Zhenting Qi, Linyong Nan, Lorenzo Jaime Yu Flores, Dragomir Radev:
  LoFT: Enhancing Faithfulness and Diversity for Table-to-Text Generation via Logic Form Control. CoRR abs/2302.02962 (2023)
- [i23] Yixin Liu, Alexander R. Fabbri, Yilun Zhao, Pengfei Liu, Shafiq R. Joty, Chien-Sheng Wu, Caiming Xiong, Dragomir Radev:
  Towards Interpretable and Efficient Automatic Reference-Based Summarization Evaluation. CoRR abs/2303.03608 (2023)
- [i22] Linyong Nan, Yilun Zhao, Weijin Zou, Narutatsu Ri, Jaesung Tae, Ellen Zhang, Arman Cohan, Dragomir Radev:
  Enhancing Few-shot Text-to-SQL Capabilities of Large Language Models: A Study on Prompt Design Strategies. CoRR abs/2305.12586 (2023)
- [i21] Yilun Zhao, Zhenting Qi, Linyong Nan, Boyu Mi, Yixin Liu, Weijin Zou, Simeng Han, Xiangru Tang, Yumo Xu, Arman Cohan, Dragomir Radev:
  QTSumm: A New Benchmark for Query-Focused Table Summarization. CoRR abs/2305.14303 (2023)
- [i20] Yilun Zhao, Haowei Zhang, Shengyun Si, Linyong Nan, Xiangru Tang, Arman Cohan:
  Large Language Models are Effective Table-to-Text Generators, Evaluators, and Feedback Providers. CoRR abs/2305.14987 (2023)
- [i19] Yilun Zhao, Chen Zhao, Linyong Nan, Zhenting Qi, Wenlin Zhang, Xiangru Tang, Boyu Mi, Dragomir Radev:
  RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations. CoRR abs/2306.14321 (2023)
- [i18] Yijie Zhou, Kejian Shi, Wencai Zhang, Yixin Liu, Yilun Zhao, Arman Cohan:
  ODSum: New Benchmarks for Open Domain Multi-Document Summarization. CoRR abs/2309.08960 (2023)
- [i17] Xiangru Tang, Yiming Zong, Jason Phang, Yilun Zhao, Wangchunshu Zhou, Arman Cohan, Mark Gerstein:
  Struc-Bench: Are Large Language Models Really Good at Generating Complex Structured Data? CoRR abs/2309.08963 (2023)
- [i16] Ansong Ni, Pengcheng Yin, Yilun Zhao, Martin Riddell, Troy Feng, Rui Shen, Stephen Yin, Ye Liu, Semih Yavuz, Caiming Xiong, Shafiq Joty, Yingbo Zhou, Dragomir Radev, Arman Cohan:
  L2CEval: Evaluating Language-to-Code Generation Capabilities of Large Language Models. CoRR abs/2309.17446 (2023)
- [i15] Yixin Liu, Alexander R. Fabbri, Jiawen Chen, Yilun Zhao, Simeng Han, Shafiq Joty, Pengfei Liu, Dragomir Radev, Chien-Sheng Wu, Arman Cohan:
  Benchmarking Generation and Evaluation Capabilities of Large Language Models for Instruction Controllable Summarization. CoRR abs/2311.09184 (2023)
- [i14] Linyong Nan, Ellen Zhang, Weijin Zou, Yilun Zhao, Wenfei Zhou, Arman Cohan:
  On Evaluating the Integration of Reasoning and Action in LLM Agents with Database Question Answering. CoRR abs/2311.09721 (2023)
- [i13] Chunyuan Deng, Yilun Zhao, Xiangru Tang, Mark Gerstein, Arman Cohan:
  Investigating Data Contamination in Modern Benchmarks for Large Language Models. CoRR abs/2311.09783 (2023)
- [i12] Yilun Zhao, Hongjun Liu, Yitao Long, Rui Zhang, Chen Zhao, Arman Cohan:
  KnowledgeMath: Knowledge-Intensive Math Word Problem Solving in Finance Domains. CoRR abs/2311.09797 (2023)
- [i11] Yilun Zhao, Yitao Long, Hongjun Liu, Linyong Nan, Lyuhao Chen, Ryo Kamoi, Yixin Liu, Xiangru Tang, Rui Zhang, Arman Cohan:
  DocMath-Eval: Evaluating Numerical Reasoning Capabilities of LLMs in Understanding Long Documents with Tabular Data. CoRR abs/2311.09805 (2023)
- [i10] Yuliang Liu, Xiangru Tang, Zefan Cai, Junjie Lu, Yichi Zhang, Yanjun Shao, Zexuan Deng, Helan Hu, Zengxian Yang, Kaikai An, Ruijun Huang, Shuzheng Si, Sheng Chen, Haozhe Zhao, Zhengliang Li, Liang Chen, Yiming Zong, Yan Wang, Tianyu Liu, Zhiwei Jiang, Baobao Chang, Yujia Qin, Wangchunshu Zhou, Yilun Zhao, Arman Cohan, Mark Gerstein:
  ML-Bench: Large Language Models Leverage Open-source Libraries for Machine Learning Tasks. CoRR abs/2311.09835 (2023)
- [i9] Xiangru Tang, Anni Zou, Zhuosheng Zhang, Yilun Zhao, Xingyao Zhang, Arman Cohan, Mark Gerstein:
  MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning. CoRR abs/2311.10537 (2023)
- 2022
- [j1] Zhengxu Yu, Yilun Zhao, Bin Hong, Zhongming Jin, Jianqiang Huang, Deng Cai, Xian-Sheng Hua:
  Apparel-Invariant Feature Learning for Person Re-Identification. IEEE Trans. Multim. 24: 4482-4492 (2022)
- [c5] Yilun Zhao, Yunxiang Li, Chenying Li, Rui Zhang:
  MultiHiertt: Numerical Reasoning over Multi Hierarchical Tabular and Textual Data. ACL (1) 2022: 6588-6600
- [c4] Linyong Nan, Lorenzo Jaime Yu Flores, Yilun Zhao, Yixin Liu, Luke Benson, Weijin Zou, Dragomir Radev:
  R2D2: Robust Data-to-Text with Replacement Detection. EMNLP 2022: 6903-6917
- [c3] Yilun Zhao, Linyong Nan, Zhenting Qi, Rui Zhang, Dragomir Radev:
  ReasTAP: Injecting Table Reasoning Skills During Pre-training via Synthetic Reasoning Examples. EMNLP 2022: 9006-9018
- [c2] Chenying Li, Wenbo Ye, Yilun Zhao:
  FinMath: Injecting a Tree-structured Solver for Question Answering over Financial Reports. LREC 2022: 6147-6152
- [i8] Linyong Nan, Lorenzo Jaime Yu Flores, Yilun Zhao, Yixin Liu, Luke Benson, Weijin Zou, Dragomir R. Radev:
  R2D2: Robust Data-to-Text with Replacement Detection. CoRR abs/2205.12467 (2022)
- [i7] Yilun Zhao, Yunxiang Li, Chenying Li, Rui Zhang:
  MultiHiertt: Numerical Reasoning over Multi Hierarchical Tabular and Textual Data. CoRR abs/2206.01347 (2022)
- [i6] Simeng Han, Hailey Schoelkopf, Yilun Zhao, Zhenting Qi, Martin Riddell, Luke Benson, Lucy Sun, Ekaterina Zubova, Yujie Qiao, Matthew Burtell, David Peng, Jonathan Fan, Yixin Liu, Brian Wong, Malcolm Sailor, Ansong Ni, Linyong Nan, Jungo Kasai, Tao Yu, Rui Zhang, Shafiq R. Joty, Alexander R. Fabbri, Wojciech Kryscinski, Xi Victoria Lin, Caiming Xiong, Dragomir Radev:
  FOLIO: Natural Language Reasoning with First-Order Logic. CoRR abs/2209.00840 (2022)
- [i5] Yilun Zhao, Linyong Nan, Zhenting Qi, Rui Zhang, Dragomir Radev:
  ReasTAP: Injecting Table Reasoning Skills During Pre-training via Synthetic Reasoning Examples. CoRR abs/2210.12374 (2022)
- [i4] Yixin Liu, Alexander R. Fabbri, Pengfei Liu, Yilun Zhao, Linyong Nan, Ruilin Han, Simeng Han, Shafiq R. Joty, Chien-Sheng Wu, Caiming Xiong, Dragomir Radev:
  Revisiting the Gold Standard: Grounding Summarization Evaluation with Robust Human Evaluation. CoRR abs/2212.07981 (2022)
- 2021
- [c1] Yilun Zhao, Jia Guo:
  MusiCoder: A Universal Music-Acoustic Encoder Based on Transformer. MMM (1) 2021: 417-429
- 2020
- [i3] Yilun Zhao, Xinda Wu, Yuqing Ye, Jia Guo, Kejun Zhang:
  MusiCoder: A Universal Music-Acoustic Encoder Based on Transformers. CoRR abs/2008.00781 (2020)
- [i2] Zhengxu Yu, Yilun Zhao, Bin Hong, Zhongming Jin, Jianqiang Huang, Deng Cai, Xiaofei He, Xian-Sheng Hua:
  Apparel-invariant Feature Learning for Apparel-changed Person Re-identification. CoRR abs/2008.06181 (2020)
- [i1] Jia Guo, Chen Zhu, Yilun Zhao, Heda Wang, Yao Hu, Xiaofei He, Deng Cai:
  LAMP: Label Augmented Multimodal Pretraining. CoRR abs/2012.04446 (2020)
last updated on 2024-11-15 19:32 CET by the dblp team
all metadata released as open data under CC0 1.0 license