default search action
Haoxuan You
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c21]Haoxuan You, Mandy Guo, Zhecan Wang, Kai-Wei Chang, Jason M. Baldridge, Jiahui Yu:
CoBIT: A Contrastive Bi-directional Image-Text Generation Model. ICLR 2024 - [c20]Haoxuan You, Haotian Zhang, Zhe Gan, Xianzhi Du, Bowen Zhang, Zirui Wang, Liangliang Cao, Shih-Fu Chang, Yinfei Yang:
Ferret: Refer and Ground Anything Anywhere at Any Granularity. ICLR 2024 - [c19]Junzhang Liu, Zhecan Wang, Hammad A. Ayyubi, Haoxuan You, Chris Thomas, Rui Sun, Shih-Fu Chang, Kai-Wei Chang:
Detecting Multimodal Situations with Insufficient Context and Abstaining from Baseless Predictions. ACM Multimedia 2024: 8402-8411 - [i30]Jingping Nie, Hanya Shao, Yuang Fan, Qijia Shao, Haoxuan You, Matthias Preindl, Xiaofan Jiang:
LLM-based Conversational AI Therapist for Daily Functioning Screening and Psychotherapeutic Intervention via Everyday Smart Devices. CoRR abs/2403.10779 (2024) - [i29]Haotian Zhang, Haoxuan You, Philipp Dufter, Bowen Zhang, Chen Chen, Hong-You Chen, Tsu-Jui Fu, William Yang Wang, Shih-Fu Chang, Zhe Gan, Yinfei Yang:
Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language Models. CoRR abs/2404.07973 (2024) - [i28]Junzhang Liu, Zhecan Wang, Hammad A. Ayyubi, Haoxuan You, Christopher Thomas, Rui Sun, Shih-Fu Chang, Kai-Wei Chang:
Detecting Multimodal Situations with Insufficient Context and Abstaining from Baseless Predictions. CoRR abs/2405.11145 (2024) - [i27]Zhecan Wang, Junzhang Liu, Chia-Wei Tang, Hani Alomari, Anushka Sivakumar, Rui Sun, Wenhao Li, Md. Atabuzzaman, Hammad A. Ayyubi, Haoxuan You, Alvi Md. Ishmam, Kai-Wei Chang, Shih-Fu Chang, Chris Thomas:
JourneyBench: A Challenging One-Stop Vision-Language Understanding Benchmark of Generated Images. CoRR abs/2409.12953 (2024) - [i26]Haotian Zhang, Mingfei Gao, Zhe Gan, Philipp Dufter, Nina Wenzel, Forrest Huang, Dhruti Shah, Xianzhi Du, Bowen Zhang, Yanghao Li, Sam Dodge, Keen You, Zhen Yang, Aleksei Timofeev, Mingze Xu, Hong-You Chen, Jean-Philippe Fauconnier, Zhengfeng Lai, Haoxuan You, Zirui Wang, Afshin Dehghan, Peter Grasch, Yinfei Yang:
MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning. CoRR abs/2409.20566 (2024) - [i25]Hanrong Ye, Haotian Zhang, Erik A. Daxberger, Lin Chen, Zongyu Lin, Yanghao Li, Bowen Zhang, Haoxuan You, Dan Xu, Zhe Gan, Jiasen Lu, Yinfei Yang:
MM-Ego: Towards Building Egocentric Multimodal LLMs. CoRR abs/2410.07177 (2024) - 2023
- [c18]Rui Sun, Zhecan Wang, Haoxuan You, Noel Codella, Kai-Wei Chang, Shih-Fu Chang:
UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding. ACL (Findings) 2023: 778-793 - [c17]Zhecan Wang, Long Chen, Haoxuan You, Keyang Xu, Yicheng He, Wenhao Li, Noel Codella, Kai-Wei Chang, Shih-Fu Chang:
Dataset Bias Mitigation in Multiple-Choice Visual Question Answering and Beyond. EMNLP (Findings) 2023: 8598-8617 - [c16]Haoxuan You, Rui Sun, Zhecan Wang, Long Chen, Gengyu Wang, Hammad A. Ayyubi, Kai-Wei Chang, Shih-Fu Chang:
IdealGPT: Iteratively Decomposing Vision and Language Reasoning via Large Language Models. EMNLP (Findings) 2023: 11289-11303 - [i24]Haoxuan You, Mandy Guo, Zhecan Wang, Kai-Wei Chang, Jason Baldridge, Jiahui Yu:
CoBIT: A Contrastive Bi-directional Image-Text Generation Model. CoRR abs/2303.13455 (2023) - [i23]Haoxuan You, Rui Sun, Zhecan Wang, Long Chen, Gengyu Wang, Hammad A. Ayyubi, Kai-Wei Chang, Shih-Fu Chang:
IdealGPT: Iteratively Decomposing Vision and Language Reasoning via Large Language Models. CoRR abs/2305.14985 (2023) - [i22]Rui Sun, Zhecan Wang, Haoxuan You, Noel Codella, Kai-Wei Chang, Shih-Fu Chang:
UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding. CoRR abs/2307.00862 (2023) - [i21]Haoxuan You, Haotian Zhang, Zhe Gan, Xianzhi Du, Bowen Zhang, Zirui Wang, Liangliang Cao, Shih-Fu Chang, Yinfei Yang:
Ferret: Refer and Ground Anything Anywhere at Any Granularity. CoRR abs/2310.07704 (2023) - [i20]Zhecan Wang, Long Chen, Haoxuan You, Keyang Xu, Yicheng He, Wenhao Li, Noel Codella, Kai-Wei Chang, Shih-Fu Chang:
Dataset Bias Mitigation in Multiple-Choice Visual Question Answering and Beyond. CoRR abs/2310.14670 (2023) - 2022
- [j2]Yifan Feng, Yue Gao, Xibin Zhao, Yandong Guo, Nihar Bagewadi, Nhat-Tan Bui, Hieu Dao, Shankar Gangisetty, Ripeng Guan, Xie Han, Cong Hua, Chidambar Hunakunti, Yu Jiang, Shichao Jiao, Yuqi Ke, Liqun Kuang, Anan Liu, Dinh-Huan Nguyen, Hai-Dang Nguyen, Weizhi Nie, Bang-Dang Pham, Karthik Raikar, Qingmei Tang, Minh-Triet Tran, Jialong Wan, Chenggang Yan, Haoxuan You, Difei Zhu:
SHREC'22 track: Open-Set 3D Object Retrieval. Comput. Graph. 107: 231-240 (2022) - [c15]Zhecan Wang, Haoxuan You, Liunian Harold Li, Alireza Zareian, Suji Park, Yiqing Liang, Kai-Wei Chang, Shih-Fu Chang:
SGEITL: Scene Graph Enhanced Image-Text Learning for Visual Commonsense Reasoning. AAAI 2022: 5914-5922 - [c14]Haoxuan You, Luowei Zhou, Bin Xiao, Noel Codella, Yu Cheng, Ruochen Xu, Shih-Fu Chang, Lu Yuan:
Learning Visual Representation from Modality-Shared Contrastive Language-Image Pre-training. ECCV (27) 2022: 69-87 - [c13]Haoxuan You, Rui Sun, Zhecan Wang, Kai-Wei Chang, Shih-Fu Chang:
Find Someone Who: Visual Commonsense Understanding in Human-Centric Grounding. EMNLP (Findings) 2022: 5444-5454 - [c12]Zhecan Wang, Haoxuan You, Yicheng He, Wenhao Li, Kai-Wei Chang, Shih-Fu Chang:
Understanding ME? Multimodal Evaluation for Fine-grained Visual Commonsense. EMNLP 2022: 9212-9224 - [c11]Xu Ma, Can Qin, Haoxuan You, Haoxi Ran, Yun Fu:
Rethinking Network Design and Local Geometry in Point Cloud: A Simple Residual MLP Framework. ICLR 2022 - [i19]Zhecan Wang, Noel Codella, Yen-Chun Chen, Luowei Zhou, Jianwei Yang, Xiyang Dai, Bin Xiao, Haoxuan You, Shih-Fu Chang, Lu Yuan:
CLIP-TD: CLIP Targeted Distillation for Vision-Language Tasks. CoRR abs/2201.05729 (2022) - [i18]Xu Ma, Can Qin, Haoxuan You, Haoxi Ran, Yun Fu:
Rethinking Network Design and Local Geometry in Point Cloud: A Simple Residual MLP Framework. CoRR abs/2202.07123 (2022) - [i17]Zhecan Wang, Noel Codella, Yen-Chun Chen, Luowei Zhou, Xiyang Dai, Bin Xiao, Jianwei Yang, Haoxuan You, Kai-Wei Chang, Shih-Fu Chang, Lu Yuan:
Multimodal Adaptive Distillation for Leveraging Unimodal Encoders for Vision-Language Tasks. CoRR abs/2204.10496 (2022) - [i16]Haoxuan You, Luowei Zhou, Bin Xiao, Noel Codella, Yu Cheng, Ruochen Xu, Shih-Fu Chang, Lu Yuan:
Learning Visual Representation from Modality-Shared Contrastive Language-Image Pre-training. CoRR abs/2207.12661 (2022) - [i15]Zhecan Wang, Haoxuan You, Yicheng He, Wenhao Li, Kai-Wei Chang, Shih-Fu Chang:
Understanding ME? Multimodal Evaluation for Fine-grained Visual Commonsense. CoRR abs/2211.05895 (2022) - [i14]Haoxuan You, Rui Sun, Zhecan Wang, Kai-Wei Chang, Shih-Fu Chang:
Find Someone Who: Visual Commonsense Understanding in Human-Centric Grounding. CoRR abs/2212.06971 (2022) - 2021
- [c10]Liunian Harold Li, Haoxuan You, Zhecan Wang, Alireza Zareian, Shih-Fu Chang, Kai-Wei Chang:
Unsupervised Vision-and-Language Pre-training Without Parallel Images and Captions. NAACL-HLT 2021: 5339-5350 - [i13]Yang Hu, Haoxuan You, Zhecan Wang, Zhicheng Wang, Erjin Zhou, Yue Gao:
Graph-MLP: Node Classification without Message Passing in Graph. CoRR abs/2106.04051 (2021) - [i12]Zhecan Wang, Haoxuan You, Liunian Harold Li, Alireza Zareian, Suji Park, Yiqing Liang, Kai-Wei Chang, Shih-Fu Chang:
SGEITL: Scene Graph Enhanced Image-Text Learning for Visual Commonsense Reasoning. CoRR abs/2112.08587 (2021) - 2020
- [j1]Min Zhang, Haoxuan You, Pranav Kadam, Shan Liu, C.-C. Jay Kuo:
PointHop: An Explainable Machine Learning Method for Point Cloud Classification. IEEE Trans. Multim. 22(7): 1744-1755 (2020) - [c9]Alireza Zareian, Zhecan Wang, Haoxuan You, Shih-Fu Chang:
Learning Visual Commonsense for Robust Scene Graph Generation. ECCV (23) 2020: 642-657 - [i11]Alireza Zareian, Haoxuan You, Zhecan Wang, Shih-Fu Chang:
Learning Visual Commonsense for Robust Scene Graph Generation. CoRR abs/2006.09623 (2020) - [i10]Liunian Harold Li, Haoxuan You, Zhecan Wang, Alireza Zareian, Shih-Fu Chang, Kai-Wei Chang:
Weakly-supervised VisualBERT: Pre-training without Parallel Images and Captions. CoRR abs/2010.12831 (2020)
2010 – 2019
- 2019
- [c8]Yifan Feng, Haoxuan You, Zizhao Zhang, Rongrong Ji, Yue Gao:
Hypergraph Neural Networks. AAAI 2019: 3558-3565 - [c7]Yutong Feng, Yifan Feng, Haoxuan You, Xibin Zhao, Yue Gao:
MeshNet: Mesh Neural Network for 3D Shape Representation. AAAI 2019: 8279-8286 - [c6]Haoxuan You, Yifan Feng, Xibin Zhao, Changqing Zou, Rongrong Ji, Yue Gao:
PVRNet: Point-View Relation Neural Network for 3D Shape Recognition. AAAI 2019: 9119-9126 - [c5]Peng Gao, Zhengkai Jiang, Haoxuan You, Pan Lu, Steven C. H. Hoi, Xiaogang Wang, Hongsheng Li:
Dynamic Fusion With Intra- and Inter-Modality Attention Flow for Visual Question Answering. CVPR 2019: 6639-6648 - [c4]Peng Gao, Haoxuan You, Zhanpeng Zhang, Xiaogang Wang, Hongsheng Li:
Multi-Modality Latent Interaction Network for Visual Question Answering. ICCV 2019: 5824-5834 - [c3]Zhicheng Jiao, Haoxuan You, Fan Yang, Xin Li, Han Zhang, Dinggang Shen:
Decoding EEG by Visual-guided Deep Neural Networks. IJCAI 2019: 1387-1393 - [c2]Can Qin, Haoxuan You, Lichen Wang, C.-C. Jay Kuo, Yun Fu:
PointDAN: A Multi-Scale 3D Domain Adaption Network for Point Cloud Representation. NeurIPS 2019: 7190-7201 - [i9]Min Zhang, Haoxuan You, Pranav Kadam, Shan Liu, C.-C. Jay Kuo:
PointHop: An Explainable Machine Learning Method for Point Cloud Classification. CoRR abs/1907.12766 (2019) - [i8]Peng Gao, Haoxuan You, Zhanpeng Zhang, Xiaogang Wang, Hongsheng Li:
Multi-modality Latent Interaction Network for Visual Question Answering. CoRR abs/1908.04289 (2019) - [i7]Can Qin, Haoxuan You, Lichen Wang, C.-C. Jay Kuo, Yun Fu:
PointDAN: A Multi-Scale 3D Domain Adaption Network for Point Cloud Representation. CoRR abs/1911.02744 (2019) - 2018
- [c1]Haoxuan You, Yifan Feng, Rongrong Ji, Yue Gao:
PVNet: A Joint Convolutional Network of Point Cloud and Multi-View for 3D Shape Recognition. ACM Multimedia 2018: 1310-1318 - [i6]Haoxuan You, Yifan Feng, Rongrong Ji, Yue Gao:
PVNet: A Joint Convolutional Network of Point Cloud and Multi-View for 3D Shape Recognition. CoRR abs/1808.07659 (2018) - [i5]Yifan Feng, Haoxuan You, Zizhao Zhang, Rongrong Ji, Yue Gao:
Hypergraph Neural Networks. CoRR abs/1809.09401 (2018) - [i4]Yutong Feng, Yifan Feng, Haoxuan You, Xibin Zhao, Yue Gao:
MeshNet: Mesh Neural Network for 3D Shape Representation. CoRR abs/1811.11424 (2018) - [i3]Haoxuan You, Yifan Feng, Xibin Zhao, Changqing Zou, Rongrong Ji, Yue Gao:
PVRNet: Point-View Relation Neural Network for 3D Shape Recognition. CoRR abs/1812.00333 (2018) - [i2]Peng Gao, Hongsheng Li, Haoxuan You, Zhengkai Jiang, Pan Lu, Steven C. H. Hoi, Xiaogang Wang:
Dynamic Fusion with Intra- and Inter- Modality Attention Flow for Visual Question Answering. CoRR abs/1812.05252 (2018) - 2017
- [i1]Haoxuan You, Zhicheng Jiao, Haojun Xu, Jie Li, Ying Wang, Xinbo Gao:
Restricting Greed in Training of Generative Adversarial Network. CoRR abs/1711.10152 (2017)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-18 20:46 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint