default search action
Xianzhi Du
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c25]Zhengfeng Lai, Haotian Zhang, Bowen Zhang, Wentao Wu, Haoping Bai, Aleksei Timofeev, Xianzhi Du, Zhe Gan, Jiulong Shan, Chen-Nee Chuah, Yinfei Yang, Meng Cao:
VeCLIP: Improving CLIP Training via Visual-Enriched Captions. ECCV (42) 2024: 111-127 - [c24]Brandon McKinzie, Zhe Gan, Jean-Philippe Fauconnier, Sam Dodge, Bowen Zhang, Philipp Dufter, Dhruti Shah, Xianzhi Du, Futang Peng, Anton Belyi, Haotian Zhang, Karanjeet Singh, Doug Kang, Hongyu Hè, Max Schwarzer, Tom Gunter, Xiang Kong, Aonan Zhang, Jianyu Wang, Chong Wang, Nan Du, Tao Lei, Sam Wiseman, Mark Lee, Zirui Wang, Ruoming Pang, Peter Grasch, Alexander Toshev, Yinfei Yang:
MM1: Methods, Analysis and Insights from Multimodal LLM Pre-training. ECCV (29) 2024: 304-323 - [c23]Tsu-Jui Fu, Wenze Hu, Xianzhi Du, William Yang Wang, Yinfei Yang, Zhe Gan:
Guiding Instruction-based Image Editing via Multimodal Large Language Models. ICLR 2024 - [c22]Ajay Kumar Jaiswal, Zhe Gan, Xianzhi Du, Bowen Zhang, Zhangyang Wang, Yinfei Yang:
Compressing LLMs: The Truth is Rarely Pure and Never Simple. ICLR 2024 - [c21]Wentao Wu, Aleksei Timofeev, Chen Chen, Bowen Zhang, Kun Duan, Shuangning Liu, Yantao Zheng, Jonathon Shlens, Xianzhi Du, Yinfei Yang:
MOFI: Learning Image Representations from Noisy Entity Annotated Images. ICLR 2024 - [c20]Haoxuan You, Haotian Zhang, Zhe Gan, Xianzhi Du, Bowen Zhang, Zirui Wang, Liangliang Cao, Shih-Fu Chang, Yinfei Yang:
Ferret: Refer and Ground Anything Anywhere at Any Granularity. ICLR 2024 - [c19]Zhengfeng Lai, Haoping Bai, Haotian Zhang, Xianzhi Du, Jiulong Shan, Yinfei Yang, Chen-Nee Chuah, Meng Cao:
Empowering Unsupervised Domain Adaptation with Large-scale Pre-trained Vision-Language Models. WACV 2024: 2679-2689 - [i27]Brandon McKinzie, Zhe Gan, Jean-Philippe Fauconnier, Sam Dodge, Bowen Zhang, Philipp Dufter, Dhruti Shah, Xianzhi Du, Futang Peng, Floris Weers, Anton Belyi, Haotian Zhang, Karanjeet Singh, Doug Kang, Ankur Jain, Hongyu Hè, Max Schwarzer, Tom Gunter, Xiang Kong, Aonan Zhang, Jianyu Wang, Chong Wang, Nan Du, Tao Lei, Sam Wiseman, Guoli Yin, Mark Lee, Zirui Wang, Ruoming Pang, Peter Grasch, Alexander Toshev, Yinfei Yang:
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training. CoRR abs/2403.09611 (2024) - [i26]Xianzhi Du, Tom Gunter, Xiang Kong, Mark Lee, Zirui Wang, Aonan Zhang, Nan Du, Ruoming Pang:
Revisiting MoE and Dense Speed-Accuracy Comparisons for LLM Training. CoRR abs/2405.15052 (2024) - [i25]Tom Gunter, Zirui Wang, Chong Wang, Ruoming Pang, Andy Narayanan, Aonan Zhang, Bowen Zhang, Chen Chen, Chung-Cheng Chiu, David Qiu, Deepak Gopinath, Dian Ang Yap, Dong Yin, Feng Nan, Floris Weers, Guoli Yin, Haoshuo Huang, Jianyu Wang, Jiarui Lu, John Peebles, Ke Ye, Mark Lee, Nan Du, Qibin Chen, Quentin Keunebroek, Sam Wiseman, Syd Evans, Tao Lei, Vivek Rathod, Xiang Kong, Xianzhi Du, Yanghao Li, Yongqiang Wang, Yuan Gao, Zaid Ahmed, Zhaoyang Xu, Zhiyun Lu, Al Rashid, Albin Madappally Jose, Alec Doane, Alfredo Bencomo, Allison Vanderby, Andrew Hansen, Ankur Jain, Anupama Mann Anupama, Areeba Kamal, Bugu Wu, Carolina Brum, Charlie Maalouf, Chinguun Erdenebileg, Chris Dulhanty, Dominik Moritz, Doug Kang, Eduardo Jimenez, Evan Ladd, Fangping Shi, Felix Bai, Frank Chu, Fred Hohman, Hadas Kotek, Hannah Gillis Coleman, Jane Li, Jeffrey P. Bigham, Jeffery Cao, Jeff Lai, Jessica Cheung, Jiulong Shan, Joe Zhou, John Li, Jun Qin, Karanjeet Singh, Karla Vega, Kelvin Zou, Laura Heckman, Lauren Gardiner, Margit Bowler, Maria Cordell, Meng Cao, Nicole Hay, Nilesh Shahdadpuri, Otto Godwin, Pranay Dighe, Pushyami Rachapudi, Ramsey Tantawi, Roman Frigg, Sam Davarnia, Sanskruti Shah, Saptarshi Guha, Sasha Sirovica, Shen Ma, Shuang Ma, Simon Wang, Sulgi Kim, Suma Jayaram, Vaishaal Shankar, Varsha Paidi, Vivek Kumar, Xin Wang, Xin Zheng, Walker Cheng:
Apple Intelligence Foundation Language Models. CoRR abs/2407.21075 (2024) - [i24]Haotian Zhang, Mingfei Gao, Zhe Gan, Philipp Dufter, Nina Wenzel, Forrest Huang, Dhruti Shah, Xianzhi Du, Bowen Zhang, Yanghao Li, Sam Dodge, Keen You, Zhen Yang, Aleksei Timofeev, Mingze Xu, Hong-You Chen, Jean-Philippe Fauconnier, Zhengfeng Lai, Haoxuan You, Zirui Wang, Afshin Dehghan, Peter Grasch, Yinfei Yang:
MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning. CoRR abs/2409.20566 (2024) - 2023
- [c18]Tianlong Chen, Xuxi Chen, Xianzhi Du, Abdullah Rashwan, Fan Yang, Huizhong Chen, Zhangyang Wang, Yeqing Li:
AdaMV-MoE: Adaptive Multi-Task Vision Mixture-of-Experts. ICCV 2023: 17300-17311 - [c17]Xianzhi Du, Bing Zhu, Zhihang Deng, Kenneth W. Shum, Weiping Wang:
ISAA: Boost Repair Process by Constructing the Degree Constrained Optimal Repair Tree for Erasure-coded Systems. ICPADS 2023: 194-199 - [i23]Liangliang Cao, Bowen Zhang, Chen Chen, Yinfei Yang, Xianzhi Du, Wencong Zhang, Zhiyun Lu, Yantao Zheng:
Less is More: Removing Text-regions Improves CLIP Training Efficiency and Robustness. CoRR abs/2305.05095 (2023) - [i22]Wentao Wu, Aleksei Timofeev, Chen Chen, Bowen Zhang, Kun Duan, Shuangning Liu, Yantao Zheng, Jonathon Shlens, Xianzhi Du, Zhe Gan, Yinfei Yang:
MOFI: Learning Image Representations from Noisy Entity Annotated Images. CoRR abs/2306.07952 (2023) - [i21]Erik A. Daxberger, Floris Weers, Bowen Zhang, Tom Gunter, Ruoming Pang, Marcin Eichner, Michael Emmersberger, Yinfei Yang, Alexander Toshev, Xianzhi Du:
Mobile V-MoEs: Scaling Down Vision Transformers via Sparse Mixture-of-Experts. CoRR abs/2309.04354 (2023) - [i20]Tsu-Jui Fu, Wenze Hu, Xianzhi Du, William Yang Wang, Yinfei Yang, Zhe Gan:
Guiding Instruction-based Image Editing via Multimodal Large Language Models. CoRR abs/2309.17102 (2023) - [i19]Ajay Jaiswal, Zhe Gan, Xianzhi Du, Bowen Zhang, Zhangyang Wang, Yinfei Yang:
Compressing LLMs: The Truth is Rarely Pure and Never Simple. CoRR abs/2310.01382 (2023) - [i18]Zhengfeng Lai, Haotian Zhang, Wentao Wu, Haoping Bai, Aleksei Timofeev, Xianzhi Du, Zhe Gan, Jiulong Shan, Chen-Nee Chuah, Yinfei Yang, Meng Cao:
From Scarcity to Efficiency: Improving CLIP Training via Visual-enriched Captions. CoRR abs/2310.07699 (2023) - [i17]Haoxuan You, Haotian Zhang, Zhe Gan, Xianzhi Du, Bowen Zhang, Zirui Wang, Liangliang Cao, Shih-Fu Chang, Yinfei Yang:
Ferret: Refer and Ground Anything Anywhere at Any Granularity. CoRR abs/2310.07704 (2023) - 2022
- [c16]Wuyang Chen, Xianzhi Du, Fan Yang, Lucas Beyer, Xiaohua Zhai, Tsung-Yi Lin, Huizhong Chen, Jing Li, Xiaodan Song, Zhangyang Wang, Denny Zhou:
A Simple Single-Scale Vision Transformer for Object Detection and Instance Segmentation. ECCV (10) 2022: 711-727 - [c15]Zhihang Deng, Bing Zhu, Xianzhi Du, Kenneth W. Shum:
A Genetic Algorithm-based Construction of Fractional Repetition Codes. GLOBECOM 2022: 4244-4249 - [c14]Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wang, Denny Zhou:
Auto-scaling Vision Transformers without Training. ICLR 2022 - [c13]Zhuoning Yuan, Yuexin Wu, Zi-Hao Qiu, Xianzhi Du, Lijun Zhang, Denny Zhou, Tianbao Yang:
Provable Stochastic Optimization for Global Contrastive Learning: Small Batch Does Not Harm Performance. ICML 2022: 25760-25782 - [c12]Ziyu Jiang, Xuxi Chen, Xueqin Huang, Xianzhi Du, Denny Zhou, Zhangyang Wang:
Back Razor: Memory-Efficient Transfer Learning by Self-Sparsified Backpropagation. NeurIPS 2022 - [i16]Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wang, Denny Zhou:
Auto-scaling Vision Transformers without Training. CoRR abs/2202.11921 (2022) - [i15]Zhuoning Yuan, Yuexin Wu, Zi-Hao Qiu, Xianzhi Du, Lijun Zhang, Denny Zhou, Tianbao Yang:
Provable Stochastic Optimization for Global Contrastive Learning: Small Batch Does Not Harm Performance. CoRR abs/2202.12387 (2022) - [i14]Xianzhi Du, Wei-Chih Hung, Tsung-Yi Lin:
Optimizing Anchor-based Detectors for Autonomous Driving Scenes. CoRR abs/2208.06062 (2022) - 2021
- [c11]Irwan Bello, William Fedus, Xianzhi Du, Ekin Dogus Cubuk, Aravind Srinivas, Tsung-Yi Lin, Jonathon Shlens, Barret Zoph:
Revisiting ResNets: Improved Training and Scaling Strategies. NeurIPS 2021: 22614-22627 - [i13]Irwan Bello, William Fedus, Xianzhi Du, Ekin D. Cubuk, Aravind Srinivas, Tsung-Yi Lin, Jonathon Shlens, Barret Zoph:
Revisiting ResNets: Improved Training and Scaling Strategies. CoRR abs/2103.07579 (2021) - [i12]Abdullah Rashwan, Xianzhi Du, Xiaoqi Yin, Jing Li:
Dilated SpineNet for Semantic Segmentation. CoRR abs/2103.12270 (2021) - [i11]Xianzhi Du, Barret Zoph, Wei-Chih Hung, Tsung-Yi Lin:
Simple Training Strategies and Model Scaling for Object Detection. CoRR abs/2107.00057 (2021) - [i10]Xianzhi Du, Yeqing Li, Yin Cui, Rui Qian, Jing Li, Irwan Bello:
Revisiting 3D ResNets for Video Recognition. CoRR abs/2109.01696 (2021) - [i9]Qing Li, Boqing Gong, Yin Cui, Dan Kondratyuk, Xianzhi Du, Ming-Hsuan Yang, Matthew Brown:
Towards a Unified Foundation Model: Jointly Pre-Training Transformers on Unpaired Images and Text. CoRR abs/2112.07074 (2021) - [i8]Wuyang Chen, Xianzhi Du, Fan Yang, Lucas Beyer, Xiaohua Zhai, Tsung-Yi Lin, Huizhong Chen, Jing Li, Xiaodan Song, Zhangyang Wang, Denny Zhou:
A Simple Single-Scale Vision Transformer for Object Localization and Instance Segmentation. CoRR abs/2112.09747 (2021) - 2020
- [j2]Mostafa El-Khamy, Haoyu Ren, Xianzhi Du, Jungwon Lee:
Multitask Deep Neural Networks for Tele-Wide Stereo Matching. IEEE Access 8: 184383-184398 (2020) - [c10]Xianzhi Du, Tsung-Yi Lin, Pengchong Jin, Golnaz Ghiasi, Mingxing Tan, Yin Cui, Quoc V. Le, Xiaodan Song:
SpineNet: Learning Scale-Permuted Backbone for Recognition and Localization. CVPR 2020: 11589-11598 - [c9]Xianzhi Du, Tsung-Yi Lin, Pengchong Jin, Yin Cui, Mingxing Tan, Quoc V. Le, Xiaodan Song:
Efficient Scale-Permuted Backbone with Learned Resource Distribution. ECCV (23) 2020: 572-586 - [c8]Xianzhi Du, Mostafa El-Khamy, Jungwon Lee:
FBA-AMNET: Foreground-Background Aware Atrous Multiscale Networks for Stereo Disparity Estimation. ICCE 2020: 1-2 - [i7]Xianzhi Du, Tsung-Yi Lin, Pengchong Jin, Yin Cui, Mingxing Tan, Quoc V. Le, Xiaodan Song:
Efficient Scale-Permuted Backbone with Learned Resource Distribution. CoRR abs/2010.11426 (2020)
2010 – 2019
- 2019
- [c7]Xianzhi Du, Xiaolong Wang, Dawei Li, Jingwen Zhu, Serafettin Tasci, Cameron Upright, Stephen Walsh, Larry S. Davis:
Boundary-sensitive Network for Portrait Segmentation. FG 2019: 1-8 - [c6]Mostafa El-Khamy, Xianzhi Du, Haoyu Ren, Jungwon Lee:
Multi-Task Learning of Depth from Tele and Wide Stereo Image Pairs. ICIP 2019: 4300-4304 - [i6]Xianzhi Du, Mostafa El-Khamy, Jungwon Lee:
AMNet: Deep Atrous Multiscale Stereo Disparity Estimation Networks. CoRR abs/1904.09099 (2019) - [i5]Mostafa El-Khamy, Haoyu Ren, Xianzhi Du, Jungwon Lee:
TW-SMNet: Deep Multitask Learning of Tele-Wide Stereo Matching. CoRR abs/1906.04463 (2019) - [i4]Xianzhi Du, Tsung-Yi Lin, Pengchong Jin, Golnaz Ghiasi, Mingxing Tan, Yin Cui, Quoc V. Le, Xiaodan Song:
SpineNet: Learning Scale-Permuted Backbone for Recognition and Localization. CoRR abs/1912.05027 (2019) - 2018
- [i3]Xianzhi Du, Mostafa El-Khamy, Vlad I. Morariu, Jungwon Lee, Larry S. Davis:
Fused Deep Neural Networks for Efficient Pedestrian Detection. CoRR abs/1805.08688 (2018) - 2017
- [b1]Xianzhi Du:
Computer Vision and Deep Learning with Applications to Object Detection, Segmentation, and Document Analysis. University of Maryland, College Park, MD, USA, 2017 - [c5]Baiyu Chen, Zhengyu Yang, Siyu Huang, Xianzhi Du, Zhiwei Cui, Janki Bhimani, Xin Xie, Ningfang Mi:
Cyber-physical system enabled nearby traffic flow modelling for autonomous vehicles. IPCCC 2017: 1-6 - [c4]Xianzhi Du, Mostafa El-Khamy, Jungwon Lee, Larry S. Davis:
Fused DNN: A Deep Neural Network Fusion Approach to Fast and Robust Pedestrian Detection. WACV 2017: 953-961 - [i2]Xianzhi Du, Xiaolong Wang, Dawei Li, Jingwen Zhu, Serafettin Tasci, Cameron Upright, Stephen Walsh, Larry S. Davis:
Boundary-sensitive Network for Portrait Segmentation. CoRR abs/1712.08675 (2017) - 2016
- [i1]Xianzhi Du, Mostafa El-Khamy, Jungwon Lee, Larry S. Davis:
Fused DNN: A deep neural network fusion approach to fast and robust pedestrian detection. CoRR abs/1610.03466 (2016) - 2015
- [c3]Xianzhi Du, David S. Doermann, Wael Abd-Almageed:
A graphical model approach for matching partial signatures. CVPR 2015: 1465-1472 - 2014
- [c2]Xianzhi Du, David S. Doermann, Wael Abd-Almageed:
Signature Matching Using Supervised Topic Models. ICPR 2014: 327-332 - 2013
- [c1]Xianzhi Du, Wael Abd-Almageed, David S. Doermann:
Large-Scale Signature Matching Using Multi-stage Hashing. ICDAR 2013: 976-980 - 2011
- [j1]Haiyan Jin, Xianzhi Du, Fulin Xiao, Guangjun Wen:
A Novel Wideband Spatial Power Combining Amplifier Based on Turnstile-Junction Waveguide Divider/Combiner. IEICE Trans. Electron. 94-C(9): 1479-1482 (2011)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-15 19:31 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint