Po-Yao Huang 0001

Bernie Huang – Bernie Po-Yao Huang – Poyao Huang 0001

Person information

affiliation: Facebook AI
affiliation: Carnegie Mellon University, School of Computer Science, Pittsburgh, PA, USA

2020 – today

2024
[j5]
persistent URL: https://dblp.org/rec/journals/tmlr/OquabDMVSKFHMEA24
Maxime Oquab, Timothée Darcet, Théo Moutakanni, Huy V. Vo, Marc Szafraniec, Vasil Khalidov, Pierre Fernandez, Daniel Haziza, Francisco Massa, Alaaeldin El-Nouby, Mido Assran, Nicolas Ballas, Wojciech Galuba, Russell Howes, Po-Yao Huang, Shang-Wen Li, Ishan Misra, Michael Rabbat, Vasu Sharma, Gabriel Synnaeve, Hu Xu, Hervé Jégou, Julien Mairal, Patrick Labatut, Armand Joulin, Piotr Bojanowski:
DINOv2: Learning Robust Visual Features without Supervision. Trans. Mach. Learn. Res. 2024 (2024)
[c41]
persistent URL: https://dblp.org/rec/conf/acl/Peng00MH24
Puyuan Peng, Po-Yao Huang, Shang-Wen Li, Abdelrahman Mohamed, David Harwath:
VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild. ACL (1) 2024: 12442-12462
[c40]
persistent URL: https://dblp.org/rec/conf/cvpr/Ma0X0ZCY024
Jiawei Ma, Po-Yao Huang, Saining Xie, Shang-Wen Li, Luke Zettlemoyer, Shih-Fu Chang, Wen-Tau Yih, Hu Xu:
MoDE: CLIP Data Experts via Clustering. CVPR 2024: 26344-26353
[c39]
persistent URL: https://dblp.org/rec/conf/eccv/LiWHOA24
Tingle Li, Renhao Wang, Po-Yao Huang, Andrew Owens, Gopala Anumanchipalli:
Self-Supervised Audio-Visual Soundscape Stylization. ECCV (80) 2024: 20-40
[c38]
persistent URL: https://dblp.org/rec/conf/emnlp/00010TYKJGLZY0X24
Hu Xu, Po-Yao Huang, Xiaoqing Ellen Tan, Ching-Feng Yeh, Jacob Kahn, Christine Jou, Gargi Ghosh, Omer Levy, Luke Zettlemoyer, Wen-tau Yih, Shang-Wen Li, Saining Xie, Christoph Feichtenhofer:
Altogether: Image Captioning via Re-aligning Alt-text. EMNLP 2024: 19302-19318
[c37]
persistent URL: https://dblp.org/rec/conf/icassp/TsengBCCLLPSWW024
Yuan Tseng, Layne Berry, Yiting Chen, I-Hsiang Chiu, Hsuan-Hao Lin, Max Liu, Puyuan Peng, Yi-Jen Shih, Hung-Yu Wang, Haibin Wu, Poyao Huang, Chun-Mao Lai, Shang-Wen Li, David Harwath, Yu Tsao, Abdelrahman Mohamed, Chi-Luen Feng, Hung-Yi Lee:
AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models. ICASSP 2024: 6890-6894
[c36]
persistent URL: https://dblp.org/rec/conf/iclr/0001XT0HS0GZF24
Hu Xu, Saining Xie, Xiaoqing Ellen Tan, Po-Yao Huang, Russell Howes, Vasu Sharma, Shang-Wen Li, Gargi Ghosh, Luke Zettlemoyer, Christoph Feichtenhofer:
Demystifying CLIP Data. ICLR 2024
[i33]
persistent URL: https://dblp.org/rec/journals/corr/abs-2403-16242
Xiaoyu Zhu, Junwei Liang, Po-Yao Huang, Alex Hauptmann:
Adversarially Masked Video Consistency for Unsupervised Domain Adaptation. CoRR abs/2403.16242 (2024)
[i32]
persistent URL: https://dblp.org/rec/journals/corr/abs-2403-16973
Puyuan Peng, Po-Yao Huang, Daniel Li, Abdelrahman Mohamed, David Harwath:
VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild. CoRR abs/2403.16973 (2024)
[i31]
persistent URL: https://dblp.org/rec/journals/corr/abs-2404-16030
Jiawei Ma, Po-Yao Huang, Saining Xie, Shang-Wen Li, Luke Zettlemoyer, Shih-Fu Chang, Wen-Tau Yih, Hu Xu:
MoDE: CLIP Data Experts via Clustering. CoRR abs/2404.16030 (2024)
[i30]
persistent URL: https://dblp.org/rec/journals/corr/abs-2405-01582
Vasu Sharma, Karthik Padthe, Newsha Ardalani, Kushal Tirumala, Russell Howes, Hu Xu, Po-Yao Huang, Shang-Wen Li, Armen Aghajanyan, Gargi Ghosh, Luke Zettlemoyer:
Text Quality-Based Pruning for Efficient Training of Language Models. CoRR abs/2405.01582 (2024)
[i29]
persistent URL: https://dblp.org/rec/journals/corr/abs-2409-14340
Tingle Li, Renhao Wang, Po-Yao Huang, Andrew Owens, Gopala Anumanchipalli:
Self-Supervised Audio-Visual Soundscape Stylization. CoRR abs/2409.14340 (2024)
2023
[j4]
persistent URL: https://dblp.org/rec/journals/pami/LiHCHYH23
Mingjie Li, Po-Yao Huang, Xiaojun Chang, Junjie Hu, Yi Yang, Alex Hauptmann:
Video Pivoting Unsupervised Multi-Modal Machine Translation. IEEE Trans. Pattern Anal. Mach. Intell. 45(3): 3918-3932 (2023)
[c35]
persistent URL: https://dblp.org/rec/conf/acl/YuYLMN0KFW23
Tiezheng Yu, Hanchao Yu, Davis Liang, Yuning Mao, Shaoliang Nie, Po-Yao Huang, Madian Khabsa, Pascale Fung, Yi-Chia Wang:
Generating Hashtags for Short-form Videos with Guided Signals. ACL (1) 2023: 9482-9495
[c34]
persistent URL: https://dblp.org/rec/conf/asru/YehHSLG23
Ching-Feng Yeh, Po-Yao Huang, Vasu Sharma, Shang-Wen Li, Gargi Ghosh:
Flap: Fast Language-Audio Pre-Training. ASRU 2023: 1-8
[c33]
persistent URL: https://dblp.org/rec/conf/cvpr/Zhu00MH23
Xiaoyu Zhu, Po-Yao Huang, Junwei Liang, Celso M. de Melo, Alexander G. Hauptmann:
STMT: A Spatial-Temporal Mesh Transformer for MoCap-Based Action Recognition. CVPR 2023: 1526-1536
[c32]
persistent URL: https://dblp.org/rec/conf/iccv/0001X0YHGZF23
Hu Xu, Saining Xie, Po-Yao Huang, Licheng Yu, Russell Howes, Gargi Ghosh, Luke Zettlemoyer, Christoph Feichtenhofer:
CiT: Curation in Training for Effective Vision-Language Data. ICCV 2023: 15134-15143
[c31]
persistent URL: https://dblp.org/rec/conf/iccv/0005M0L00WXYF23
Chen Wei, Karttikeya Mangalam, Po-Yao Huang, Yanghao Li, Haoqi Fan, Hu Xu, Huiyu Wang, Cihang Xie, Alan L. Yuille, Christoph Feichtenhofer:
Diffusion Models as Masked Autoencoders. ICCV 2023: 16238-16248
[c30]
persistent URL: https://dblp.org/rec/conf/icml/RyaliHB000ACPHM23
Chaitanya Ryali, Yuan-Ting Hu, Daniel Bolya, Chen Wei, Haoqi Fan, Po-Yao Huang, Vaibhav Aggarwal, Arkabandhu Chowdhury, Omid Poursaeed, Judy Hoffman, Jitendra Malik, Yanghao Li, Christoph Feichtenhofer:
Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles. ICML 2023: 29441-29454
[c29]
persistent URL: https://dblp.org/rec/conf/nips/0001S0R0L0GMF23
Po-Yao Huang, Vasu Sharma, Hu Xu, Chaitanya Ryali, Haoqi Fan, Yanghao Li, Shang-Wen Li, Gargi Ghosh, Jitendra Malik, Christoph Feichtenhofer:
MAViL: Masked Audio-Video Learners. NeurIPS 2023
[i28]
persistent URL: https://dblp.org/rec/journals/corr/abs-2301-02241
Hu Xu, Saining Xie, Po-Yao Huang, Licheng Yu, Russell Howes, Gargi Ghosh, Luke Zettlemoyer, Christoph Feichtenhofer:
CiT: Curation in Training for Effective Vision-Language Data. CoRR abs/2301.02241 (2023)
[i27]
persistent URL: https://dblp.org/rec/journals/corr/abs-2303-18177
Xiaoyu Zhu, Po-Yao Huang, Junwei Liang, Celso M. de Melo, Alexander G. Hauptmann:
STMT: A Spatial-Temporal Mesh Transformer for MoCap-Based Action Recognition. CoRR abs/2303.18177 (2023)
[i26]
persistent URL: https://dblp.org/rec/journals/corr/abs-2304-03283
Chen Wei, Karttikeya Mangalam, Po-Yao Huang, Yanghao Li, Haoqi Fan, Hu Xu, Huiyu Wang, Cihang Xie, Alan L. Yuille, Christoph Feichtenhofer:
Diffusion Models as Masked Autoencoders. CoRR abs/2304.03283 (2023)
[i25]
persistent URL: https://dblp.org/rec/journals/corr/abs-2304-07193
Maxime Oquab, Timothée Darcet, Théo Moutakanni, Huy V. Vo, Marc Szafraniec, Vasil Khalidov, Pierre Fernandez, Daniel Haziza, Francisco Massa, Alaaeldin El-Nouby, Mahmoud Assran, Nicolas Ballas, Wojciech Galuba, Russell Howes, Po-Yao Huang, Shang-Wen Li, Ishan Misra, Michael G. Rabbat, Vasu Sharma, Gabriel Synnaeve, Hu Xu, Hervé Jégou, Julien Mairal, Patrick Labatut, Armand Joulin, Piotr Bojanowski:
DINOv2: Learning Robust Visual Features without Supervision. CoRR abs/2304.07193 (2023)
[i24]
persistent URL: https://dblp.org/rec/journals/corr/abs-2306-00989
Chaitanya Ryali, Yuan-Ting Hu, Daniel Bolya, Chen Wei, Haoqi Fan, Po-Yao Huang, Vaibhav Aggarwal, Arkabandhu Chowdhury, Omid Poursaeed, Judy Hoffman, Jitendra Malik, Yanghao Li, Christoph Feichtenhofer:
Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles. CoRR abs/2306.00989 (2023)
[i23]
persistent URL: https://dblp.org/rec/journals/corr/abs-2308-11596
Seamless Communication, Loïc Barrault, Yu-An Chung, Mariano Coria Meglioli, David Dale, Ning Dong, Paul-Ambroise Duquenne, Hady Elsahar, Hongyu Gong, Kevin Heffernan, John Hoffman, Christopher Klaiber, Pengwei Li, Daniel Licht, Jean Maillard, Alice Rakotoarison, Kaushik Ram Sadagopan, Guillaume Wenzek, Ethan Ye, Bapi Akula, Peng-Jen Chen, Naji El Hachem, Brian Ellis, Gabriel Mejia Gonzalez, Justin Haaheim, Prangthip Hansanti, Russ Howes, Bernie Huang, Min-Jae Hwang, Hirofumi Inaguma, Somya Jain, Elahe Kalbassi, Amanda Kallet, Ilia Kulikov, Janice Lam, Daniel Li, Xutai Ma, Ruslan Mavlyutov, Benjamin N. Peloquin, Mohamed Ramadan, Abinesh Ramakrishnan, Anna Y. Sun, Kevin Tran, Tuan Tran, Igor Tufanov, Vish Vogeti, Carleigh Wood, Yilin Yang, Bokai Yu, Pierre Andrews, Can Balioglu, Marta R. Costa-jussà, Onur Celebi, Maha Elbayad, Cynthia Gao, Francisco Guzmán, Justine Kao, Ann Lee, Alexandre Mourachko, Juan Pino, Sravya Popuri, Christophe Ropers, Safiyyah Saleem, Holger Schwenk, Paden Tomasello, Changhan Wang, Jeff Wang, Skyler Wang:
SeamlessM4T-Massively Multilingual & Multimodal Machine Translation. CoRR abs/2308.11596 (2023)
[i22]
persistent URL: https://dblp.org/rec/journals/corr/abs-2309-10787
Yuan Tseng, Layne Berry, Yi-Ting Chen, I-Hsiang Chiu, Hsuan-Hao Lin, Max Liu, Puyuan Peng, Yi-Jen Shih, Hung-Yu Wang, Haibin Wu, Po-Yao Huang, Chun-Mao Lai, Shang-Wen Li, David Harwath, Yu Tsao, Shinji Watanabe, Abdelrahman Mohamed, Chi-Luen Feng, Hung-yi Lee:
AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models. CoRR abs/2309.10787 (2023)
[i21]
persistent URL: https://dblp.org/rec/journals/corr/abs-2309-16671
Hu Xu, Saining Xie, Xiaoqing Ellen Tan, Po-Yao Huang, Russell Howes, Vasu Sharma, Shang-Wen Li, Gargi Ghosh, Luke Zettlemoyer, Christoph Feichtenhofer:
Demystifying CLIP Data. CoRR abs/2309.16671 (2023)
[i20]
persistent URL: https://dblp.org/rec/journals/corr/abs-2311-01615
Ching-Feng Yeh, Po-Yao Huang, Vasu Sharma, Shang-Wen Li, Gargi Ghosh:
FLAP: Fast Language-Audio Pre-training. CoRR abs/2311.01615 (2023)
2022
[j3]
persistent URL: https://dblp.org/rec/journals/csur/RenXCHLCW21
Pengzhen Ren, Yun Xiao, Xiaojun Chang, Poyao Huang, Zhihui Li, Xiaojiang Chen, Xin Wang:
A Comprehensive Survey of Neural Architecture Search: Challenges and Solutions. ACM Comput. Surv. 54(4): 76:1-76:34 (2022)
[j2]
persistent URL: https://dblp.org/rec/journals/csur/RenXCHLGCW22
Pengzhen Ren, Yun Xiao, Xiaojun Chang, Po-Yao Huang, Zhihui Li, Brij B. Gupta, Xiaojiang Chen, Xin Wang:
A Survey of Deep Active Learning. ACM Comput. Surv. 54(9): 180:1-180:40 (2022)
[c28]
persistent URL: https://dblp.org/rec/conf/icassp/LiQLHM22
Juncheng B. Li, Shuhui Qu, Xinjian Li, Bernie Po-Yao Huang, Florian Metze:
On Adversarial Robustness Of Large-Scale Audio Visual Learning. ICASSP 2022: 231-235
[c27]
persistent URL: https://dblp.org/rec/conf/interspeech/0001Q0M22
Juncheng Li, Shuhui Qu, Po-Yao Huang, Florian Metze:
AudioTagging Done Right: 2nd comparison of deep learning methods for environmental sound classification. INTERSPEECH 2022: 1521-1525
[c26]
persistent URL: https://dblp.org/rec/conf/nips/000100BAGMF22
Po-Yao Huang, Hu Xu, Juncheng Li, Alexei Baevski, Michael Auli, Wojciech Galuba, Florian Metze, Christoph Feichtenhofer:
Masked Autoencoders that Listen. NeurIPS 2022
[i19]
persistent URL: https://dblp.org/rec/journals/corr/abs-2201-07520
Armen Aghajanyan, Bernie Huang, Candace Ross, Vladimir Karpukhin, Hu Xu, Naman Goyal, Dmytro Okhonko, Mandar Joshi, Gargi Ghosh, Mike Lewis, Luke Zettlemoyer:
CM3: A Causal Masked Multimodal Model of the Internet. CoRR abs/2201.07520 (2022)
[i18]
persistent URL: https://dblp.org/rec/journals/corr/abs-2203-12122
Juncheng B. Li, Shuhui Qu, Xinjian Li, Po-Yao Huang, Florian Metze:
On Adversarial Robustness of Large-scale Audio Visual Learning. CoRR abs/2203.12122 (2022)
[i17]
persistent URL: https://dblp.org/rec/journals/corr/abs-2203-13448
Juncheng B. Li, Shuhui Qu, Po-Yao Huang, Florian Metze:
AudioTagging Done Right: 2nd comparison of deep learning methods for environmental sound classification. CoRR abs/2203.13448 (2022)
[i16]
persistent URL: https://dblp.org/rec/journals/corr/abs-2207-06405
Po-Yao Huang, Hu Xu, Juncheng Li, Alexei Baevski, Michael Auli, Wojciech Galuba, Florian Metze, Christoph Feichtenhofer:
Masked Autoencoders that Listen. CoRR abs/2207.06405 (2022)
[i15]
persistent URL: https://dblp.org/rec/journals/corr/abs-2212-08071
Po-Yao Huang, Vasu Sharma, Hu Xu, Chaitanya Ryali, Haoqi Fan, Yanghao Li, Shang-Wen Li, Gargi Ghosh, Jitendra Malik, Christoph Feichtenhofer:
MAViL: Masked Audio-Video Learners. CoRR abs/2212.08071 (2022)
2021
[j1]
persistent URL: https://dblp.org/rec/journals/tip/YuanCHLH21
Di Yuan, Xiaojun Chang, Po-Yao Huang, Qiao Liu, Zhenyu He:
Self-Supervised Deep Correlation Tracking. IEEE Trans. Image Process. 30: 976-985 (2021)
[c25]
persistent URL: https://dblp.org/rec/conf/acl/XuGHAAFMZ21
Hu Xu, Gargi Ghosh, Po-Yao Huang, Prahal Arora, Masoumeh Aminzadeh, Christoph Feichtenhofer, Florian Metze, Luke Zettlemoyer:
VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding. ACL/IJCNLP (Findings) 2021: 4227-4239
[c24]
persistent URL: https://dblp.org/rec/conf/emnlp/XuG0OAMZF21
Hu Xu, Gargi Ghosh, Po-Yao Huang, Dmytro Okhonko, Armen Aghajanyan, Florian Metze, Luke Zettlemoyer, Christoph Feichtenhofer:
VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding. EMNLP (1) 2021: 6787-6800
[c23]
persistent URL: https://dblp.org/rec/conf/icassp/LiMQ0M21
Juncheng B. Li, Kaixin Ma, Shuhui Qu, Po-Yao Huang, Florian Metze:
Audio-Visual Event Recognition Through the Lens of Adversary. ICASSP 2021: 616-620
[c22]
persistent URL: https://dblp.org/rec/conf/iccv/Patrick0MMVAH21
Mandela Patrick, Po-Yao Huang, Ishan Misra, Florian Metze, Andrea Vedaldi, Yuki M. Asano, João F. Henriques:
Space-Time Crop & Attend: Improving Cross-modal Video Representation Learning. ICCV 2021: 10540-10552
[c21]
persistent URL: https://dblp.org/rec/conf/iclr/Patrick0AMHHV21
Mandela Patrick, Po-Yao Huang, Yuki Markus Asano, Florian Metze, Alexander G. Hauptmann, João F. Henriques, Andrea Vedaldi:
Support-set bottlenecks for video-text representation learning. ICLR 2021
[c20]
persistent URL: https://dblp.org/rec/conf/naacl/HuangPHNMH21
Poyao Huang, Mandela Patrick, Junjie Hu, Graham Neubig, Florian Metze, Alex Hauptmann:
Multilingual Multimodal Pre-training for Zero-Shot Cross-Lingual Transfer of Vision-Language Models. NAACL-HLT 2021: 2443-2459
[i14]
persistent URL: https://dblp.org/rec/journals/corr/abs-2103-08849
Po-Yao Huang, Mandela Patrick, Junjie Hu, Graham Neubig, Florian Metze, Alexander G. Hauptmann:
Multilingual Multimodal Pre-training for Zero-Shot Cross-Lingual Transfer of Vision-Language Models. CoRR abs/2103.08849 (2021)
[i13]
persistent URL: https://dblp.org/rec/journals/corr/abs-2103-10211
Mandela Patrick, Yuki Markus Asano, Bernie Huang, Ishan Misra, Florian Metze, João F. Henriques, Andrea Vedaldi:
Space-Time Crop & Attend: Improving Cross-modal Video Representation Learning. CoRR abs/2103.10211 (2021)
[i12]
persistent URL: https://dblp.org/rec/journals/corr/abs-2105-09996
Hu Xu, Gargi Ghosh, Po-Yao Huang, Prahal Arora, Masoumeh Aminzadeh, Christoph Feichtenhofer, Florian Metze, Luke Zettlemoyer:
VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding. CoRR abs/2105.09996 (2021)
[i11]
persistent URL: https://dblp.org/rec/journals/corr/abs-2109-14084
Hu Xu, Gargi Ghosh, Po-Yao Huang, Dmytro Okhonko, Armen Aghajanyan, Florian Metze, Luke Zettlemoyer, Christoph Feichtenhofer:
VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding. CoRR abs/2109.14084 (2021)
2020
[c19]
persistent URL: https://dblp.org/rec/conf/acl/HuangHCH20
Po-Yao Huang, Junjie Hu, Xiaojun Chang, Alexander G. Hauptmann:
Unsupervised Multimodal Neural Machine Translation with Pseudo Visual Pivoting. ACL 2020: 8226-8237
[c18]
persistent URL: https://dblp.org/rec/conf/mir/0001CHH20
Po-Yao Huang, Xiaojun Chang, Alexander G. Hauptmann, Eduard H. Hovy:
Forward and Backward Multimodal NMT for Improved Monolingual and Multilingual Cross-Modal Retrieval. ICMR 2020: 53-62
[c17]
persistent URL: https://dblp.org/rec/conf/wacv/LiuK0CYQ0GWCH20
Wenhe Liu, Guoliang Kang, Po-Yao Huang, Xiaojun Chang, Lijun Yu, Yijun Qian, Junwei Liang, Liangke Gui, Jing Wen, Peng Chen, Alexander G. Hauptmann:
Argus: Efficient Activity Detection System for Extended Video Analysis. WACV Workshops 2020: 126-133
[i10]
persistent URL: https://dblp.org/rec/journals/corr/abs-2005-03119
Po-Yao Huang, Junjie Hu, Xiaojun Chang, Alexander G. Hauptmann:
Unsupervised Multimodal Neural Machine Translation with Pseudo Visual Pivoting. CoRR abs/2005.03119 (2020)
[i9]
persistent URL: https://dblp.org/rec/journals/corr/abs-2006-02903
Pengzhen Ren, Yun Xiao, Xiaojun Chang, Po-Yao Huang, Zhihui Li, Xiaojiang Chen, Xin Wang:
A Comprehensive Survey of Neural Architecture Search: Challenges and Solutions. CoRR abs/2006.02903 (2020)
[i8]
persistent URL: https://dblp.org/rec/journals/corr/abs-2009-00236
Pengzhen Ren, Yun Xiao, Xiaojun Chang, Po-Yao Huang, Zhihui Li, Xiaojiang Chen, Xin Wang:
A Survey of Deep Active Learning. CoRR abs/2009.00236 (2020)
[i7]
persistent URL: https://dblp.org/rec/journals/corr/abs-2010-02824
Mandela Patrick, Po-Yao Huang, Yuki Markus Asano, Florian Metze, Alexander G. Hauptmann, João F. Henriques, Andrea Vedaldi:
Support-set bottlenecks for video-text representation learning. CoRR abs/2010.02824 (2020)
[i6]
persistent URL: https://dblp.org/rec/journals/corr/abs-2011-07430
Juncheng B. Li, Kaixin Ma, Shuhui Qu, Po-Yao Huang, Florian Metze:
Audio-Visual Event Recognition through the lens of Adversary. CoRR abs/2011.07430 (2020)

2010 – 2019

2019
[c16]
persistent URL: https://dblp.org/rec/conf/emnlp/HuangCH19
Po-Yao Huang, Xiaojun Chang, Alexander G. Hauptmann:
Multi-Head Attention with Diversity for Learning Grounded Multilingual Multimodal Representations. EMNLP/IJCNLP (1) 2019: 1461-1467
[c15]
persistent URL: https://dblp.org/rec/conf/mir/HuangVCH19
Po-Yao Huang, Vaibhav, Xiaojun Chang, Alexander G. Hauptmann:
Improving What Cross-Modal Retrieval Models Learn through Object-Oriented Inter- and Intra-Modal Attention Networks. ICMR 2019: 244-252
[c14]
persistent URL: https://dblp.org/rec/conf/mm/HuangKLCH19
Po-Yao Huang, Guoliang Kang, Wenhe Liu, Xiaojun Chang, Alexander G. Hauptmann:
Annotation Efficient Cross-Modal Retrieval with Adversarial Attentive Alignment. ACM Multimedia 2019: 1758-1767
[c13]
persistent URL: https://dblp.org/rec/conf/trec/ZhouWHH19
Junpei Zhou, Xinyu Wang, Po-Yao Huang, Alexander G. Hauptmann:
CMU-Informedia at TREC 2019 Incident Streams Track. TREC 2019
[c12]
persistent URL: https://dblp.org/rec/conf/trecvid/ChangLHLZHLMHK019
Xiaojun Chang, Wenhe Liu, Po-Yao Huang, Changlin Li, Fengda Zhu, Mingfei Han, Mingjie Li, Mengyuan Ma, Siyi Hu, Guoliang Kang, Junwei Liang, Liangke Gui, Lijun Yu, Yijun Qian, Jing Wen, Alexander G. Hauptmann:
MMVG-INF-Etrol@TRECVID 2019: Activities in Extended Video. TRECVID 2019
[i5]
persistent URL: https://dblp.org/rec/conf/tac/HovyCCGHMMSDCCK19
Eduard H. Hovy, Jaime G. Carbonell, Hans Chalupsky, Anatole Gershman, Alex Hauptmann, Florian Metze, Teruko Mitamura, Zaid Sheikh, Ankit Dangi, Aditi Chaudhary, Xianyang Chen, Xiang Kong, Bernie Huang, Salvador Medina, Hector Liu, Xuezhe Ma, Maria Ryskina, Ramon Sanabria, Varun Gangal:
OPERA: Operations-oriented Probabilistic Extraction, Reasoning, and Analysis. TAC 2019
[i4]
persistent URL: https://dblp.org/rec/journals/corr/abs-1908-04003
Vaibhav, Po-Yao Huang, Robert E. Frederking:
RWR-GAE: Random Walk Regularization for Graph Auto Encoders. CoRR abs/1908.04003 (2019)
[i3]
persistent URL: https://dblp.org/rec/journals/corr/abs-1910-00058
Po-Yao Huang, Xiaojun Chang, Alexander G. Hauptmann:
Multi-Head Attention with Diversity for Learning Grounded Multilingual Multimodal Representations. CoRR abs/1910.00058 (2019)
2018
[c11]
persistent URL: https://dblp.org/rec/conf/eccv/ChangHSLYH18
Xiaojun Chang, Po-Yao Huang, Yi-Dong Shen, Xiaodan Liang, Yi Yang, Alexander G. Hauptmann:
RCAA: Relational Context-Aware Agents for Person Search. ECCV (9) 2018: 86-102
[c10]
persistent URL: https://dblp.org/rec/conf/mir/HuangLLH18
Po-Yao Huang, Junwei Liang, Jean-Baptiste Lamare, Alexander G. Hauptmann:
Multimodal Filtering of Social Media for Temporal Monitoring and Event Analysis. ICMR 2018: 450-457
[c9]
persistent URL: https://dblp.org/rec/conf/siggrapha/HuangLCLO18
Po-Yao Huang, Hong Shiang Lin, Sun-Yu Gordon Chi, Liang-Han Lin, Ming Ouhyoung:
Panoramic depth reconstruction within a single shot by optimizing global sphere radii. SIGGRAPH ASIA Posters 2018: 80:1-80:2
[c8]
persistent URL: https://dblp.org/rec/conf/trecvid/ChenCJH00VCLHLK18
Jia Chen, Shizhe Chen, Qin Jin, Alexander G. Hauptmann, Po-Yao Huang, Junwei Liang, Vaibhav, Xiaojun Chang, Jiang Liu, Ting-Yao Hu, Wenhe Liu, Wei Ke, Wayner Barrios, Haroon Idrees, Donghyun Yoo, Yaser Sheikh, Ruslan Salakhutdinov, Kris Kitani, Dong Huang:
Informedia @ TRECVID 2018: Ad-hoc Video Search, Video to Text Description, Activities in Extended video. TRECVID 2018
[i2]
  dblp key:
  - conf/tac/HovyBCCGHMMCCHL18
  persistent URL:
  - https://dblp.org/rec/conf/tac/HovyBCCGHMMCCHL18
Eduard H. Hovy, Taylor Berg-Kirkpatrick, Jaime G. Carbonell, Hans Chalupsky, Anatole Gershman, Alexander G. Hauptmann, Florian Metze, Teruko Mitamura, Aditi Chaudhary, Xianyang Chen, Bernie Po-Yao Huang, Hector Zhengzhong Liu, Xuezhe Ma, Shruti Palaskar, Dheeraj Rajagopal, Maria Ryskina, Ramon Sanabria:
OPERA: Operations-oriented Probabilistic Extraction, Reasoning, and Analysis. TAC 2018
2017
[c7]
  dblp key:
  - conf/aaai/LiangFLHCJH17
  persistent URL:
  - https://dblp.org/rec/conf/aaai/LiangFLHCJH17
Junwei Liang, Desai Fan, Han Lu, Poyao Huang, Jia Chen, Lu Jiang, Alexander G. Hauptmann:
An Event Reconstruction Tool for Conflict Monitoring Using Social Media. AAAI 2017: 5097-5098
[c6]
  dblp key:
  - conf/icassp/LiangHCH17
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiangHCH17
Junwei Liang, Poyao Huang, Jia Chen, Alexander G. Hauptmann:
Synchronization for multi-perspective videos in the wild. ICASSP 2017: 1592-1596
[i1]
  dblp key:
  - journals/corr/HuangYLJH17
  persistent URL:
  - https://dblp.org/rec/journals/corr/HuangYLJH17
Poyao Huang, Ye Yuan, Zhen-Zhong Lan, Lu Jiang, Alexander G. Hauptmann:
Video Representation Learning and Latent Concept Mining for Large-scale Multi-label Video Classification. CoRR abs/1707.01408 (2017)
2016
[c5]
  dblp key:
  - conf/trecvid/0001C0L0LPFJSC016
  persistent URL:
  - https://dblp.org/rec/conf/trecvid/0001C0L0LPFJSC016
Junwei Liang, Jia Chen, Poyao Huang, Xuanchong Li, Lu Jiang, Zhenzhong Lan, Pingbo Pan, Hehe Fan, Qin Jin, Jiande Sun, Yang Chen, Yi Yang, Alexander G. Hauptmann:
Informedia @ TRECVID 2016. TRECVID 2016
[c4]
  dblp key:
  - conf/wmt/HuangLSOD16
  persistent URL:
  - https://dblp.org/rec/conf/wmt/HuangLSOD16
Po-Yao Huang, Frederick Liu, Sz-Rung Shiang, Jean Oh, Chris Dyer:
Attention-based Multimodal Neural Machine Translation. WMT 2016: 639-645
2015
[c3]
  dblp key:
  - conf/acl/HuHDGX15
  persistent URL:
  - https://dblp.org/rec/conf/acl/HuHDGX15
Zhiting Hu, Poyao Huang, Yuntian Deng, Yingkai Gao, Eric P. Xing:
Entity Hierarchy Embedding. ACL (1) 2015: 1292-1300
[c2]
  dblp key:
  - conf/qshine/LiuCH15
  persistent URL:
  - https://dblp.org/rec/conf/qshine/LiuCH15
Yu-Jui Liu, Shin-Ming Cheng, Po-Yao Huang:
Cognitive vertical handover in heterogeneous networks. QSHINE 2015: 392-397
2014
[c1]
  dblp key:
  - conf/iwcmc/LinCH14
  persistent URL:
  - https://dblp.org/rec/conf/iwcmc/LinCH14
Han-Feng Lin, Shin-Ming Cheng, Po-Yao Huang:
Cognitive access in multichannel wireless networks using two-dimension Markov chain. IWCMC 2014: 169-173
