default search action
Po-Yao Huang 0001
Person information
- affiliation: Facebook AI
- affiliation: Carnegie Mellon University, School of Computer Science, Pittsburgh, PA, USA
Other persons with the same name
- Po-Yao Huang 0002 — National Taiwan University, Taipei, Taiwan
- Po-Yao Huang 0003 — National Cheng Kung University, Department of Industrial and Information Management, Taiwan
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j5]Maxime Oquab, Timothée Darcet, Théo Moutakanni, Huy V. Vo, Marc Szafraniec, Vasil Khalidov, Pierre Fernandez, Daniel Haziza, Francisco Massa, Alaaeldin El-Nouby, Mido Assran, Nicolas Ballas, Wojciech Galuba, Russell Howes, Po-Yao Huang, Shang-Wen Li, Ishan Misra, Michael Rabbat, Vasu Sharma, Gabriel Synnaeve, Hu Xu, Hervé Jégou, Julien Mairal, Patrick Labatut, Armand Joulin, Piotr Bojanowski:
DINOv2: Learning Robust Visual Features without Supervision. Trans. Mach. Learn. Res. 2024 (2024) - [c41]Puyuan Peng, Po-Yao Huang, Shang-Wen Li, Abdelrahman Mohamed, David Harwath:
VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild. ACL (1) 2024: 12442-12462 - [c40]Jiawei Ma, Po-Yao Huang, Saining Xie, Shang-Wen Li, Luke Zettlemoyer, Shih-Fu Chang, Wen-Tau Yih, Hu Xu:
MoDE: CLIP Data Experts via Clustering. CVPR 2024: 26344-26353 - [c39]Tingle Li, Renhao Wang, Po-Yao Huang, Andrew Owens, Gopala Anumanchipalli:
Self-Supervised Audio-Visual Soundscape Stylization. ECCV (80) 2024: 20-40 - [c38]Hu Xu, Po-Yao Huang, Xiaoqing Ellen Tan, Ching-Feng Yeh, Jacob Kahn, Christine Jou, Gargi Ghosh, Omer Levy, Luke Zettlemoyer, Wen-tau Yih, Shang-Wen Li, Saining Xie, Christoph Feichtenhofer:
Altogether: Image Captioning via Re-aligning Alt-text. EMNLP 2024: 19302-19318 - [c37]Yuan Tseng, Layne Berry, Yiting Chen, I-Hsiang Chiu, Hsuan-Hao Lin, Max Liu, Puyuan Peng, Yi-Jen Shih, Hung-Yu Wang, Haibin Wu, Poyao Huang, Chun-Mao Lai, Shang-Wen Li, David Harwath, Yu Tsao, Abdelrahman Mohamed, Chi-Luen Feng, Hung-Yi Lee:
AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models. ICASSP 2024: 6890-6894 - [c36]Hu Xu, Saining Xie, Xiaoqing Ellen Tan, Po-Yao Huang, Russell Howes, Vasu Sharma, Shang-Wen Li, Gargi Ghosh, Luke Zettlemoyer, Christoph Feichtenhofer:
Demystifying CLIP Data. ICLR 2024 - [i33]Xiaoyu Zhu, Junwei Liang, Po-Yao Huang, Alex Hauptmann:
Adversarially Masked Video Consistency for Unsupervised Domain Adaptation. CoRR abs/2403.16242 (2024) - [i32]Puyuan Peng, Po-Yao Huang, Daniel Li, Abdelrahman Mohamed, David Harwath:
VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild. CoRR abs/2403.16973 (2024) - [i31]Jiawei Ma, Po-Yao Huang, Saining Xie, Shang-Wen Li, Luke Zettlemoyer, Shih-Fu Chang, Wen-Tau Yih, Hu Xu:
MoDE: CLIP Data Experts via Clustering. CoRR abs/2404.16030 (2024) - [i30]Vasu Sharma, Karthik Padthe, Newsha Ardalani, Kushal Tirumala, Russell Howes, Hu Xu, Po-Yao Huang, Shang-Wen Li, Armen Aghajanyan, Gargi Ghosh, Luke Zettlemoyer:
Text Quality-Based Pruning for Efficient Training of Language Models. CoRR abs/2405.01582 (2024) - [i29]Tingle Li, Renhao Wang, Po-Yao Huang, Andrew Owens, Gopala Anumanchipalli:
Self-Supervised Audio-Visual Soundscape Stylization. CoRR abs/2409.14340 (2024) - 2023
- [j4]Mingjie Li, Po-Yao Huang, Xiaojun Chang, Junjie Hu, Yi Yang, Alex Hauptmann:
Video Pivoting Unsupervised Multi-Modal Machine Translation. IEEE Trans. Pattern Anal. Mach. Intell. 45(3): 3918-3932 (2023) - [c35]Tiezheng Yu, Hanchao Yu, Davis Liang, Yuning Mao, Shaoliang Nie, Po-Yao Huang, Madian Khabsa, Pascale Fung, Yi-Chia Wang:
Generating Hashtags for Short-form Videos with Guided Signals. ACL (1) 2023: 9482-9495 - [c34]Ching-Feng Yeh, Po-Yao Huang, Vasu Sharma, Shang-Wen Li, Gargi Ghosh:
Flap: Fast Language-Audio Pre-Training. ASRU 2023: 1-8 - [c33]Xiaoyu Zhu, Po-Yao Huang, Junwei Liang, Celso M. de Melo, Alexander G. Hauptmann:
STMT: A Spatial-Temporal Mesh Transformer for MoCap-Based Action Recognition. CVPR 2023: 1526-1536 - [c32]Hu Xu, Saining Xie, Po-Yao Huang, Licheng Yu, Russell Howes, Gargi Ghosh, Luke Zettlemoyer, Christoph Feichtenhofer:
CiT: Curation in Training for Effective Vision-Language Data. ICCV 2023: 15134-15143 - [c31]Chen Wei, Karttikeya Mangalam, Po-Yao Huang, Yanghao Li, Haoqi Fan, Hu Xu, Huiyu Wang, Cihang Xie, Alan L. Yuille, Christoph Feichtenhofer:
Diffusion Models as Masked Autoencoders. ICCV 2023: 16238-16248 - [c30]Chaitanya Ryali, Yuan-Ting Hu, Daniel Bolya, Chen Wei, Haoqi Fan, Po-Yao Huang, Vaibhav Aggarwal, Arkabandhu Chowdhury, Omid Poursaeed, Judy Hoffman, Jitendra Malik, Yanghao Li, Christoph Feichtenhofer:
Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles. ICML 2023: 29441-29454 - [c29]Po-Yao Huang, Vasu Sharma, Hu Xu, Chaitanya Ryali, Haoqi Fan, Yanghao Li, Shang-Wen Li, Gargi Ghosh, Jitendra Malik, Christoph Feichtenhofer:
MAViL: Masked Audio-Video Learners. NeurIPS 2023 - [i28]Hu Xu, Saining Xie, Po-Yao Huang, Licheng Yu, Russell Howes, Gargi Ghosh, Luke Zettlemoyer, Christoph Feichtenhofer:
CiT: Curation in Training for Effective Vision-Language Data. CoRR abs/2301.02241 (2023) - [i27]Xiaoyu Zhu, Po-Yao Huang, Junwei Liang, Celso M. de Melo, Alexander G. Hauptmann:
STMT: A Spatial-Temporal Mesh Transformer for MoCap-Based Action Recognition. CoRR abs/2303.18177 (2023) - [i26]Chen Wei, Karttikeya Mangalam, Po-Yao Huang, Yanghao Li, Haoqi Fan, Hu Xu, Huiyu Wang, Cihang Xie, Alan L. Yuille, Christoph Feichtenhofer:
Diffusion Models as Masked Autoencoders. CoRR abs/2304.03283 (2023) - [i25]Maxime Oquab, Timothée Darcet, Théo Moutakanni, Huy V. Vo, Marc Szafraniec, Vasil Khalidov, Pierre Fernandez, Daniel Haziza, Francisco Massa, Alaaeldin El-Nouby, Mahmoud Assran, Nicolas Ballas, Wojciech Galuba, Russell Howes, Po-Yao Huang, Shang-Wen Li, Ishan Misra, Michael G. Rabbat, Vasu Sharma, Gabriel Synnaeve, Hu Xu, Hervé Jégou, Julien Mairal, Patrick Labatut, Armand Joulin, Piotr Bojanowski:
DINOv2: Learning Robust Visual Features without Supervision. CoRR abs/2304.07193 (2023) - [i24]Chaitanya Ryali, Yuan-Ting Hu, Daniel Bolya, Chen Wei, Haoqi Fan, Po-Yao Huang, Vaibhav Aggarwal, Arkabandhu Chowdhury, Omid Poursaeed, Judy Hoffman, Jitendra Malik, Yanghao Li, Christoph Feichtenhofer:
Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles. CoRR abs/2306.00989 (2023) - [i23]Seamless Communication, Loïc Barrault, Yu-An Chung, Mariano Coria Meglioli, David Dale, Ning Dong, Paul-Ambroise Duquenne, Hady Elsahar, Hongyu Gong, Kevin Heffernan, John Hoffman, Christopher Klaiber, Pengwei Li, Daniel Licht, Jean Maillard, Alice Rakotoarison, Kaushik Ram Sadagopan, Guillaume Wenzek, Ethan Ye, Bapi Akula, Peng-Jen Chen, Naji El Hachem, Brian Ellis, Gabriel Mejia Gonzalez, Justin Haaheim, Prangthip Hansanti, Russ Howes, Bernie Huang, Min-Jae Hwang, Hirofumi Inaguma, Somya Jain, Elahe Kalbassi, Amanda Kallet, Ilia Kulikov, Janice Lam, Daniel Li, Xutai Ma, Ruslan Mavlyutov, Benjamin N. Peloquin, Mohamed Ramadan, Abinesh Ramakrishnan, Anna Y. Sun, Kevin Tran, Tuan Tran, Igor Tufanov, Vish Vogeti, Carleigh Wood, Yilin Yang, Bokai Yu, Pierre Andrews, Can Balioglu, Marta R. Costa-jussà, Onur Celebi, Maha Elbayad, Cynthia Gao, Francisco Guzmán, Justine Kao, Ann Lee, Alexandre Mourachko, Juan Pino, Sravya Popuri, Christophe Ropers, Safiyyah Saleem, Holger Schwenk, Paden Tomasello, Changhan Wang, Jeff Wang, Skyler Wang:
SeamlessM4T-Massively Multilingual & Multimodal Machine Translation. CoRR abs/2308.11596 (2023) - [i22]Yuan Tseng, Layne Berry, Yi-Ting Chen, I-Hsiang Chiu, Hsuan-Hao Lin, Max Liu, Puyuan Peng, Yi-Jen Shih, Hung-Yu Wang, Haibin Wu, Po-Yao Huang, Chun-Mao Lai, Shang-Wen Li, David Harwath, Yu Tsao, Shinji Watanabe, Abdelrahman Mohamed, Chi-Luen Feng, Hung-yi Lee:
AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models. CoRR abs/2309.10787 (2023) - [i21]Hu Xu, Saining Xie, Xiaoqing Ellen Tan, Po-Yao Huang, Russell Howes, Vasu Sharma, Shang-Wen Li, Gargi Ghosh, Luke Zettlemoyer, Christoph Feichtenhofer:
Demystifying CLIP Data. CoRR abs/2309.16671 (2023) - [i20]Ching-Feng Yeh, Po-Yao Huang, Vasu Sharma, Shang-Wen Li, Gargi Ghosh:
FLAP: Fast Language-Audio Pre-training. CoRR abs/2311.01615 (2023) - 2022
- [j3]Pengzhen Ren, Yun Xiao, Xiaojun Chang, Poyao Huang, Zhihui Li, Xiaojiang Chen, Xin Wang:
A Comprehensive Survey of Neural Architecture Search: Challenges and Solutions. ACM Comput. Surv. 54(4): 76:1-76:34 (2022) - [j2]Pengzhen Ren, Yun Xiao, Xiaojun Chang, Po-Yao Huang, Zhihui Li, Brij B. Gupta, Xiaojiang Chen, Xin Wang:
A Survey of Deep Active Learning. ACM Comput. Surv. 54(9): 180:1-180:40 (2022) - [c28]Juncheng B. Li, Shuhui Qu, Xinjian Li, Bernie Po-Yao Huang, Florian Metze:
On Adversarial Robustness Of Large-Scale Audio Visual Learning. ICASSP 2022: 231-235 - [c27]Juncheng Li, Shuhui Qu, Po-Yao Huang, Florian Metze:
AudioTagging Done Right: 2nd comparison of deep learning methods for environmental sound classification. INTERSPEECH 2022: 1521-1525 - [c26]Po-Yao Huang, Hu Xu, Juncheng Li, Alexei Baevski, Michael Auli, Wojciech Galuba, Florian Metze, Christoph Feichtenhofer:
Masked Autoencoders that Listen. NeurIPS 2022 - [i19]Armen Aghajanyan, Bernie Huang, Candace Ross, Vladimir Karpukhin, Hu Xu, Naman Goyal, Dmytro Okhonko, Mandar Joshi, Gargi Ghosh, Mike Lewis, Luke Zettlemoyer:
CM3: A Causal Masked Multimodal Model of the Internet. CoRR abs/2201.07520 (2022) - [i18]Juncheng B. Li, Shuhui Qu, Xinjian Li, Po-Yao Huang, Florian Metze:
On Adversarial Robustness of Large-scale Audio Visual Learning. CoRR abs/2203.12122 (2022) - [i17]Juncheng B. Li, Shuhui Qu, Po-Yao Huang, Florian Metze:
AudioTagging Done Right: 2nd comparison of deep learning methods for environmental sound classification. CoRR abs/2203.13448 (2022) - [i16]Po-Yao Huang, Hu Xu, Juncheng Li, Alexei Baevski, Michael Auli, Wojciech Galuba, Florian Metze, Christoph Feichtenhofer:
Masked Autoencoders that Listen. CoRR abs/2207.06405 (2022) - [i15]Po-Yao Huang, Vasu Sharma, Hu Xu, Chaitanya Ryali, Haoqi Fan, Yanghao Li, Shang-Wen Li, Gargi Ghosh, Jitendra Malik, Christoph Feichtenhofer:
MAViL: Masked Audio-Video Learners. CoRR abs/2212.08071 (2022) - 2021
- [j1]Di Yuan, Xiaojun Chang, Po-Yao Huang, Qiao Liu, Zhenyu He:
Self-Supervised Deep Correlation Tracking. IEEE Trans. Image Process. 30: 976-985 (2021) - [c25]Hu Xu, Gargi Ghosh, Po-Yao Huang, Prahal Arora, Masoumeh Aminzadeh, Christoph Feichtenhofer, Florian Metze, Luke Zettlemoyer:
VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding. ACL/IJCNLP (Findings) 2021: 4227-4239 - [c24]Hu Xu, Gargi Ghosh, Po-Yao Huang, Dmytro Okhonko, Armen Aghajanyan, Florian Metze, Luke Zettlemoyer, Christoph Feichtenhofer:
VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding. EMNLP (1) 2021: 6787-6800 - [c23]Juncheng B. Li, Kaixin Ma, Shuhui Qu, Po-Yao Huang, Florian Metze:
Audio-Visual Event Recognition Through the Lens of Adversary. ICASSP 2021: 616-620 - [c22]Mandela Patrick, Po-Yao Huang, Ishan Misra, Florian Metze, Andrea Vedaldi, Yuki M. Asano, João F. Henriques:
Space-Time Crop & Attend: Improving Cross-modal Video Representation Learning. ICCV 2021: 10540-10552 - [c21]Mandela Patrick, Po-Yao Huang, Yuki Markus Asano, Florian Metze, Alexander G. Hauptmann, João F. Henriques, Andrea Vedaldi:
Support-set bottlenecks for video-text representation learning. ICLR 2021 - [c20]Poyao Huang, Mandela Patrick, Junjie Hu, Graham Neubig, Florian Metze, Alex Hauptmann:
Multilingual Multimodal Pre-training for Zero-Shot Cross-Lingual Transfer of Vision-Language Models. NAACL-HLT 2021: 2443-2459 - [i14]Po-Yao Huang, Mandela Patrick, Junjie Hu, Graham Neubig, Florian Metze, Alexander G. Hauptmann:
Multilingual Multimodal Pre-training for Zero-Shot Cross-Lingual Transfer of Vision-Language Models. CoRR abs/2103.08849 (2021) - [i13]Mandela Patrick, Yuki Markus Asano, Bernie Huang, Ishan Misra, Florian Metze, João F. Henriques, Andrea Vedaldi:
Space-Time Crop & Attend: Improving Cross-modal Video Representation Learning. CoRR abs/2103.10211 (2021) - [i12]Hu Xu, Gargi Ghosh, Po-Yao Huang, Prahal Arora, Masoumeh Aminzadeh, Christoph Feichtenhofer, Florian Metze, Luke Zettlemoyer:
VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding. CoRR abs/2105.09996 (2021) - [i11]Hu Xu, Gargi Ghosh, Po-Yao Huang, Dmytro Okhonko, Armen Aghajanyan, Florian Metze, Luke Zettlemoyer, Christoph Feichtenhofer:
VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding. CoRR abs/2109.14084 (2021) - 2020
- [c19]Po-Yao Huang, Junjie Hu, Xiaojun Chang, Alexander G. Hauptmann:
Unsupervised Multimodal Neural Machine Translation with Pseudo Visual Pivoting. ACL 2020: 8226-8237 - [c18]Po-Yao Huang, Xiaojun Chang, Alexander G. Hauptmann, Eduard H. Hovy:
Forward and Backward Multimodal NMT for Improved Monolingual and Multilingual Cross-Modal Retrieval. ICMR 2020: 53-62 - [c17]Wenhe Liu, Guoliang Kang, Po-Yao Huang, Xiaojun Chang, Lijun Yu, Yijun Qian, Junwei Liang, Liangke Gui, Jing Wen, Peng Chen, Alexander G. Hauptmann:
Argus: Efficient Activity Detection System for Extended Video Analysis. WACV Workshops 2020: 126-133 - [i10]Po-Yao Huang, Junjie Hu, Xiaojun Chang, Alexander G. Hauptmann:
Unsupervised Multimodal Neural Machine Translation with Pseudo Visual Pivoting. CoRR abs/2005.03119 (2020) - [i9]Pengzhen Ren, Yun Xiao, Xiaojun Chang, Po-Yao Huang, Zhihui Li, Xiaojiang Chen, Xin Wang:
A Comprehensive Survey of Neural Architecture Search: Challenges and Solutions. CoRR abs/2006.02903 (2020) - [i8]Pengzhen Ren, Yun Xiao, Xiaojun Chang, Po-Yao Huang, Zhihui Li, Xiaojiang Chen, Xin Wang:
A Survey of Deep Active Learning. CoRR abs/2009.00236 (2020) - [i7]Mandela Patrick, Po-Yao Huang, Yuki Markus Asano, Florian Metze, Alexander G. Hauptmann, João F. Henriques, Andrea Vedaldi:
Support-set bottlenecks for video-text representation learning. CoRR abs/2010.02824 (2020) - [i6]Juncheng B. Li, Kaixin Ma, Shuhui Qu, Po-Yao Huang, Florian Metze:
Audio-Visual Event Recognition through the lens of Adversary. CoRR abs/2011.07430 (2020)
2010 – 2019
- 2019
- [c16]Po-Yao Huang, Xiaojun Chang, Alexander G. Hauptmann:
Multi-Head Attention with Diversity for Learning Grounded Multilingual Multimodal Representations. EMNLP/IJCNLP (1) 2019: 1461-1467 - [c15]Po-Yao Huang, Vaibhav, Xiaojun Chang, Alexander G. Hauptmann:
Improving What Cross-Modal Retrieval Models Learn through Object-Oriented Inter- and Intra-Modal Attention Networks. ICMR 2019: 244-252 - [c14]Po-Yao Huang, Guoliang Kang, Wenhe Liu, Xiaojun Chang, Alexander G. Hauptmann:
Annotation Efficient Cross-Modal Retrieval with Adversarial Attentive Alignment. ACM Multimedia 2019: 1758-1767 - [c13]Junpei Zhou, Xinyu Wang, Po-Yao Huang, Alexander G. Hauptmann:
CMU-Informedia at TREC 2019 Incident Streams Track. TREC 2019 - [c12]Xiaojun Chang, Wenhe Liu, Po-Yao Huang, Changlin Li, Fengda Zhu, Mingfei Han, Mingjie Li, Mengyuan Ma, Siyi Hu, Guoliang Kang, Junwei Liang, Liangke Gui, Lijun Yu, Yijun Qian, Jing Wen, Alexander G. Hauptmann:
MMVG-INF-Etrol@TRECVID 2019: Activities in Extended Video. TRECVID 2019 - [i5]Eduard H. Hovy, Jaime G. Carbonell, Hans Chalupsky, Anatole Gershman, Alex Hauptmann, Florian Metze, Teruko Mitamura, Zaid Sheikh, Ankit Dangi, Aditi Chaudhary, Xianyang Chen, Xiang Kong, Bernie Huang, Salvador Medina, Hector Liu, Xuezhe Ma, Maria Ryskina, Ramon Sanabria, Varun Gangal:
OPERA: Operations-oriented Probabilistic Extraction, Reasoning, and Analysis. TAC 2019 - [i4]Vaibhav, Po-Yao Huang, Robert E. Frederking:
RWR-GAE: Random Walk Regularization for Graph Auto Encoders. CoRR abs/1908.04003 (2019) - [i3]Po-Yao Huang, Xiaojun Chang, Alexander G. Hauptmann:
Multi-Head Attention with Diversity for Learning Grounded Multilingual Multimodal Representations. CoRR abs/1910.00058 (2019) - 2018
- [c11]Xiaojun Chang, Po-Yao Huang, Yi-Dong Shen, Xiaodan Liang, Yi Yang, Alexander G. Hauptmann:
RCAA: Relational Context-Aware Agents for Person Search. ECCV (9) 2018: 86-102 - [c10]Po-Yao Huang, Junwei Liang, Jean-Baptiste Lamare, Alexander G. Hauptmann:
Multimodal Filtering of Social Media for Temporal Monitoring and Event Analysis. ICMR 2018: 450-457 - [c9]Po-Yao Huang, Hong Shiang Lin, Sun-Yu Gordon Chi, Liang-Han Lin, Ming Ouhyoung:
Panoramic depth reconstruction within a single shot by optimizing global sphere radii. SIGGRAPH ASIA Posters 2018: 80:1-80:2 - [c8]Jia Chen, Shizhe Chen, Qin Jin, Alexander G. Hauptmann, Po-Yao Huang, Junwei Liang, Vaibhav, Xiaojun Chang, Jiang Liu, Ting-Yao Hu, Wenhe Liu, Wei Ke, Wayner Barrios, Haroon Idrees, Donghyun Yoo, Yaser Sheikh, Ruslan Salakhutdinov, Kris Kitani, Dong Huang:
Informedia @ TRECVID 2018: Ad-hoc Video Search, Video to Text Description, Activities in Extended video. TRECVID 2018 - [i2]Eduard H. Hovy, Taylor Berg-Kirkpatrick, Jaime G. Carbonell, Hans Chalupsky, Anatole Gershman, Alexander G. Hauptmann, Florian Metze, Teruko Mitamura, Aditi Chaudhary, Xianyang Chen, Bernie Po-Yao Huang, Hector Zhengzhong Liu, Xuezhe Ma, Shruti Palaskar, Dheeraj Rajagopal, Maria Ryskina, Ramon Sanabria:
OPERA: Operations-oriented Probabilistic Extraction, Reasoning, and Analysis. TAC 2018 - 2017
- [c7]Junwei Liang, Desai Fan, Han Lu, Poyao Huang, Jia Chen, Lu Jiang, Alexander G. Hauptmann:
An Event Reconstruction Tool for Conflict Monitoring Using Social Media. AAAI 2017: 5097-5098 - [c6]Junwei Liang, Poyao Huang, Jia Chen, Alexander G. Hauptmann:
Synchronization for multi-perspective videos in the wild. ICASSP 2017: 1592-1596 - [i1]Poyao Huang, Ye Yuan, Zhen-Zhong Lan, Lu Jiang, Alexander G. Hauptmann:
Video Representation Learning and Latent Concept Mining for Large-scale Multi-label Video Classification. CoRR abs/1707.01408 (2017) - 2016
- [c5]Junwei Liang, Jia Chen, Poyao Huang, Xuanchong Li, Lu Jiang, Zhenzhong Lan, Pingbo Pan, Hehe Fan, Qin Jin, Jiande Sun, Yang Chen, Yi Yang, Alexander G. Hauptmann:
Informedia @ TRECVID 2016. TRECVID 2016 - [c4]Po-Yao Huang, Frederick Liu, Sz-Rung Shiang, Jean Oh, Chris Dyer:
Attention-based Multimodal Neural Machine Translation. WMT 2016: 639-645 - 2015
- [c3]Zhiting Hu, Poyao Huang, Yuntian Deng, Yingkai Gao, Eric P. Xing:
Entity Hierarchy Embedding. ACL (1) 2015: 1292-1300 - [c2]Yu-Jui Liu, Shin-Ming Cheng, Po-Yao Huang:
Cognitive vertical handover in heterogeneous networks. QSHINE 2015: 392-397 - 2014
- [c1]Han-Feng Lin, Shin-Ming Cheng, Po-Yao Huang:
Cognitive access in multichannel wireless networks using two-dimension Markov chain. IWCMC 2014: 169-173
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-15 20:35 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint