


default search action
18th ECCV 2024: Milan, Italy - Part LXII
- Ales Leonardis
, Elisa Ricci
, Stefan Roth
, Olga Russakovsky
, Torsten Sattler
, Gül Varol
:
Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part LXII. Lecture Notes in Computer Science 15120, Springer 2025, ISBN 978-3-031-73032-0 - Aayam Shrestha, Pan Liu, Germán Ros, Kai Yuan, Alan Fern:
Generating Physically Realistic and Directable Human Motions from Multi-modal Inputs. 1-17 - Nikita Karaev, Ignacio Rocco, Benjamin Graham, Natalia Neverova, Andrea Vedaldi, Christian Rupprecht:
CoTracker: It Is Better to Track Together. 18-35 - Ziyi Lin, Dongyang Liu, Renrui Zhang, Peng Gao, Longtian Qiu, Han Xiao, Han Qiu, Wenqi Shao, Keqin Chen, Jiaming Han, Siyuan Huang, Yichi Zhang, Xuming He, Yu Qiao, Hongsheng Li:
SPHINX: A Mixer of Weights, Visual Embeddings and Image Scales for Multi-modal Large Language Models. 36-55 - Yuxuan Sun
, Hao Wu
, Chenglu Zhu
, Sunyi Zheng, Qizi Chen, Kai Zhang, Yunlong Zhang, Dan Wan, Xiaoxiao Lan, Mengyue Zheng, Jingxiong Li, Xinheng Lyu, Tao Lin
, Lin Yang:
PathMMU: A Massive Multimodal Expert-Level Benchmark for Understanding and Reasoning in Pathology. 56-73 - Avery Ma, Amir-massoud Farahmand, Yangchen Pan, Philip Torr, Jindong Gu:
Improving Adversarial Transferability via Model Alignment. 74-92 - Wenhao Ding, Yulong Cao, Ding Zhao, Chaowei Xiao, Marco Pavone:
RealGen: Retrieval Augmented Generation for Controllable Traffic Scenarios. 93-110 - Hao Tang, Weiyao Wang, Pierre Gleize, Matt Feiszli:
ADen: Adaptive Density Representations for Sparse-View Camera Pose Estimation. 111-128 - Yunsong Zhou, Linyan Huang, Qingwen Bu, Jia Zeng, Tianyu Li, Hang Qiu, Hongzi Zhu, Minyi Guo, Yu Qiao, Hongyang Li:
Embodied Understanding of Driving Scenarios. 129-148 - Chris Zhang, Sourav Biswas, Kelvin Wong, Kion Fallah, Lunjun Zhang, Dian Chen, Sergio Casas, Raquel Urtasun:
Learning to Drive via Asymmetric Self-Play. 149-168 - Zhening Huang
, Xiaoyang Wu
, Xi Chen
, Hengshuang Zhao
, Lei Zhu
, Joan Lasenby
:
OpenIns3D: Snap and Lookup for 3D Open-Vocabulary Instance Segmentation. 169-185 - Xijun Wang, Junbang Liang, Chun-Kai Wang, Kenan Deng, Yu Lou, Ming C. Lin, Shan Yang:
ViLA: Efficient Video-Language Alignment for Video Question Answering. 186-204 - Rohit Girdhar, Mannat Singh, Andrew Brown, Quentin Duval, Samaneh Azadi, Sai Saketh Rambhatla, Akbar Shah, Xi Yin, Devi Parikh, Ishan Misra:
Factorizing Text-to-Video Generation by Explicit Image Conditioning. 205-224 - Yang Zhao, Yanwu Xu, Zhisheng Xiao, Haolin Jia, Tingbo Hou:
MobileDiffusion: Instant Text-to-Image Generation on Mobile Devices. 225-242 - Yiyang Su, Minchul Kim
, Feng Liu
, Anil K. Jain, Xiaoming Liu
:
Open-Set Biometrics: Beyond Good Closed-Set Models. 243-261 - Siyuan Cheng, Guangyu Shen, Kaiyuan Zhang, Guanhong Tao, Shengwei An, Hanxi Guo, Shiqing Ma, Xiangyu Zhang:
UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening. 262-281 - Fengyuan Liu, Haochen Luo, Yiming Li, Philip Torr, Jindong Gu:
Which Model Generated This Image? A Model-Agnostic Approach for Origin Attribution. 282-301 - Opher Bar Nathan, Deborah Levy
, Tali Treibitz
, Dan Rosenbaum:
Osmosis: RGBD Diffusion Prior for Underwater Image Restoration. 302-319 - Feixiang Zhou
, Bryan M. Williams, Hossein Rahmani
:
Towards Adaptive Pseudo-Label Learning for Semi-Supervised Temporal Action Localization. 320-338 - Anders Holst, Niels Chr. Overgaard
:
Computing the Lipschitz Constant Needed for Fast Scene Recovery from CASSI Measurements. 339-353 - Yu Chi, Fangneng Zhan
, Sibo Wu, Christian Theobalt
, Adam Kortylewski
:
DatasetNeRF: Efficient 3D-Aware Data Factory with Generative Radiance Fields. 354-372 - Mikhail Okunev
, Marc Mapeke, Benjamin Attal
, Christian Richardt
, Matthew O'Toole
, James Tompkin
:
Flowed Time of Flight Radiance Fields. 373-389 - Haoran Li
, Long Ma
, Haolin Shi
, Yanbin Hao
, Yong Liao
, Lechao Cheng
, Peng Yuan Zhou
:
3D-GOI: 3D GAN Omni-Inversion for Multifaceted and Multi-object Editing. 390-406 - Chaitanya Patel, Shaojie Bai, Te-Li Wang, Jason M. Saragih, Shih-En Wei:
Fast Registration of Photorealistic Avatars for VR Facial Animation. 407-423 - Cristina Mata, Kanchana Ranasinghe, Michael S. Ryoo:
CoPT: Unsupervised Domain Adaptive Segmentation Using Domain-Agnostic Text Embeddings. 424-440 - Ziwei Yao, Ruiping Wang
, Xilin Chen
:
HiFi-Score: Fine-Grained Image Description Evaluation with Hierarchical Parsing Graphs. 441-458 - Anas Mahmoud
, Ali Harakeh
, Steven L. Waslander
:
Image-to-Lidar Relational Distillation for Autonomous Driving Data. 459-475 - Gemma Canet Tarres
, Zhe Lin
, Zhifei Zhang
, Jianming Zhang
, Yizhi Song, Dan Ruta, Andrew Gilbert
, John P. Collomosse
, Soo Ye Kim:
Thinking Outside the BBox: Unconstrained Generative Object Compositing. 476-495

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.