18th ECCV 2024: Milan, Italy - Part LXXX
- Ales Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol:
Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part LXXX. Lecture Notes in Computer Science 15138, Springer 2025, ISBN 978-3-031-72988-1
- Minh Tran, Yelin Kim, Che-Chun Su, Cheng-Hao Kuo, Min Sun, Mohammad Soleymani:
Ex2Eg-MAE: A Framework for Adaptation of Exocentric Video Masked Autoencoders for Egocentric Social Role Understanding. 1-19
- Tingle Li, Renhao Wang, Po-Yao Huang, Andrew Owens, Gopala Anumanchipalli:
Self-Supervised Audio-Visual Soundscape Stylization. 20-40
- Yeji Song, Wonsik Shin, Junsoo Lee, Jeesoo Kim, Nojun Kwak:
SAVE: Protagonist Diversification with Structure Agnostic Video Editing. 41-57
- Xiaohan Wang, Yuhui Zhang, Orr Zohar, Serena Yeung-Levy:
VideoAgent: Long-Form Video Understanding with Large Language Model as Agent. 58-76
- Thong Nguyen, Yi Bin, Xiaobao Wu, Xinshuai Dong, Zhiyuan Hu, Khoi Le, Cong-Duy Nguyen, See-Kiong Ng, Luu Anh Tuan:
Meta-optimized Angular Margin Contrastive Framework for Video-Language Representation Learning. 77-98
- Ekaterina Khramtsova, Mahsa Baktashmotlagh, Guido Zuccon, Xi Wang, Mathieu Salzmann:
Source-Free Domain-Invariant Performance Prediction. 99-116
- Sayanton V. Dibbo, Adam Breuer, Juston Moore, Michael A. Teti:
Improving Robustness to Model Inversion Attacks via Sparse Coding Architectures. 117-136
- Jeeyung Kim, Ze Wang, Qiang Qiu:
Constructing Concept-Based Models to Mitigate Spurious Correlations with Minimal Human Effort. 137-153
- Jialiang Tang, Shuo Chen, Gang Niu, Hongyuan Zhu, Joey Tianyi Zhou, Chen Gong, Masashi Sugiyama:
Direct Distillation Between Different Domains. 154-172
- Andy V. Huynh, Lauren E. Gillespie, Jael Lopez-Saucedo, Claire Tang, Rohan Sikand, Moises Exposito-Alonso:
Contrastive Ground-Level Image and Remote Sensing Pre-training Improves Representation Learning for Natural World Imagery. 173-190
- Pooja Guhan, Tsung-Wei Huang, Guan-Ming Su, Subhadra Gopalakrishnan, Dinesh Manocha:
V-Trans4Style: Visual Transition Recommendation for Video Production Style Adaptation. 191-206
- Jialian Wu, Jianfeng Wang, Zhengyuan Yang, Zhe Gan, Zicheng Liu, Junsong Yuan, Lijuan Wang:
GRiT: A Generative Region-to-Text Transformer for Object Understanding. 207-224
- Hongbeen Park, Minjeong Park, Giljoo Nam, Jinkyu Kim:
LRSLAM: Low-Rank Representation of Signed Distance Fields in Dense Visual SLAM System. 225-240
- Seokwon Shin, Hyungrok Do, Youngdoo Son:
Learning Representation for Multitask Learning Through Self-supervised Auxiliary Learning. 241-258
- Delong Wu, Hao Zhu, Qi Zhang, You Li, Zhan Ma, Xun Cao:
Neural Poisson Solver: A Universal and Continuous Framework for Natural Signal Blending. 259-275
- Anders Christensen, Nooshin Mojab, Khushman Patel, Karan Ahuja, Zeynep Akata, Ole Winther, Mar González-Franco, Andrea Colaco:
Geometry Fidelity for Spherical Images. 276-292
- Cheng Peng, Yutao Tang, Yifan Zhou, Nengyu Wang, Xijun Liu, Deming Li, Rama Chellappa:
BAGS: Blur Agnostic Gaussian Splatting Through Multi-scale Kernel Modeling. 293-310
- Erum Mushtaq, Duygu Nur Yaldiz, Yavuz Faruk Bakman, Jie Ding, Chenyang Tao, Dimitrios Dimitriadis, Salman Avestimehr:
CroMo-Mixup: Augmenting Cross-Model Representations for Continual Self-Supervised Learning. 311-328
- Jiachen Lu, Ze Huang, Zeyu Yang, Jiahui Zhang, Li Zhang:
WoVoGen: World Volume-Aware Diffusion for Controllable Multi-camera Driving Scene Generation. 329-345
- Guangtao Zheng, Wenqian Ye, Aidong Zhang:
Benchmarking Spurious Bias in Few-Shot Image Classifiers. 346-364
- Zongze Wu, Nicholas I. Kolkin, Jonathan Brandt, Richard Zhang, Eli Shechtman:
TurboEdit: Instant Text-Based Image Editing. 365-381
- Fadlullah Raji, John Murray-Bruce:
Soft Shadow Diffusion (SSD): Physics-Inspired Learning for 3D Computational Periscopy. 382-400
- Nazmul Karim, Abdullah Al Arafat, Umar Khalid, Zhishan Guo, Nazanin Rahnavard:
Augmented Neural Fine-Tuning for Efficient Backdoor Purification. 401-418
- Qi Guo, Hailong Shi, Huan Li, Jinsheng Xiao, Xingyu Gao:
REDIR: Refocus-Free Event-Based De-occlusion Image Reconstruction. 419-435
- Nazmul Karim, Hasan Iqbal, Umar Khalid, Chen Chen, Jing Hua:
Free-Editor: Zero-Shot Text-Driven 3D Scene Editing. 436-453
- Fenggen Yu, Yiming Qian, Xu Zhang, Francisca Gil-Ureta, Brian Jackson, Eric P. Bennett, Hao Zhang:
DPA-Net: Structured 3D Abstraction from Sparse Views via Differentiable Primitive Assembly. 454-471
- Zhiyu Tan, Mengping Yang, Luozheng Qin, Hao Yang, Ye Qian, Qiang Zhou, Cheng Zhang, Hao Li:
An Empirical Study and Analysis of Text-to-Image Generation Using Large Language Model-Powered Textual Representation. 472-489