ACCV 2024: Hanoi, Vietnam - Part V
- Minsu Cho, Ivan Laptev, Du Tran, Angela Yao, Hongbin Zha:
Computer Vision - ACCV 2024 - 17th Asian Conference on Computer Vision, Hanoi, Vietnam, December 8-12, 2024, Proceedings, Part V. Lecture Notes in Computer Science 15476, Springer 2025, ISBN 978-981-96-0916-1
Generative Models
- Parul Gupta, Munawar Hayat, Abhinav Dhall, Thanh-Toan Do:
Conditional Distribution Modelling for Few-Shot Image Synthesis with Diffusion Models. 3-20
- Denis Zavadski, Damjan Kalsan, Carsten Rother:
PrimeDepth: Efficient Monocular Depth Estimation with a Stable Diffusion Preimage. 21-40
- Yaochen Wu, Yu Meng, Lei Sun:
Diffusing Background Dictionary for Hyperspectral Anomaly Detection. 41-58
- Jinze Yang, Haoran Wang, Zining Zhu, Chenglong Liu, Meng Wymond Wu, Mingming Sun:
VIP: Versatile Image Outpainting Empowered by Multimodal Large Language Model. 59-76
- Xiaoyuan Fang, Longquan Dai, Jinhui Tang:
OmniFusion: Exemplar-Based Video Colorization Using OmniMotion and DifFusion Priors. 77-94
- Jiayi Wang, Zihao Liu, Xiaoyu Wu:
LoCo-MAD: Long-Range Context-Enhanced Model Towards Plot-Centric Movie Audio Description. 95-112
- Joanna Materzynska, Josef Sivic, Eli Shechtman, Antonio Torralba, Richard Zhang, Bryan C. Russell:
NewMove: Customizing Text-to-Video Models with Novel Motions. 113-130
- Xiaoqian Shen, Faizan Farooq Khan, Mohamed Elhoseiny:
EmoTalker: Audio Driven Emotion Aware Talking Head Generation. 131-147
- Geonung Kim, Beomsu Kim, Eunhyeok Park, Sunghyun Cho:
Diffusion Model Compression for Image-to-Image Translation. 148-166
- Ryota Yoshihashi, Yuya Otsuka, Kenji Doi, Tomohiro Tanaka, Hirokatsu Kataoka:
Exploring Limits of Diffusion-Synthetic Training with Weakly Supervised Semantic Segmentation. 167-186
- Jongmin Gim, Jihun Park, Kyoungmin Lee, Sunghoon Im:
Content-Adaptive Style Transfer: A Training-Free Approach with VQ Autoencoders. 187-204
- Yi Gao:
PSG-Adapter: Controllable Planning Scene Graph for Improving Text-to-Image Diffusion. 205-221
- Jingwei Zhang, Farzan Farnia:
Sparse Domain Transfer via Elastic Net Regularization. 222-238
- Tuong-Vy Truong-Thuy, Gia-Cat Bui-Le, Hai-Dang Nguyen, Trung-Nghia Le:
Rethinking Sampling for Music-Driven Long-Term Dance Generation. 239-255
- Jing Ma, Xiang Xiang, Yan He:
Masking Cascaded Self-attentions for Few-Shot Font-Generation Transformer. 256-272
- Victor Enescu, Hichem Sahbi:
Learning Classwise Untangled Continuums for Conditional Normalizing Flows. 273-290
Data Sets and Performance Analysis
- Pooyan Rahmanzadehgervi, Logan Bolton, Mohammad Reza Taesiri, Anh Totti Nguyen:
Vision Language Models are blind. 293-309
- Duy Le Dinh Anh, Kim Hoang Tran, Quang-Thuc Nguyen, Ngan Hoang Le:
Enhanced Kalman with Adaptive Appearance Motion SORT for Grounded Generic Multiple Object Tracking. 310-328
- Chang-Yu Hsieh, Jian-Jiun Ding:
ADSP: Advanced Dataset for Shadow Processing, Enabling Visible Occluders via Synthesizing Strategy. 329-347
- Md. Tanvir Islam, Inzamamul Alam, Simon S. Woo, Saeed Anwar, Ik Hyun Lee, Khan Muhammad:
LoLI-Street: Benchmarking Low-Light Image Enhancement and Beyond. 348-365
- Tudor Jianu, Baoru Huang, Hoan Nguyen, Binod Bhattarai, Tuong Do, Erman Tjiputra, Quang D. Tran, Pierre Berthet-Rayne, Ngan Le, Sebastiano Fichera, Anh Nguyen:
Guide3D: A Bi-planar X-ray Dataset for 3D Shape Reconstruction. 366-382
- Vannkinh Nom, Souhail Bakkali, Muhammad Muzzamil Luqman, Mickaël Coustaty, Jean-Marc Ogier:
KhmerST: A Low-Resource Khmer Scene Text Detection and Recognition Benchmark. 383-399
- Hashmat Shadab Malik, Muhammad Huzaifa, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan:
ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes. 400-417
Computational Photography and Sensing
- Ruisheng Gao, Zeyu Xiao, Zhiwei Xiong:
Mamba-Based Light Field Super-Resolution with Efficient Subspace Scanning. 421-437
- Kenji Doi, Shuntaro Okada, Ryota Yoshihashi, Hirokatsu Kataoka:
Real-SRGD: Enhancing Real-World Image Super-Resolution with Classifier-Free Guided Diffusion. 438-454
- KuanYan Chen, Atik Garg, Yu-Shuen Wang:
Seamless-Through-Breaking: Rethinking Image Stitching for Optimal Alignment. 455-469
- Zeyu Xiao, Jiateng Shou, Zhiwei Xiong:
Learning Complementary Maps for Light Field Salient Object Detection. 470-489