MMAsia 2022: Tokyo, Japan
- Shuqiang Jiang, Kiyoharu Aizawa, Phoebe Chen, Keiji Yanai:
Proceedings of the 4th ACM International Conference on Multimedia in Asia, MMAsia 2022, Tokyo, Japan, December 13-16, 2022. ACM 2022, ISBN 978-1-4503-9478-9
Full Papers
- Gibran Benitez-Garcia, Hiroki Takahashi, Miguel Jimenez-Martinez, Jesus Olivares-Mercado:
TFM a Dataset for Detection and Recognition of Masked Faces in the Wild. 1:1-1:7
- Kazuhiro Yamawaki, Xian-Hua Han:
Deep Image and Kernel Prior Learning for Blind Super-Resolution. 2:1-2:7
- Zhen Chen, Ming Yang, Shiliang Zhang:
Asymmetric Label Propagation for Video Object Segmentation. 3:1-3:7
- Aoyu Li, Ikuro Sato, Kohta Ishikawa, Rei Kawakami, Rio Yokota:
Informative Sample-Aware Proxy for Deep Metric Learning. 4:1-4:11
- Wenzhe Li, Zirui Zhu, Tianchi Huang, Lifeng Sun, Chun Yuan:
Federated Knowledge Transfer for Heterogeneous Visual Models. 5:1-5:7
- Yingrui Ye, Yuya Moroto, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama:
Affective Embedding Framework with Semantic Representations from Tweets for Zero-Shot Visual Sentiment Prediction. 6:1-6:7
- Alessandro Arezzo, Stefano Berretti:
SPEAKER VGG CCT: Cross-Corpus Speech Emotion Recognition with Speaker Embedding and Vision Transformers. 7:1-7:7
- Heng Yu, Shuyan Ding, Lunbo Li, Jiexin Wu:
Self-Attentive CLIP Hashing for Unsupervised Cross-Modal Retrieval. 8:1-8:7
- Jingyu Lin, Yan Yan, Hanzi Wang:
An End-to-End Scene Text Detector with Dynamic Attention. 9:1-9:7
- Kit-Yung Lam, Liang Yang, Ahmad Alhilal, Lik-Hang Lee, Gareth Tyson, Pan Hui:
Human-Avatar Interaction in Metaverse: Framework for Full-Body Interaction. 10:1-10:7
- Junwen Chen, Keiji Yanai:
Parallel Queries for Human-Object Interaction Detection. 11:1-11:7
- Yeganeh Jalalpour, Wu-chi Feng, Feng Liu:
Sequential Frame-Interpolation and DCT-based Video Compression Framework. 12:1-12:7
- Qian Zhou, Zhe Yang, Hongpeng Guo, Beitong Tian, Klara Nahrstedt:
360BroadView: Viewer Management for Viewport Prediction in 360-Degree Video Live Broadcast. 13:1-13:7
- David Alexandre, Hsueh-Ming Hang, Wen-Hsiao Peng:
Two-Layer Learning-Based P-Frame Coding with Super-Resolution and Content-Adaptive Conditional ANF. 14:1-14:7
- Yunhui Shi, Shaopei An, Jin Wang, Baocai Yin:
Learned Bi-Directional Motion Prediction for Video Compression. 15:1-15:6
- Wan Teng Lim, Kelvin Ang, Yuen Peng Loh:
Deep Enhancement-Object Features Fusion for Low-Light Object Detection. 16:1-16:6
- Yuanyuan Xu, Haolun Lan:
Image Compression for Machines Using Boundary-Enhanced Saliency. 17:1-17:6
- Lanling Zeng, Lianxiong Wu, Yang Yang, Xiangjun Shen, Yongzhao Zhan:
Deep Weighted Guided Upsampling Network for Depth of Field Image Upsampling. 18:1-18:7
- Longlu Huang, Na Qi, Qing Zhu:
Multispectral Image Denoising via Structural Tensor Sparsity Promoting Model. 19:1-19:7
- Yuto Namba, Xian-Hua Han:
Multi-Scale Channel Transformer Network for Single Image Deraining. 20:1-20:7
- Jingyu Wang, Jie Nie, Hao Chen, Huaxin Xie, Chengyu Zheng, Min Ye, Zhiqiang Wei:
Remote Sensing Image Colorization Based on Joint Stream Deep Convolutional Generative Adversarial Networks. 21:1-21:8
- Fatima Albreiki, Sultan Abu Ghazal, Jean Lahoud, Rao Muhammad Anwer, Hisham Cholakkal, Fahad Shahbaz Khan:
On the Robustness of 3D Object Detectors. 22:1-22:7
- Peng-Fei Zhang, Zi Huang, Xin Luo, Pengfei Zhao:
Robust Learning with Adversarial Perturbations and Label Noise: A Two-Pronged Defense Approach. 23:1-23:7
- Chieh-Yin Liao, Chen-Hsiu Huang, Jun-Cheng Chen, Ja-Ling Wu:
Enhancing the Robustness of Deep Learning Based Fingerprinting to Improve Deepfake Attribution. 24:1-24:7
- Shunya Ohaga, Ren Togo, Takahiro Ogawa, Miki Haseyama:
Disentangled Image Attribute Editing in Latent Space via Mask-Based Retention Loss. 25:1-25:7
- Jun Kimata, Tomoya Nitta, Toru Tamaki:
ObjectMix: Data Augmentation by Copy-Pasting Objects in Videos for Action Recognition. 26:1-26:7
- Dhanalaxmi Gaddam, Jean Lahoud, Fahad Shahbaz Khan, Rao Muhammad Anwer, Hisham Cholakkal:
CMR3D: Contextualized Multi-Stage Refinement for 3D Object Detection. 27:1-27:8
Short Papers
- Vijay John, Yasutomo Kawanishi:
A Multimodal Sensor Fusion Framework Robust to Missing Modalities for Person Recognition. 28:1-28:5
- Daichi Horita, Kiyoharu Aizawa:
SLGAN: Style- and Latent-Guided Generative Adversarial Network for Desirable Makeup Transfer and Removal. 29:1-29:5
- Nozomu Onodera, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama:
Popularity-Aware Graph Social Recommendation for Fully Non-Interaction Users. 30:1-30:5
- Jia-Hua Tsai, Wei-Ta Chu:
Multimodal Fusion with Cross-Modal Attention for Action Recognition in Still Images. 31:1-31:5
- Kota Izumi, Keiji Yanai:
Zero-Shot Font Style Transfer with a Differentiable Renderer. 32:1-32:5
- Kenshiro Sato, Yoko Yamakata, Sosuke Amano, Kiyoharu Aizawa:
Wearable Camera Based Food Logging System. 33:1-33:5
- Ryota Kitabayashi, Taro Narahara, Toshihiko Yamasaki:
Graph Neural Network Based Living Comfort Prediction Using Real Estate Floor Plan Images. 34:1-34:5
- Lam Pham, Khoa Tran, Dat Ngo, Hieu Tang, Son Phan, Alexander Schindler:
Wider or Deeper Neural Network Architecture for Acoustic Scene Classification with Mismatched Recording Devices. 35:1-35:5
- Na Wang, Haoliang Wang, Stefano Petrangeli, Viswanathan Swaminathan, Fei Li, Songqing Chen:
A Reality Check of Positioning in Multiuser Mobile Augmented Reality: Measurement and Analysis. 36:1-36:5
- Ling Li, Lin Zhao, Linhao Xu, Jie Xu:
Towards High Performance One-Stage Human Pose Estimation. 37:1-37:5
- Xi Chen, Yongwei Gao, Wei Li:
Singing Voice Detection via Similarity-Based Semi-Supervised Learning. 38:1-38:5
Demo Papers
- Yuki Iwamoto, Tetsuro Kitahara:
A Music Loop Sequencer with User-Adaptive Music Loop Selection. 39:1-39:3
- Ryo Kawai, Noboru Yoshida, Jianquan Liu:
Action Detection System Based on Pose Information. 40:1-40:3
- Yu-Hsuan Lo, Shih-Wei Sun:
DeepHair: A DeepFake-Based Hairstyle Preview System. 41:1-41:2
- Sahil Goyal, Shagun Uppal, Sarthak Bhagat, Dhroov Goel, Sakshat Mali, Yi Yu, Yifang Yin, Rajiv Ratn Shah:
Emotional Talking Faces: Making Videos More Expressive and Realistic. 42:1-42:3
- Kei Nakamoto, Kohei Kumazawa, Hiroaki Karasawa, Sosuke Amano, Yoko Yamakata, Kiyoharu Aizawa:
FoodLog Athl: Multimedia Food Recording Platform for Dietary Guidance and Food Monitoring. 43:1-43:2
- Rintaro Yanagi, Ren Togo, Takahiro Ogawa, Miki Haseyama:
Rubber Material Retrieval System using Electron Microscope Images for Rubber Material Development. 44:1-44:3
- Tetsuro Kitahara, Akio Yonamine:
JamSketch Deep α: A CNN-Based Improvisation System in Accordance with User's Melodic Outline Drawing. 45:1-45:3
- Advaiit Rajjvaed, Saurabh Puri, Gurdeep Bhullar, Gaëlle Martin-Cocher:
GSTH266enc: A GStreamer Plugin for VVC Encoder. 46:1-46:3
- Chuanxu Jiang, Yanfang Wang, Qian Huang, Yiming Wang, Yuhan Dai:
Intelligent Video Surveillance Platform Based on FFmpeg and Yolov5. 47:1-47:3