Abstract is missing.
- Session details: Deep-1 (Image Translation)Nicu Sebe. [doi]
- Session details: Keynote 1Susanne Boll. [doi]
- Session details: Keynote 2Tao Mei. [doi]
- Session details: Demo + Video + Makers' ProgramKwanghoon Sohn, Yong Man Ro. [doi]
- Session details: Panel-2Jiaying Liu 0001, Wen-Huang Cheng. [doi]
- Session details: FF-5Zhu Li. [doi]
- Session details: Grand Challenge-2Shuqiang Jiang. [doi]
- Session details: Grand Challenge-1Shuqiang Jiang. [doi]
- Session details: Best Paper SessionRainer Lienhart, Tao Mei. [doi]
- Session details: Multimedia-2 (Socical & Emotional Multimedia)Rongrong Ji. [doi]
- Session details: Brand New IdeasKiyoharu Aizawa. [doi]
- Session details: FF-1Max Mühlhäuser. [doi]
- Session details: Multimedia -3 (Multimedia Search)Jitao Sang. [doi]
- Session details: Vision-3 (Applications in Multimedia)Zheng-Jun Zha. [doi]
- Session details: FF-2Peng Cui. [doi]
- Session details: Keynote 5Wenwu Zhu 0001. [doi]
- Session details: Doctoral SymposiumMeng Wang. [doi]
- Session details: Keynote 4Kyoung Mu Lee. [doi]
- Session details: Multimodal-2 (Cross-Modal Translation)Xian-Sheng Hua. [doi]
- Session details: Deep-3 (Image Processing-Inpainting, Super-Resolution, Deblurring)Shuqiang Jiang. [doi]
- Session details: Vision-4 (Representation Learning)Marcel Worring. [doi]
- Session details: Vision-1 (Machine Learning)Jingkuan Song. [doi]
- Session details: System-1 (Video Analysis & Streaming)Xin Yang. [doi]
- Session details: Experience-1 (Multimedia Entertainment and Experience)Zhisheng Yan. [doi]
- Session details: Multimedia-1 (Multimedia Recommendation & Discovery)Mark Liao. [doi]
- Session details: Keynote 3Jiebo Luo. [doi]
- Session details: Deep-2 (Recognition)Qin Jin. [doi]
- Session details: Interactive ArtHyunjung Shim. [doi]
- Session details: Multimodal-1 (Multimodal Reasoning)Xian-Sheng Hua. [doi]
- Session details: FF-3Zhu Li. [doi]
- Session details: System-2 (Smart Multimedia Systems)Yijuan Lu. [doi]
- Session details: FF-4Wen-Huang Cheng. [doi]
- Session details: Open Source Software CompetitionMin-Chun Hu. [doi]
- Session details: FF-6Benoit Huet. [doi]
- Session details: Panel-1Jun Jitao, Yu Sang. [doi]
- Session details: Vision-2 (Object & Scene Understanding)Zheng-Jun Zha. [doi]
- SCRATCH: A Scalable Discrete Matrix Factorization Hashing for Cross-Modal RetrievalChuan-Xiang Li, Zhen-Duo Chen, Peng-fei Zhang, Xin Luo, Liqiang Nie, Wei Zhang, Xin-Shun Xu. 1-9 [doi]
- Predicting Visual Context for Unsupervised Event Segmentation in Continuous Photo-streamsAna Garcia del Molino, Joo-Hwee Lim, Ah-Hwee Tan. 10-17 [doi]
- Video-to-Video Translation with Global Temporal ConsistencyXingxing Wei, Jun Zhu, Sitong Feng, Hang Su. 18-25 [doi]
- Shared Linear Encoder-based Gaussian Process Latent Variable Model for Visual ClassificationJinxing Li, Bob Zhang, Guangming Lu, David Zhang. 26-34 [doi]
- Step-by-step Erasion, One-by-one Collection: A Weakly Supervised Temporal Action DetectorJia-Xing Zhong, Nannan Li, Weijie Kong, Tao Zhang, Thomas H. Li, Ge Li. 35-44 [doi]
- Multi-Human Parsing MachinesJianshu Li, Jian Zhao, Yunpeng Chen, Sujoy Roy, Shuicheng Yan, Jiashi Feng, Terence Sim. 45-53 [doi]
- Fast Parameter Adaptation for Few-shot Image Captioning and Visual Question AnsweringXuanyi Dong, Linchao Zhu, De Zhang, Yi Yang, Fei Wu. 54-62 [doi]
- Hierarchical Memory Modelling for Video CaptioningJunbo Wang, Wei Wang 0115, Yan Huang 0008, Liang Wang 0001, Tieniu Tan. 63-71 [doi]
- Incremental Deep Hidden Attribute LearningZheng Wang, Xiang Bai, Mang Ye, Shin'ichi Satoh. 72-80 [doi]
- CropNet: Real-Time ThumbnailingHuarong Chen, Bin Wang, Tianxiang Pan, Liwang Zhou, Hua Zeng. 81-89 [doi]
- Learning to Transfer: Generalizable Attribute Learning with Multitask Neural Model SearchZhi-Qi Cheng, Xiao Wu 0001, Siyu Huang, Jun-Xiu Li, Alexander G. Hauptmann, Qiang Peng. 90-98 [doi]
- Attention-based Pyramid Aggregation Network for Visual Place RecognitionYingying Zhu, Jiong Wang, Lingxi Xie, Liang Zheng 0001. 99-107 [doi]
- Semi-supervised Deep Generative Modelling of Incomplete Multi-Modality Emotional DataChangde Du, Changying Du, Hao Wang, Jinpeng Li, Wei-Long Zheng, Bao-Liang Lu, Huiguang He. 108-116 [doi]
- Twitter Sentiment Analysis via Bi-sense Emoji Embedding and Attention-based LSTMYuxiao Chen, Jianbo Yuan, Quanzeng You, Jiebo Luo. 117-125 [doi]
- Facial Expression Recognition in the Wild: A Cycle-Consistent Adversarial Attention Transfer ApproachFeifei Zhang, Tianzhu Zhang, Qirong Mao, Lingyu Duan, Changsheng Xu. 126-135 [doi]
- Inferring User Emotive State Changes in Realistic Human-Computer Conversational DialogsRunnan Li, Zhiyong Wu, Jia Jia 0001, Jingbei Li, Wei Chen, Helen Meng. 136-144 [doi]
- Self-boosted Gesture Interactive System with ST-NetZhengzhe Liu, Xiaojuan Qi, Lei Pang. 145-153 [doi]
- Slackliner - An Interactive Slackline Training AssistantFelix Kosmalla, Christian Murlowski, Florian Daiber, Antonio Krüger. 154-162 [doi]
- A Unified Generative Adversarial Framework for Image Generation and Person Re-identificationYaoyu Li, Tianzhu Zhang, Lingyu Duan, Changsheng Xu. 163-172 [doi]
- FoV-Aware Edge Caching for Adaptive 360° Video StreamingAnahita Mahzari, Afshin Taghavi Nasrabadi, Aliehsan Samiei, Ravi Prakash. 173-181 [doi]
- Don't just Look - Smell, Taste, and Feel the InteractionMarianna Obrist. 182 [doi]
- Style Separation and Synthesis via Generative Adversarial NetworksRui Zhang, Sheng Tang, Yu Li, Junbo Guo, Yongdong Zhang, Jintao Li, Shuicheng Yan. 183-191 [doi]
- Group Re-Identification: Leveraging and Integrating Multi-Grain InformationHao Xiao, Weiyao Lin, Bin Sheng, Ke Lu, Junchi Yan, Jingdong Wang, Errui Ding, Yihao Zhang, Hongkai Xiong. 192-200 [doi]
- OSMO: Online Specific Models for Occlusion in Multiple Object Tracking under Surveillance SceneXu Gao, Tingting Jiang. 201-210 [doi]
- Video Forecasting with Forward-Backward-Net: Delving Deeper into Spatiotemporal ConsistencyYuke Li. 211-219 [doi]
- Feature Constrained by Pixel: Hierarchical Adversarial Deep Domain AdaptationRui Shao, Xiangyuan Lan, Pong C. Yuen. 220-228 [doi]
- Fast and Light Manifold CNN based 3D Facial Expression Recognition across Pose VariationsZhixing Chen, Di Huang 0001, Yunhong Wang, Liming Chen 0002. 229-238 [doi]
- Explore Multi-Step Reasoning in Video Question AnsweringXiaomeng Song, Yucheng Shi, Xin Chen, Yahong Han. 239-247 [doi]
- Attention and Language Ensemble for Scene Text Recognition with Convolutional Sequence ModelingShancheng Fang, Hongtao Xie, Zheng-Jun Zha, Nannan Sun, Jianlong Tan, Yongdong Zhang. 248-256 [doi]
- Temporal Sequence Distillation: Towards Few-Frame Action Recognition in VideosZhaoyang Zhang, Zhanghui Kuang, Ping Luo, Litong Feng, Wei Zhang. 257-264 [doi]
- Previewer for Multi-Scale Object DetectorZhihang Fu, Zhongming Jin, Guo-Jun Qi, Chen Shen, Rongxin Jiang, Yaowu Chen, Xian-Sheng Hua. 265-273 [doi]
- Learning Discriminative Features with Multiple Granularities for Person Re-IdentificationGuanshuo Wang, Yufeng Yuan, Xiong Chen, Jiwei Li, Xi Zhou. 274-282 [doi]
- StripNet: Towards Topology Consistent Strip Structure SegmentationGuoxiang Qu, Wenwei Zhang, Zhe Wang, Xing Dai, Jianping Shi, Junjun He, Fei Li, Xiulan Zhang, Yu Qiao. 283-291 [doi]
- Emotion Recognition in Speech using Cross-Modal Transfer in the WildSamuel Albanie, Arsha Nagrani, Andrea Vedaldi, Andrew Zisserman. 292-301 [doi]
- Personalized Multiple Facial Action Unit Recognition through Generative Adversarial Recognition NetworkCan Wang, Shangfei Wang. 302-310 [doi]
- Investigation of Small Group Social Interactions Using Deep Visual Activity-Based Nonverbal FeaturesCigdem Beyan, Muhammad Shahid, Vittorio Murino. 311-319 [doi]
- Cross-Species Learning: A Low-Cost Approach to Learning Human Fight from Animal FightEugene Yujun Fu, Michael Xuelin Huang, Hong Va Leong, Grace Ngai. 320-327 [doi]
- Personalized Serious Games for Cognitive Intervention with Lifelog Visual AnalyticsQianli Xu, Vigneshwaran Subbaraju, Chee How Cheong, Aijing Wang, Kathleen Kang, Munirah Bashir, Yanhong Dong, Liyuan Li, Joo-Hwee Lim. 328-336 [doi]
- Drawing in a Virtual 3D Space - Introducing VR Drawing in Elementary School Art EducationWendy Bolier, Wolfgang Hürst, Guido van Bommel, Joost Bosman, Harriët Bosman. 337-345 [doi]
- CIRCE: Real-Time Caching for Instance Recognition on Cloud Environments and Multi-Core ArchitecturesLuca Lovagnini, Wenxiao Zhang, Farshid Hassani Bijarbooneh, Pan Hui. 346-354 [doi]
- Jaguar: Low Latency Mobile Augmented Reality with Flexible TrackingWenxiao Zhang, Bo Han, Pan Hui. 355-363 [doi]
- Challenges and Practices of Large Scale Visual Intelligence in the Real-WorldXian-Sheng Hua. 364 [doi]
- Structure Guided Photorealistic Style TransferYuheng Zhi, Huawei Wei, Bingbing Ni. 365-373 [doi]
- Crossing-Domain Generative Adversarial Networks for Unsupervised Multi-Domain Image-to-Image TranslationXuewen Yang, Dongliang Xie, Xin Wang. 374-382 [doi]
- Multi-View Image Generation from a Single-ViewBo Zhao, Xiao Wu 0001, Zhi-Qi Cheng, Hao Liu 0003, Zequn Jie, Jiashi Feng. 383-391 [doi]
- Sparsely Grouped Multi-Task Generative Adversarial Networks for Facial Attribute ManipulationJichao Zhang, Yezhi Shu, Songhua Xu, Gongze Cao, Fan Zhong, Meng Liu, Xueying Qin. 392-401 [doi]
- Visual Domain Adaptation with Manifold Embedded Distribution AlignmentJindong Wang, Wenjie Feng, Yiqiang Chen, Han Yu, Meiyu Huang, Philip S. Yu. 402-410 [doi]
- Causally Regularized Learning with Agnostic Data Selection BiasZheyan Shen, Peng Cui 0001, Kun Kuang, Bo Li, Peixuan Chen. 411-419 [doi]
- Robust Correlation Filter Tracking with Shepherded Instance-Aware ProposalsYanjie Liang, Qiangqiang Wu, Yi Liu, Yan Yan 0001, Hanzi Wang. 420-428 [doi]
- A Unified Framework for Multimodal Domain AdaptationFan Qi, Xiaoshan Yang, Changsheng Xu. 429-437 [doi]
- What Dress Fits Me Best?: Fashion Recommendation on the Clothing Style for Personal Body ShapeShintami Chusnul Hidayati, Cheng-chun Hsu, Yu-ting Chang, Kai-Lung Hua, Jianlong Fu, Wen-Huang Cheng. 438-446 [doi]
- CSAN: Contextual Self-Attention Network for User Sequential RecommendationXiaowen Huang, Shengsheng Qian, Quan Fang, Jitao Sang, Changsheng Xu. 447-455 [doi]
- Attentive Interactive Convolutional Matching for Community Question Answering in Social MultimediaJun Hu, Shengsheng Qian, Quan Fang, Changsheng Xu. 456-464 [doi]
- Beyond the Product: Discovering Image Posts for Brands in Social MediaFrancesco Gelli, Tiberio Uricchio, Xiangnan He 0001, Alberto Del Bimbo, Tat-Seng Chua. 465-473 [doi]
- Collaborative Annotation of Semantic Objects in Images with Multi-granularity SupervisionsLishi Zhang, Chenghan Fu, Jia Li. 474-482 [doi]
- GraphNet: Learning Image Pseudo Annotations for Weakly-Supervised Semantic SegmentationMengyang Pu, Yaping Huang, Qingji Guan, Qi Zou. 483-491 [doi]
- Boosting Scene Parsing Performance via Reliable Scale PredictionHengcan Shi, Hongliang Li, Qingbo Wu, Fanman Meng, King N. Ngan. 492-500 [doi]
- Learning to Synthesize 3D Indoor Scenes from Monocular ImagesFan Zhu 0001, Li Liu, Jin Xie, Fumin Shen, Ling Shao 0001, Yi Fang. 501-509 [doi]
- Visual Spatial Attention Network for Relationship DetectionChaojun Han, Fumin Shen, Li Liu, Yang Yang, Heng Tao Shen. 510-518 [doi]
- Object-Difference Attention: A Simple Relational Attention for Visual Question AnsweringChenfei Wu, Jinlai Liu, Xiaojie Wang, Xuan Dong. 519-527 [doi]
- Life-long Cross-media Correlation LearningJinwei Qi, Yuxin Peng, Yunkan Zhuo. 528-536 [doi]
- Human Conversation Analysis Using Attentive Multimodal Networks with Hierarchical Encoder-DecoderYue Gu, Xinyu Li, Kaixiang Huang, Shiyu Fu, Kangning Yang, Shuhong Chen, Moliang Zhou, Ivan Marsic. 537-545 [doi]
- End-to-End Blind Quality Assessment of Compressed Videos Using Deep Neural NetworksWenTao Liu, Zhengfang Duanmu, Zhou Wang. 546-554 [doi]
- FlexStream: Towards Flexible Adaptive Video Streaming on End Devices using Extreme SDNIbrahim Ben Mustafa, Tamer Nadeem, Emir Halepovic. 555-563 [doi]
- CLS: A Cross-user Learning based System for Improving QoE in 360-degree Video Adaptive StreamingLan Xie, Xinggong Zhang, Zongming Guo. 564-572 [doi]
- A Distributed Approach for Bitrate Selection in HTTP Adaptive StreamingAbdelhak Bentaleb, Ali C. Begen, Saad Harous, Roger Zimmermann. 573-581 [doi]
- High-Quality Exposure Correction of Underexposed PhotosQing Zhang, Ganzhao Yuan, Chunxia Xiao, Lei Zhu, Wei-Shi Zheng. 582-590 [doi]
- A Margin-based MLE for Crowdsourced Partial RankingQianqian Xu, Jiechao Xiong, Xinwei Sun 0001, Zhiyong Yang, Xiaochun Cao, Qingming Huang, Yuan Yao. 591-599 [doi]
- PHD-GIFs: Personalized Highlight Detection for Automatic GIF CreationAna Garcia del Molino, Michael Gygli. 600-608 [doi]
- Cross-Domain Adversarial Feature Learning for Sketch Re-identificationLu Pang, Yaowei Wang, Yi-Zhe Song, Tiejun Huang, Yonghong Tian 0001. 609-617 [doi]
- Semantic Human MattingQuan Chen, Tiezheng Ge, Yanyu Xu, Zhiqiang Zhang, Xinxin Yang, Kun Gai. 618-626 [doi]
- Geometry Guided Adversarial Facial Expression SynthesisLingxiao Song, Zhihe Lu, Ran He, Zhenan Sun, Tieniu Tan. 627-635 [doi]
- Detecting Abnormality without Knowing Normality: A Two-stage Approach for Unsupervised Video Abnormal Event DetectionSiqi Wang, Yijie Zeng, Qiang Liu, Chengzhang Zhu, En Zhu, Jianping Yin. 636-644 [doi]
- BeautyGAN: Instance-level Facial Makeup Transfer with Deep Generative Adversarial NetworkTingting Li, Ruihe Qian, Chao Dong, Si Liu, Qiong Yan, Wenwu Zhu 0001, Liang Lin. 645-653 [doi]
- Trusted Guidance Pyramid Network for Human ParsingXianghui Luo, Zhuo Su, Jiaming Guo, Gengwei Zhang, Xiangjian He. 654-662 [doi]
- I read, I saw, I tell: Texts Assisted Fine-Grained Visual ClassificationJingjing Li, Lei Zhu, Zi Huang, Ke Lu, Jidong Zhao. 663-671 [doi]
- Look Deeper See Richer: Depth-aware Image Paragraph CaptioningZiwei Wang, Yadan Luo, Yang Li, Zi Huang, Hongzhi Yin. 672-680 [doi]
- Learning Multimodal Taxonomy via Variational Deep Graph Embedding and ClusteringHuaiwen Zhang, Quan Fang, Shengsheng Qian, Changsheng Xu. 681-689 [doi]
- Watch, Think and Attend: End-to-End Video Classification via Dynamic Knowledge Evolution ModelingJunyu Gao, Tianzhu Zhang, Changsheng Xu. 690-699 [doi]
- Multi-Label Image Classification via Knowledge Distillation from Weakly-Supervised DetectionYongcheng Liu, Lu Sheng, Jing Shao, Junjie Yan, Shiming Xiang, Chunhong Pan. 700-708 [doi]
- Unregularized Auto-Encoder with Generative Adversarial Networks for Image GenerationJiayu Wang, Wengang Zhou, Jinhui Tang, Zhongqian Fu, Qi Tian 0001, Houqiang Li. 709-717 [doi]
- When to Learn What: Deep Cognitive Subspace ClusteringYangbangyan Jiang, Zhiyong Yang, Qianqian Xu, Xiaochun Cao, Qingming Huang. 718-726 [doi]
- Depth Structure Preserving Scene Image GenerationWendong Zhang, Feng Gao, Bingbing Ni, Lingyu Duan, Yichao Yan, Jingwei Xu, Xiaokang Yang. 727-736 [doi]
- 3Net: Contextual-Attentional Attribute-Appearance Network for Person Re-IdentificationJiawei Liu, Zheng-Jun Zha, Hongtao Xie, Zhiwei Xiong, Yongdong Zhang. 737-745 [doi]
- RGCNN: Regularized Graph CNN for Point Cloud SegmentationGusi Te, Wei Hu, Amin Zheng, Zongming Guo. 746-754 [doi]
- Deep Triplet QuantizationBin Liu, Yue Cao, Mingsheng Long, Jianmin Wang 0001, Jingdong Wang. 755-763 [doi]
- What has Art Got to do With It?Ernest A. Edmonds. 773 [doi]
- GestureGAN for Hand Gesture-to-Gesture Translation in the WildHao Tang 0005, Wei Wang 0018, Dan Xu 0002, Yan Yan 0002, Nicu Sebe. 774-782 [doi]
- Beyond Narrative Description: Generating Poetry from Images by Multi-Adversarial TrainingBei Liu, Jianlong Fu, Makoto P. Kato, Masatoshi Yoshikawa. 783-791 [doi]
- Understanding Humans in Crowded Scenes: Deep Nested Adversarial Learning and A New Benchmark for Multi-Human ParsingJian Zhao, Jianshu Li, Yu Cheng, Terence Sim, Shuicheng Yan, Jiashi Feng. 792-800 [doi]
- Knowledge-aware Multimodal Dialogue SystemsLizi Liao, Yunshan Ma, Xiangnan He 0001, Richang Hong, Tat-Seng Chua. 801-809 [doi]
- End2End Semantic Segmentation for 3D Indoor ScenesNa Zhao. 810-814 [doi]
- On Reducing Effort in Evaluating Laparoscopic SkillsSabrina Kletz. 815-819 [doi]
- Decode Human Life from Social MediaTianran Hu. 820-824 [doi]
- Learning Semantic Structure-preserved Embeddings for Cross-modal RetrievalYiling Wu, Shuhui Wang, Qingming Huang. 825-833 [doi]
- Post Tuned Hashing: A New Approach to Indexing High-dimensional DataZhendong Mao, Quan Wang, Yongdong Zhang, Bin Wang. 834-842 [doi]
- Cross-modal Moment Localization in VideosMeng Liu, Xiang Wang, Liqiang Nie, Qi Tian 0001, Baoquan Chen, Tat-Seng Chua. 843-851 [doi]
- Multi-Scale Correlation for Sequential Cross-modal Hashing LearningZhaoda Ye, Yuxin Peng. 852-860 [doi]
- Generative Adversarial Product QuantisationLitao Yu, Yongsheng Gao, Jun Zhou 0001. 861-869 [doi]
- Aesthetic-Driven Image Enhancement by Adversarial LearningYubin Deng, Chen Change Loy, Xiaoou Tang. 870-878 [doi]
- Attention-based Multi-Patch Aggregation for Image Aesthetic AssessmentKekai Sheng, Weiming Dong, Chongyang Ma, Xing Mei, Feiyue Huang, Bao-Gang Hu. 879-886 [doi]
- An End-to-End Quadrilateral Regression Network for Comic Panel ExtractionZheqi He, Yafeng Zhou, Yongtao Wang, Siwei Wang, Xiaoqing Lu, Zhi Tang, Ling Cai. 887-895 [doi]
- Monocular Camera Based Real-Time Dense Mapping Using Generative Adversarial NetworkXin Yang, Jinyu Chen, Zhiwei Wang, Qiaozhe Zhang, Wenyu Liu, Chunyuan Liao, Kwang-Ting Cheng. 896-904 [doi]
- JPEG Decompression in the Homomorphic Encryption DomainXiaojing Ma 0002, Changming Liu, Sixing Cao, Bin Zhu. 905-913 [doi]
- MiniView Layout for Bandwidth-Efficient 360-Degree VideoMengbai Xiao, Shuoqian Wang, Chao Zhou, Li Liu, Zhenhua Li 0001, Yao Liu, Songqing Chen. 914-922 [doi]
- Real-time 3D Face-Eye Performance Capture of a Person Wearing VR HeadsetGuoxian Song, Jianfei Cai, Tat-Jen Cham, Jianmin Zheng, Juyong Zhang, Henry Fuchs. 923-931 [doi]
- Bridge the Gap Between VQA and Human Behavior on Omnidirectional Video: A Large-Scale Dataset and a Deep Learning ModelChen Li, Mai Xu, Xinzhe Du, Zulin Wang. 932-940 [doi]
- Tracking-assisted Weakly Supervised Online Visual Object Segmentation in Unconstrained VideosZongpu Zhang, Yang Hua, Tao Song, Zhengui Xue, Ruhui Ma, Neil Martin Robertson, Haibing Guan. 941-949 [doi]
- ThoughtViz: Visualizing Human Thoughts Using Generative Adversarial NetworkPraveen Tirupattur, Yogesh Singh Rawat, Concetto Spampinato, Mubarak Shah. 950-958 [doi]
- A Feature-Adaptive Semi-Supervised Framework for Co-saliency DetectionXiaoju Zheng, Zheng-Jun Zha, Liansheng Zhuang. 959-966 [doi]
- iSPA-Net: Iterative Semantic Pose Alignment NetworkJogendra Nath Kundu, Aditya Ganeshan, Rahul M. V., Aditya Prakash, Venkatesh Babu R.. 967-975 [doi]
- Extractive Video Summarizer with Memory Augmented Neural NetworksLitong Feng, Ziyin Li, Zhanghui Kuang, Wei Zhang. 976-983 [doi]
- Fully Point-wise Convolutional Neural Network for Modeling Statistical Regularities in Natural ImagesJing Zhang, Yang Cao, Yang Wang, Chenglin Wen, Chang Wen Chen. 984-992 [doi]
- Online Action Tube Detection via Resolving the Spatio-temporal Context PatternJingjia Huang, Nannan Li, Jia-Xing Zhong, Thomas H. Li, Ge Li. 993-1001 [doi]
- Enhancing Visual Question Answering Using DropoutZhiwei Fang, Jing Liu, Yanyuan Qiao, Qu Tang, Yong Li, Hanqing Lu. 1002-1010 [doi]
- Face-Voice Matching using Cross-modal EmbeddingsShota Horiguchi, Naoyuki Kanda, Kenji Nagamatsu. 1011-1019 [doi]
- Deep Understanding of Cooking Procedure for Cross-modal Recipe RetrievalJingjing Chen, Chong-Wah Ngo, Fuli Feng, Tat-Seng Chua. 1020-1028 [doi]
- Decoupled Novel Object CaptionerYu Wu, Linchao Zhu, Lu Jiang, Yi Yang. 1029-1037 [doi]
- Temporal Cross-Media Retrieval with Soft-SmoothingDavid Semedo, João Magalhães. 1038-1046 [doi]
- Photo Squarization by Deep Multi-Operator RetargetingYu Song, Fan Tang, Weiming Dong, Xiaopeng Zhang 0001, Oliver Deussen, Tong-Yee Lee. 1047-1055 [doi]
- Non-locally Enhanced Encoder-Decoder Network for Single Image De-rainingGuanbin Li, Xiang He, Wei Zhang, HuiYou Chang, Le Dong, Liang Lin. 1056-1064 [doi]
- An ADMM-Based Universal Framework for Adversarial Attacks on Deep Neural NetworksPu Zhao, Sijia Liu 0001, Yanzhi Wang, Xue Lin. 1065-1073 [doi]
- Local Convolutional Neural Networks for Person Re-IdentificationJiwei Yang, Xu Shen, Xinmei Tian, Houqiang Li, Jianqiang Huang, Xian-Sheng Hua. 1074-1082 [doi]
- Conditional Expression Synthesis with Face Parsing TransformationZhihe Lu, Tanhao Hu, Lingxiao Song, Zhaoxiang Zhang, Ran He. 1083-1091 [doi]
- Attentive Recurrent Neural Network for Weak-supervised Multi-label Image ClassificationLiang Li 0003, Shuhui Wang, Shuqiang Jiang, Qingming Huang. 1092-1100 [doi]
- Deep Cross Modal Learning for Caricature Verification and Identification (CaVINet)Jatin Garg, Skand Vishwanath Peri, Himanshu Tolani, Narayanan C. Krishnan. 1101-1109 [doi]
- Few-Shot Adaptation for Multimedia Semantic IndexingNakamasa Inoue, Koichi Shinoda. 1110-1118 [doi]
- Fashion Sensitive Clothing Recommendation Using Hierarchical Collocation ModelZhengzhong Zhou, Xiu Di, Wei Zhou, Liqing Zhang. 1119-1127 [doi]
- Multi-Scale Context Attention Network for Image RetrievalYihang Lou, Yan Bai, Shiqi Wang, Ling-Yu Duan. 1128-1136 [doi]
- Comprehensive Distance-Preserving Autoencoders for Cross-Modal RetrievalYibing Zhan, Jun Yu 0002, Zhou Yu, Rong Zhang, Dacheng Tao, Qi Tian 0001. 1137-1145 [doi]
- Temporal Hierarchical Attention at Category- and Item-Level for Micro-Video Click-Through PredictionXusong Chen, Dong Liu, Zheng-Jun Zha, Wengang Zhou, Zhiwei Xiong, Yan Li. 1146-1153 [doi]
- Historical Context-based Style Classification of Painting Images via Label Distribution LearningJufeng Yang, Liyi Chen, Le Zhang, Xiaoxiao Sun, Dongyu She, Shao-Ping Lu, Ming-Ming Cheng. 1154-1162 [doi]
- Direction-aware Neural Style TransferHao Wu, Zhengxing Sun, Weihang Yuan. 1163-1171 [doi]
- ChipGAN: A Generative Adversarial Network for Chinese Ink Wash Painting Style TransferBin He, Feng Gao, Daiqian Ma, Boxin Shi, Ling-Yu Duan. 1172-1180 [doi]
- CloudVR: Cloud Accelerated Interactive Mobile Virtual RealityTeemu Kämäräinen, Matti Siekkinen, Jukka Eerikäinen, Antti Ylä-Jääski. 1181-1189 [doi]
- Your Attention is Unique: Detecting 360-Degree Video Saliency in Head-Mounted Display for Head Movement PredictionAnh Nguyen, Zhisheng Yan, Klara Nahrstedt. 1190-1198 [doi]
- Hybrid Point Cloud Attribute Compression Using Slice-based Layered Structure and Block-based Intra PredictionYiting Shao, Qi Zhang, Ge Li, Zhu Li, Li Li. 1199-1207 [doi]
- QARC: Video Quality Aware Rate Control for Real-Time Video Streaming based on Deep Reinforcement LearningTianchi Huang, Rui-Xiao Zhang, Chao Zhou, Lifeng Sun. 1208-1216 [doi]
- Optimizing Personalized Interaction Experience in Crowd-Interactive Livecast: A Cloud-Edge ApproachHaitian Pang, Cong Zhang, Fangxin Wang, Han Hu, Zhi Wang, Jiangchuan Liu, Lifeng Sun. 1217-1225 [doi]
- Give Me One Portrait Image, I Will Tell You Your Emotion and PersonalitySongyou Peng, Le Zhang, Stefan Winkler, Marianne Winslett. 1226-1227 [doi]
- Demo: Phase-based Acoustic Localization and Motion Tracking for Mobile InteractionYang Liu, Yang Yang, Weidong Fang, Wuxiong Zhang. 1228-1230 [doi]
- AI Painting: An Aesthetic Painting Generation SystemCunjun Zhang, Kehua Lei, Jia Jia, Yihui Ma, Zhiyuan Hu. 1231-1233 [doi]
- SoMin.ai: Social Multimedia Influencer Discovery MarketplaceAleksandr Farseev, Kirill Lepikhin, Hendrik Schwartz, Eu Khoon Ang, Kenny Powar. 1234-1236 [doi]
- AniDance: Real-Time Dance Motion Synthesize to the SongTaoran Tang, Hanyang Mao, Jia Jia. 1237-1239 [doi]
- ArtSight: An Artistic Data Exploration EngineGjorgji Strezoski, Inske Groenen, Jurriaan Besenbruch, Marcel Worring. 1240-1241 [doi]
- Meet AR-bot: Meeting Anywhere, Anytime with Movable Spatial AR RobotYoonjung Park, Yoonsik Yang, Hyocheol Ro, Junghyun Byun, Seougho Chae, Tack-Don Han. 1242-1243 [doi]
- Magical Rice Bowl: A Real-time Food Category ChangerRyosuke Tanno, Daichi Horita, Wataru Shimoda, Keiji Yanai. 1244-1246 [doi]
- Exploring Temporal Communities in Mass Media ArchivesHaolin Ren, Benjamin Renoust, Guy Melançon, Marie-Luce Viaud, Shin'ichi Satoh. 1247-1249 [doi]
- SoniControl - A Mobile Ultrasonic FirewallMatthias Zeppelzauer, Alexis Ringot, Florian Taurer. 1250-1252 [doi]
- MusicMapp: A Deep Learning Based Solution for Music Exploration and Visual InteractionMohammed Habibullah Baig, Jibin Rajan Varghese, Zhangyang Wang. 1253-1255 [doi]
- Demonstration of an Open Source Framework for Qualitative Evaluation of CBIR SystemsPaula Gómez Duran, Eva Mohedano, Kevin McGuinness, Xavier Giró i Nieto, Noel E. O'Connor. 1256-1257 [doi]
- A Demonstration of an Intelligent Storytelling SystemYun-Gyung Cheong, Woo-Hyun Park, Hye-Yeon Yu. 1258-1259 [doi]
- IcooBook: When the Picture Book for Children Encounters Aesthetics of InteractionYaohua Bu, Jia Jia 0002, Xiang Li, Suping Zhou, Xiaobo Lu. 1260-1262 [doi]
- An Implementation of a DASH Client for Browsing Networked Virtual EnvironmentThomas Forgione, Axel Carlier, Géraldine Morin, Wei Tsang Ooi, Vincent Charvillat, Praveen Kumar Yadav. 1263-1264 [doi]
- Knowledge-aware Multimodal Fashion ChatbotLizi Liao, You Zhou, Yunshan Ma, Richang Hong, Tat-Seng Chua. 1265-1266 [doi]
- SVIAS: Scene-segmented Video Information Annotation SystemAlex Lee, Chang-Uk Kwak, Jeong Woo Son, Sun-Joong Kim. 1267-1269 [doi]
- Interactive Story Maker: Tagged Video Retrieval System for Video Re-creation ServiceChang-Uk Kwak, Minho Han, Sun-Joong Kim, Gyeong June Hahm. 1270-1271 [doi]
- HeterStyle: A Heterogeneous Video Style Transfer ApplicationXingyu Liu, Jingfan Guo, Tongwei Ren, Yahong Han, Lei Huang, Gangshan Wu. 1272-1273 [doi]
- PAMI: Projection Augmented Meeting Interface for Video ConferencingHyocheol Ro, InHwan Kim, Junghyun Byun, Yoonsik Yang, Yoonjung Park, Seungho Chae, Tack-Don Han. 1274-1277 [doi]
- ChildAR-bot: Educational Playing Projection-based AR Robot for ChildrenYoonjung Park, Yoonsik Yang, Hyocheol Ro, Jinwon Cha, Kyuri Kim, Tack-Don Han. 1278-1282 [doi]
- Mining Semantics-Preserving Attention for Group Activity RecognitionYansong Tang, Zian Wang, Peiyang Li, Jiwen Lu, Ming Yang, Jie Zhou. 1283-1291 [doi]
- Participation-Contributed Temporal Dynamic Model for Group Activity RecognitionRui Yan, Jinhui Tang, Xiangbo Shu, Zechao Li, Qi Tian 0001. 1292-1300 [doi]
- WildFish: A Large Benchmark for Fish Recognition in the WildPeiqin Zhuang, Yali Wang, Yu Qiao. 1301-1309 [doi]
- PVNet: A Joint Convolutional Network of Point Cloud and Multi-View for 3D Shape RecognitionHaoxuan You, Yifan Feng, Rongrong Ji, Yue Gao. 1310-1318 [doi]
- EmotionGAN: Unsupervised Domain Adaptation for Learning Discrete Probability Distributions of Image EmotionsSicheng Zhao, Xin Zhao, Guiguang Ding, Kurt Keutzer. 1319-1327 [doi]
- USAR: An Interactive User-specific Aesthetic Ranking Framework for ImagesPei Lv, Meng Wang, Yongbo Xu, Ze Peng, Junyi Sun, Shi-Mei Su, Bing Zhou, Mingliang Xu. 1328-1336 [doi]
- Deep Multimodal Image-Repurposing DetectionEkraam Sabir, Wael AbdAlmageed, Yue Wu 0001, Prem Natarajan. 1337-1345 [doi]
- Facial Expression Recognition Enhanced by Thermal Images through Adversarial LearningBowen Pan, Shangfei Wang. 1346-1353 [doi]
- Deep Learning for Multimedia: Science or Technology?Jitao Sang, Jun Yu, Ramesh Jain, Rainer Lienhart, Peng Cui, Jiashi Feng. 1354-1355 [doi]
- VIVID: Virtual Environment for Visual Deep LearningKuan-Ting Lai, Chia-Chih Lin, Chun-Yao Kang, Mei-Enn Liao, Ming-Syan Chen. 1356-1359 [doi]
- A General-purpose Distributed Programming System using Data-parallel StreamsTsung-Wei Huang, Chun-Xun Lin, Guannan Guo, Martin D. F. Wong. 1360-1363 [doi]
- cilantro: A Lean, Versatile, and Efficient Library for Point Cloud Data ProcessingKonstantinos Zampogiannis, Cornelia Fermüller, Yiannis Aloimonos. 1364-1367 [doi]
- Web-Based Configurable Image AnnotationsMatthieu Pizenberg, Axel Carlier, Emmanuel Faure, Vincent Charvillat. 1368-1371 [doi]
- Only Learn One Sample: Fine-Grained Visual Categorization with One Sample TrainingXiangteng He, Yuxin Peng. 1372-1380 [doi]
- LA-Net: Layout-Aware Dense Network for Monocular Depth EstimationKecheng Zheng, Zheng-Jun Zha, Yang Cao, Xuejin Chen, Feng Wu. 1381-1388 [doi]
- Robustness and Discrimination Oriented Hashing Combining Texture and Invariant Vector DistanceZiqing Huang, Shiguang Liu. 1389-1397 [doi]
- Joint Global and Co-Attentive Representation Learning for Image-Sentence RetrievalShuhui Wang, Yangyu Chen, Junbao Zhuo, Qingming Huang, Qi Tian 0001. 1398-1406 [doi]
- Text-to-image Synthesis via Symmetrical Distillation NetworksMingkuan Yuan, Yuxin Peng. 1407-1415 [doi]
- Context-Aware Visual Policy Network for Sequence-Level Image CaptioningDaqing Liu, Zheng-Jun Zha, Hanwang Zhang, Yongdong Zhang, Feng Wu. 1416-1424 [doi]
- SibNet: Sibling Convolutional Encoder for Video CaptioningSheng Liu, Zhou Ren, Junsong Yuan. 1425-1434 [doi]
- Paragraph Generation Network with Visual Relationship DetectionWenbin Che, Xiaopeng Fan, Ruiqin Xiong, Debin Zhao. 1435-1443 [doi]
- AI + Multimedia Make Better Life?Wen-Huang Cheng, Jiaying Liu 0001, Mohan S. Kankanhalli, Abdulmotaleb El-Saddik, Benoit Huet. 1455-1456 [doi]
- Online Inter-Camera Trajectory Association Exploiting Person Re-Identification and Camera TopologyNa Jiang, Sichen Bai, Yue Xu, Chang Xing, Zhong Zhou, Wei Wu. 1457-1465 [doi]
- Learning Local Descriptors with Adversarial Enhancer from Volumetric Geometry PatchesJing Zhu, Yi Fang. 1466-1474 [doi]
- Context-Dependent Diffusion Network for Visual Relationship DetectionZhen Cui, Chunyan Xu, Wenming Zheng, Jian Yang. 1475-1482 [doi]
- Connectionist Temporal Fusion for Sign Language TranslationShuo Wang, Dan Guo, Wen-gang Zhou, Zheng-Jun Zha, Meng Wang. 1483-1491 [doi]
- Support Neighbor Loss for Person Re-IdentificationKai Li 0012, Zhengming Ding, Kunpeng Li, Yulun Zhang, Yun Fu 0001. 1492-1500 [doi]
- Perceptual Temporal Incoherence Aware Stereo Video RetargetingBing Li, Chia-Wen Lin, Shan Liu, Tiejun Huang, Wen Gao 0001, C. C. Jay Kuo. 1501-1509 [doi]
- A Large-scale RGB-D Database for Arbitrary-view Human Action RecognitionYanli Ji, Feixiang Xu, Yang Yang, Fumin Shen, Heng Tao Shen, Wei-Shi Zheng. 1510-1518 [doi]
- Spotting and Aggregating Salient Regions for Video CaptioningHuiyun Wang, Youjiang Xu, Yahong Han. 1519-1526 [doi]
- Adaptive Temporal Encoding Network for Video Instance-level Human ParsingQixian Zhou, Xiaodan Liang, Ke Gong, Liang Lin. 1527-1535 [doi]
- User-Guided Deep Anime Line Art Colorization with Conditional Adversarial NetworksYuanzheng Ci, Xinzhu Ma, Zhihui Wang, Haojie Li, Zhongxuan Luo. 1536-1544 [doi]
- BitStream: Efficient Computing Architecture for Real-Time Low-Power Inference of Binary Neural Networks on CPUsTianli Zhao, Xiangyu He, Jian Cheng 0001, Jing Hu. 1545-1552 [doi]
- Attentive Crowd Flow MachinesLingbo Liu, Ruimao Zhang, Jiefeng Peng, Guanbin Li, Bowen Du, Liang Lin. 1553-1561 [doi]
- Video-based Person Re-identification via Self-Paced Learning and Deep Reinforcement Learning FrameworkDeqiang Ouyang, Jie Shao, Yonghui Zhang, Yang Yang, Heng Tao Shen. 1562-1570 [doi]
- Interpretable Multimodal Retrieval for Fashion ProductsLizi Liao, Xiangnan He 0001, Bo Zhao, Chong-Wah Ngo, Tat-Seng Chua. 1571-1579 [doi]
- Generating Defensive Plays in Basketball GamesChieh-Yu Chen, Wenze Lai, Hsin-Ying Hsieh, Wen-Hao Zheng, Yu-Shuen Wang, Jung-Hong Chuang. 1580-1588 [doi]
- Dense Auto-Encoder Hashing for Robust Cross-Modality RetrievalHong Liu 0009, Mingbao Lin, Shengchuan Zhang, Yongjian Wu, Feiyue Huang, Rongrong Ji. 1589-1597 [doi]
- Dance with Melody: An LSTM-autoencoder Approach to Music-oriented Dance SynthesisTaoran Tang, Jia Jia, Hanyang Mao. 1598-1606 [doi]
- Musicality-Novelty Generative Adversarial Nets for Algorithmic CompositionGong Chen, Yan Liu, Sheng-hua Zhong, Xiang Zhang. 1607-1615 [doi]
- Improving QoE of ABR Streaming Sessions through QUIC RetransmissionsDivyashri Bhat, Rajvardhan Somraj Deshmukh, Michael Zink. 1616-1624 [doi]
- From Data to Knowledge: Deep Learning Model Compression, Transmission and CommunicationZiqian Chen, Shiqi Wang, Dapeng Oliver Wu, Tiejun Huang, Ling-Yu Duan. 1625-1633 [doi]
- Living with AI in Connected Devices for valuable ExperienceGary Geunbae Lee. 1634 [doi]
- Supervised Online Hashing via Hadamard Codebook LearningMingbao Lin, Rongrong Ji, Hong Liu, Yongjian Wu. 1635-1643 [doi]
- Cascaded Feature Augmentation with Diffusion for Image RetrievalYuanqiang Fang, Wengang Zhou, Yijuan Lu, Jinhui Tang, Qi Tian 0001, Houqiang Li. 1644-1652 [doi]
- Deep Priority HashingZhangjie Cao, Ziping Sun, Mingsheng Long, Jianmin Wang 0001, Philip S. Yu. 1653-1661 [doi]
- Fast Discrete Cross-modal Hashing With Regressing From Semantic LabelsXingbo Liu, Xiushan Nie, Wenjun Zeng, Chaoran Cui, Lei Zhu, Yilong Yin. 1662-1669 [doi]
- ModaNet: A Large-scale Street Fashion Dataset with Polygon AnnotationsShuai Zheng, Fan Yang, M. Hadi Kiapour, Robinson Piramuthu. 1670-1678 [doi]
- SLIONS: A Karaoke Application to Enhance Foreign Language LearningDania Murad, Riwu Wang, Douglas Turnbull, Ye Wang. 1679-1687 [doi]
- Context-Aware Unsupervised Text StylizationShuai Yang, Jiaying Liu 0001, Wenhan Yang, Zongming Guo. 1688-1696 [doi]
- Songle Sync: A Large-Scale Web-based Platform for Controlling Various Devices in Synchronization with MusicJun Kato 0001, Masa Ogata, Takahiro Inoue, Masataka Goto. 1697-1705 [doi]
- Fine-Grained Grocery Product Recognition by One-Shot LearningWeidong Geng, Feilin Han, Jiangke Lin, Liuyi Zhu, Jieming Bai, Suzhen Wang, Lin He, Qiang Xiao, Zhangjiong Lai. 1706-1714 [doi]
- Reconfigurable Inverted IndexYusuke Matsui, Ryota Hinami, Shin'ichi Satoh. 1715-1723 [doi]
- Robust Billboard-based, Free-viewpoint Video Synthesis Algorithm to Overcome Occlusions under Challenging Outdoor Sport ScenesHiroshi Sankoh, Sei Naito, Keisuke Nonaka, Houari Sabirin, Jun Chen. 1724-1732 [doi]
- iHuman3D: Intelligent Human Body 3D Reconstruction using a Single Flying CameraWei Cheng, Lan Xu, Lei Han, Yuanfang Guo, Lu Fang. 1733-1741 [doi]
- Examine before You Answer: Multi-task Learning with Adaptive-attentions for Multiple-choice VQALianli Gao, Pengpeng Zeng, Jingkuan Song, Xianglong Liu, Heng Tao Shen. 1742-1750 [doi]
- Residual-Guide Network for Single Image DerainingZhiwen Fan, Huafeng Wu, Xueyang Fu, Yue Huang 0001, Xinghao Ding. 1751-1759 [doi]
- From Volcano to Toyshop: Adaptive Discriminative Region Discovery for Scene RecognitionZhengyu Zhao, Martha Larson. 1760-1768 [doi]
- The Effect of Foveation on High Dynamic Range Video PerceptionJoshua Sowerby, Yang Zhang 0003, Dimitris Agrafiotis. 1769-1776 [doi]
- An Efficient Deep Quantized Compressed Sensing Coding Framework of Natural ImagesWenxue Cui, Feng Jiang, Xinwei Gao, Shengping Zhang, Debin Zhao. 1777-1785 [doi]
- PoB: Toward Reasoning Patterns of Beauty in Image DataDiep Thi Ngoc Nguyen, Hideki Nakayama, Naoaki Okazaki, Tatsuya Sakaeda. 1786-1793 [doi]
- Partial Multi-view Subspace ClusteringNan Xu, Yanqing Guo, Xin Zheng, Qianyu Wang, Xiangyang Luo. 1794-1801 [doi]
- Pseudo Transfer with Marginalized Corrupted Attribute for Zero-shot LearningTeng Long, Xing Xu, Youyou Li, Fumin Shen, Jingkuan Song, Heng Tao Shen. 1802-1810 [doi]
- Semi-Supervised DFF: Decoupling Detection and Feature Flow for Video Object DetectorsGuangxing Han, Xuan Zhang, Chongrong Li. 1811-1819 [doi]
- Unsupervised Learning of 3D Model Reconstruction from Hand-Drawn SketchesLingjing Wang, Cheng Qian, Jifei Wang, Yi Fang. 1820-1828 [doi]
- Deep Adaptive Temporal Pooling for Activity RecognitionSibo Song, Ngai-Man Cheung, Vijay Chandrasekhar, Bappaditya Mandal. 1829-1837 [doi]
- Person Re-identification with Hierarchical Deep Learning Feature and efficient XQDA MetricMingyong Zeng, Chang Tian, Zemin Wu. 1838-1846 [doi]
- Cumulative Nets for Edge DetectionJingkuan Song, Zhilong Zhou, Lianli Gao, Xing Xu, Heng Tao Shen. 1847-1855 [doi]
- Webly Supervised Joint Embedding for Cross-Modal Image-Text RetrievalNiluthpol Chowdhury Mithun, Rameswar Panda, Evangelos E. Papalexakis, Amit K. Roy Chowdhury. 1856-1864 [doi]
- Multi-modal Preference Modeling for Product SearchYangyang Guo, Zhiyong Cheng, Liqiang Nie, Xin-Shun Xu, Mohan S. Kankanhalli. 1865-1873 [doi]
- Learning Joint Multimodal Representation with Adversarial Attention NetworksFeiran Huang, Xiaoming Zhang, Zhoujun Li. 1874-1882 [doi]
- Dest-ResNet: A Deep Spatiotemporal Residual Network for Hotspot Traffic Speed PredictionBinbing Liao, Jingqing Zhang, Ming Cai, Siliang Tang, YiFan Gao, Chao Wu, ShengWen Yang, Wenwu Zhu 0001, Yike Guo, Fei Wu. 1883-1891 [doi]
- Learning and Fusing Multimodal Deep Features for Acoustic Scene CategorizationYifang Yin, Rajiv Ratn Shah, Roger Zimmermann. 1892-1900 [doi]
- Dynamic Sound Field Synthesis for Speech and Music OptimizationZhenyu Tang, Nicolás Morales, Dinesh Manocha. 1901-1909 [doi]
- DASH for 3D Networked Virtual EnvironmentThomas Forgione, Axel Carlier, Géraldine Morin, Wei Tsang Ooi, Vincent Charvillat, Praveen Kumar Yadav. 1910-1918 [doi]
- Transforming Retailing Experiences with Artificial IntelligenceBowen Zhou. 1919-1920 [doi]
- Learning Collaborative Generation Correction Modules for Blind Image Deblurring and BeyondRisheng Liu, Yi He, Shichao Cheng, Xin Fan, Zhongxuan Luo. 1921-1929 [doi]
- When Deep Fool Meets Deep Prior: Adversarial Attack on Super-Resolution NetworkMinghao Yin, Yongbing Zhang, Xiu Li, Shiqi Wang. 1930-1938 [doi]
- Semantic Image Inpainting with Progressive Generative NetworksHaoran Zhang, Zhenzhen Hu, Changzhi Luo, Wangmeng Zuo, Meng Wang. 1939-1947 [doi]
- Structural inpaintingHuy V. Vo, Ngoc Q. K. Duong, Patrick Pérez. 1948-1956 [doi]
- Fluid Annotation: A Human-Machine Collaboration Interface for Full Image AnnotationMykhaylo Andriluka, Jasper R. R. Uijlings, Vittorio Ferrari. 1957-1966 [doi]
- Images2Poem: Generating Chinese Poetry from Image StreamsLixin Liu, Xiaojun Wan 0001, Zongming Guo. 1967-1975 [doi]
- Harnessing AI for Speech Reconstruction using Multi-view Silent Video FeedYaman Kumar, Mayank Aggarwal, Pratham Nawal, Shin'ichi Satoh, Rajiv Ratn Shah, Roger Zimmermann. 1976-1983 [doi]
- ALERT: Adding a Secure Layer in Decision Support for Advanced Driver Assistance System (ADAS)Kanchan Bahirat, Umang Shah, Alvaro A. Cárdenas, Balakrishnan Prabhakaran. 1984-1992 [doi]
- Cross-Modal Health State EstimationNitish Nag, Vaibhav Pandey, Preston J. Putzel, Hari Bhimaraju, Srikanth Krishnan, Ramesh Jain. 1993-2002 [doi]
- An Effective Text-based Characterization Combined with Numerical Features for Social Media Headline PredictionLiuwu Li, Sihong Huang, Ziliang He, Wenyin Liu. 2003-2007 [doi]
- An Iterative Refinement Approach for Social Media Headline PredictionChih-Chung Hsu, Chia-Yen Lee, Ting-Xuan Liao, Jun-Yi Lee, Tsai-Yne Hou, Ying-Chu Kuo, Jing-Wen Lin, Ching-Yi Hsueh, Zhong-Xuan Zhang, Hsiang-Chin Chien. 2008-2012 [doi]
- Random Forest Exploiting Post-related and User-related Features for Social Media Popularity PredictionFeitao Huang, Junhong Chen, Zehang Lin, Peipei Kang, Zhenguo Yang. 2013-2017 [doi]
- Content-Based Video Relevance Prediction with Second-Order Relevance and Attention ModelingXusong Chen, Rui Zhao, Shengjie Ma, Dong Liu, Zheng-Jun Zha. 2018-2022 [doi]
- Fine-Grained Representation Learning and Recognition by Exploiting Hierarchical Semantic EmbeddingTianshui Chen, Wenxi Wu, Yuefang Gao, Le Dong, Xiaonan Luo, Liang Lin. 2023-2031 [doi]
- Dissimilarity Representation Learning for Generalized Zero-Shot RecognitionGang Yang, Jinlu Liu, Jieping Xu, Xirong Li. 2032-2039 [doi]
- Attribute-Aware Attention Model for Fine-grained Representation LearningKai Han, Jianyuan Guo, Chao Zhang, Mingjian Zhu. 2040-2048 [doi]
- GNAS: A Greedy Neural Architecture Search Method for Multi-Attribute LearningSiyu Huang, Xi Li, Zhiqi Cheng, Zhongfei Zhang, Alexander G. Hauptmann. 2049-2057 [doi]
- Feature Re-Learning with Data Augmentation for Content-based Video RecommendationJianfeng Dong, Xirong Li, Chaoxi Xu, Gang Yang, Xun Wang. 2058-2062 [doi]
- Beauty Product Image Retrieval Based on Multi-Feature Fusion and Feature AggregationQi Wang, Jingxiang Lai, Kai Xu 0010, Wenyin Liu, Liang Lei. 2063-2067 [doi]
- Unprecedented Usage of Pre-trained CNNs on Beauty ProductJian Han Lim, Nurul Japar, Chun Chet Ng, Chee Seng Chan. 2068-2072 [doi]
- Regional Maximum Activations of Convolutions with Attention for Cross-domain Beauty and Personal Care Product RetrievalZehang Lin, Zhenguo Yang, Feitao Huang, Junhong Chen. 2073-2077 [doi]
- Shadow Calligraphy of Dance: An Image-Based Interactive Installation for Capturing Flowing Human FiguresLyn Chao-ling Chen, He-Lin Luo. 2078-2080 [doi]
- Cellular Music: An Interactive Game of Life SequencerAnis Haron, Soon Xuan Yong, Wong Chee Onn. 2081-2083 [doi]
- TAGapp Visualization: An Application Based Visual Art InstallationSoon Xuan Yong, Wong Chee Onn, Kong Cheng Tan, Anis Haron. 2084-2086 [doi]
- Similarity-Based Processing of Motion Capture DataJan Sedmidubský, Pavel Zezula. 2087-2089 [doi]
- Structured Deep Learning for Pixel-level UnderstandingYunchao Wei, Xiaodan Liang, Si Liu, Liang Lin. 2090-2092 [doi]
- Social and Political Event Analysis based on Rich MediaJungseock Joo, Zachary C. Steinert-Threlkeld, Jiebo Luo. 2093-2095 [doi]
- To Recognize Families In the Wild: A Machine Vision TutorialJoseph P. Robinson, Ming Shao, Yun Fu. 2096-2097 [doi]
- Deep Learning InterpretationJitao Sang. 2098-2100 [doi]
- Interactive Video Search: Where is the User in the Age of Deep Learning?Klaus Schoeffmann, Werner Bailer, Cathal Gurrin, George Awad, Jakub Lokoc. 2101-2103 [doi]
- Human Behavior Understanding: From Action Recognition to Complex Event DetectionTing Yao, Jingen Liu. 2104-2105 [doi]
- The Importance of Medical MultimediaMichael Riegler, Pål Halvorsen, Bernd Münzer, Klaus Schoeffmann. 2106-2108 [doi]
- AltMM 2018 - 3rd International Workshop on Multimedia Alternate RealitiesTeresa Chambel, Francesca De Simone, Rene Kaiser, Nimesha Ranasinghe, Wendy Van den Broeck. 2109-2110 [doi]
- Summary for AVEC 2018: Bipolar Disorder and Cross-Cultural Affect RecognitionFabien Ringeval, Björn W. Schuller, Michel F. Valstar, Roddy Cowie, Maja Pantic. 2111-2112 [doi]
- CoVieW'18: The 1st Workshop and Challenge on Comprehensive Video Understanding in the WildKwanghoon Sohn, Ming-Hsuan Yang 0001, Hyeran Byun, Jongwoo Lim, Jison Hsu, Stephen Lin, Euntai Kim, Seungryong Kim. 2113-2115 [doi]
- HealthMedia 2018: Third International Workshop on Multimedia for Personal Health and Health CareJochen Meyer, Susanne Boll, Noel E. O'Connor, Ramesh Jain, Troy L. McDaniel. 2116-2117 [doi]
- MAHCI 2018: The 1st Workshop on Multimedia for Accessible Human Computer InterfaceXueliang Liu, Rui Min, Benoit Huet, Jia Jia. 2118-2119 [doi]
- ASMMC-MMAC 2018: The Joint Workshop of 4th the Workshop on Affective Social Multimedia Computing and first Multi-Modal Affective Computing of Large-Scale Multimedia Data WorkshopDong-Yan Huang, Sicheng Zhao, Björn W. Schuller, Hongxun Yao, Jianhua Tao, Min Xu, Lei Xie, Qingming Huang, Jie Yang. 2120-2121 [doi]
- AVSU: Workshop on Audio-Visual Scene Understanding for Immersive MultimediaAdrian Hilton, Hong-Goo Kang, Hansung Kim, Kwanghoon Sohn. 2122-2124 [doi]
- st ACM International Workshop on Multimedia Content Analysis in SportsRainer Lienhart, Thomas B. Moeslund, Hideo Saito. 2125-2126 [doi]
- EE-USAD: ACM MM 2018Workshop on UnderstandingSubjective Attributes of Data focus on Evoked EmotionsXavier Alameda-Pineda, Miriam Redi, Nicu Sebe, Shih-Fu Chang, Jiebo Luo. 2127-2128 [doi]