Han Hu 0001
Person information
- affiliation: Microsoft Research Asia, Beijing, China
- affiliation (PhD 2014): Tsinghua University, Department of Automation, Tsinghua National Laboratory for Information Science and Technology, Beijing, China
Other persons with the same name
- Han Hu — disambiguation page
- Han Hu 0002 — Zhejiang University, State Key Laboratory of Plant Physiology and Biochemistry, China
- Han Hu 0003 — Beijing Institute of Technology, School of Information and Electronics, China (and 3 more)
- Han Hu 0005 — Southwest Jiaotong University, Chengdu, China
- Han Hu 0006 — Nanjing University of Posts and Telecommunications, Nanjing, China
- Han Hu 0007 — New Jersey Institute of Technology, Newark, NJ, USA
Other persons with a similar name
- Hu Han 0001 — Chinese Academy of Sciences, Institute of Computing Technology, Beijing, China
- Hu Han 0002 — Lanzhou Jiaotong University, Lanzhou, China
- Hu Han 0004 — Dalian University of Technology, Dalian, China
- Chih-Han Hu
- Ching-Han Hu
- Han-Wen Hu
- Han-fen Hu
- Ya-Han Hu
- Yi-Han Hu
- Yu-Han Hu
2020 – today
- 2025
- [j8]Zhen Xing, Qijun Feng, Haoran Chen, Qi Dai, Han Hu, Hang Xu, Zuxuan Wu, Yu-Gang Jiang:
A Survey on Video Diffusion Models. ACM Comput. Surv. 57(2): 41:1-41:42 (2025) - 2024
- [j7]Yuhui Yuan, Weicong Liang, Henghui Ding, Zhanhao Liang, Chao Zhang, Han Hu:
Expediting Large-Scale Vision Transformer for Dense Prediction Without Fine-Tuning. IEEE Trans. Pattern Anal. Mach. Intell. 46(1): 250-266 (2024) - [c80]Ziwei Liao, Jialiang Zhu, Chunyu Wang, Han Hu, Steven L. Waslander:
Multiple View Geometry Transformers for 3D Human Pose Estimation. CVPR 2024: 708-717 - [c79]Zhen Xing, Qi Dai, Han Hu, Zuxuan Wu, Yu-Gang Jiang:
SimDA: Simple Diffusion Adapter for Efficient Video Generation. CVPR 2024: 7827-7839 - [c78]Shuyuan Tu, Qi Dai, Zhi-Qi Cheng, Han Hu, Xintong Han, Zuxuan Wu, Yu-Gang Jiang:
MotionEditor: Editing Video Motion via Content-Aware Diffusion. CVPR 2024: 7882-7891 - [c77]Zigang Geng, Binxin Yang, Tiankai Hang, Chen Li, Shuyang Gu, Ting Zhang, Jianmin Bao, Zheng Zhang, Houqiang Li, Han Hu, Dong Chen, Baining Guo:
InstructDiffusion: A Generalist Modeling Interface for Vision Tasks. CVPR 2024: 12709-12720 - [c76]Xiaoke Huang, Jianfeng Wang, Yansong Tang, Zheng Zhang, Han Hu, Jiwen Lu, Lijuan Wang, Zicheng Liu:
Segment and Caption Anything. CVPR 2024: 13405-13417 - [c75]Ruihang Li, Yixuan Wei, Miaosen Zhang, Nenghai Yu, Han Hu, Houwen Peng:
ScalingFilter: Assessing Data Quality through Inverse Utilization of Scaling Laws. EMNLP 2024: 3209-3222 - [c74]Yichao Shen, Zigang Geng, Yuhui Yuan, Yutong Lin, Ze Liu, Chunyu Wang, Han Hu, Nanning Zheng, Baining Guo:
V-DETR: DETR with Vertex Relative Position Encoding for 3D Object Detection. ICLR 2024 - [c73]Tianyu He, Junliang Guo, Runyi Yu, Yuchi Wang, Jialiang Zhu, Kaikai An, Leyi Li, Xu Tan, Chunyu Wang, Han Hu, HsiangTao Wu, Sheng Zhao, Jiang Bian:
GAIA: Zero-shot Talking Avatar Generation. ICLR 2024 - [c72]Zhiwei Hao, Jianyuan Guo, Chengcheng Wang, Yehui Tang, Han Wu, Han Hu, Kai Han, Chang Xu:
Data-efficient Large Vision Models through Sequential Autoregression. ICML 2024 - [c71]Haojun Yu, Di Dai, Ziwei Zhao, Di He, Han Hu, Liwei Wang:
LarvSeg: Exploring Image Classification Data for Large Vocabulary Semantic Segmentation via Category-Wise Attentive Classifier. PRCV (1) 2024: 50-64 - [c70]Jialiang Zhu, Danqing Huang, Chunyu Wang, Mingxi Cheng, Ji Li, Han Hu, Xin Geng, Baining Guo:
Unsupervised Graphic Layout Grouping with Transformers. WACV 2024: 1020-1029 - [i81]Jianyuan Guo, Zhiwei Hao, Chengcheng Wang, Yehui Tang, Han Wu, Han Hu, Kai Han, Chang Xu:
Data-efficient Large Vision Models through Sequential Autoregression. CoRR abs/2402.04841 (2024) - [i80]Chen Li, Weiqi Wang, Jingcheng Hu, Yixuan Wei, Nanning Zheng, Han Hu, Zheng Zhang, Houwen Peng:
Common 7B Language Models Already Possess Strong Math Capabilities. CoRR abs/2403.04706 (2024) - [i79]Bolin Ni, Jingcheng Hu, Yixuan Wei, Houwen Peng, Zheng Zhang, Gaofeng Meng, Han Hu:
Xwin-LM: Strong and Scalable Alignment Practice for LLMs. CoRR abs/2405.20335 (2024) - [i78]Ruihang Li, Yixuan Wei, Miaosen Zhang, Nenghai Yu, Han Hu, Houwen Peng:
ScalingFilter: Assessing Data Quality through Inverse Utilization of Scaling Laws. CoRR abs/2408.08310 (2024) - 2023
- [j6]Yue Cao, Jiarui Xu, Stephen Lin, Fangyun Wei, Han Hu:
Global Context Networks. IEEE Trans. Pattern Anal. Mach. Intell. 45(6): 6881-6895 (2023) - [j5]Mengde Xu, Zheng Zhang, Fangyun Wei, Han Hu, Xiang Bai:
SAN: Side Adapter Network for Open-Vocabulary Semantic Segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 45(12): 15546-15561 (2023) - [c69]Jindong Gu, Fangyun Wei, Philip H. S. Torr, Han Hu:
Exploring Non-additive Randomness on ViT against Query-Based Black-Box Attacks. BMVC 2023: 406-408 - [c68]Zigang Geng, Chunyu Wang, Yixuan Wei, Ze Liu, Houqiang Li, Han Hu:
Human Pose as Compositional Tokens. CVPR 2023: 660-671 - [c67]Yixuan Wei, Yue Cao, Zheng Zhang, Houwen Peng, Zhuliang Yao, Zhenda Xie, Han Hu, Baining Guo:
iCLIP: Bridging Image Classification and Contrastive Language-Image Pre-training for Visual Recognition. CVPR 2023: 2776-2786 - [c66]Mengde Xu, Zheng Zhang, Fangyun Wei, Han Hu, Xiang Bai:
Side Adapter Network for Open-Vocabulary Semantic Segmentation. CVPR 2023: 2945-2954 - [c65]Sucheng Ren, Fangyun Wei, Zheng Zhang, Han Hu:
TinyMIM: An Empirical Study of Distilling MIM Pre-trained Models. CVPR 2023: 3687-3697 - [c64]Zhenda Xie, Zheng Zhang, Yue Cao, Yutong Lin, Yixuan Wei, Qi Dai, Han Hu:
On Data Scaling in Masked Image Modeling. CVPR 2023: 10365-10374 - [c63]Xinyu Liu, Houwen Peng, Ningxin Zheng, Yuqing Yang, Han Hu, Yixuan Yuan:
EfficientViT: Memory Efficient Vision Transformer with Cascaded Group Attention. CVPR 2023: 14420-14430 - [c62]Zhenda Xie, Zigang Geng, Jingcheng Hu, Zheng Zhang, Han Hu, Yue Cao:
Revealing the Dark Secrets of Masked Image Modeling. CVPR 2023: 14475-14485 - [c61]Zhen Xing, Qi Dai, Han Hu, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang:
SVFormer: Semi-supervised Video Transformer for Action Recognition. CVPR 2023: 18816-18826 - [c60]Ding Jia, Yuhui Yuan, Haodi He, Xiaopei Wu, Haojun Yu, Weihong Lin, Lei Sun, Chao Zhang, Han Hu:
DETRs with Hybrid Matching. CVPR 2023: 19702-19712 - [c59]Rui Tian, Zuxuan Wu, Qi Dai, Han Hu, Yu Qiao, Yu-Gang Jiang:
ResFormer: Scaling ViTs with Multi-Resolution Training. CVPR 2023: 22721-22731 - [c58]Yifan Yang, Weiquan Huang, Yixuan Wei, Houwen Peng, Xinyang Jiang, Huiqiang Jiang, Fangyun Wei, Yin Wang, Han Hu, Lili Qiu, Yuqing Yang:
Attentive Mask CLIP. ICCV 2023: 2759-2769 - [c57]Xin Lai, Yuhui Yuan, Ruihang Chu, Yukang Chen, Han Hu, Jiaya Jia:
Mask-Attention-Free Transformer for 3D Instance Segmentation. ICCV 2023: 3670-3680 - [c56]Yixuan Wei, Han Hu, Zhenda Xie, Ze Liu, Zheng Zhang, Yue Cao, Jianmin Bao, Dong Chen, Baining Guo:
Improving CLIP Fine-tuning Performance. ICCV 2023: 5416-5426 - [c55]Yutong Lin, Yuhui Yuan, Zheng Zhang, Chen Li, Nanning Zheng, Han Hu:
DETR Does Not Need Multi-Scale or Locality Design. ICCV 2023: 6522-6531 - [c54]Tiankai Hang, Shuyang Gu, Chen Li, Jianmin Bao, Dong Chen, Han Hu, Xin Geng, Baining Guo:
Efficient Diffusion Training via Min-SNR Weighting Strategy. ICCV 2023: 7407-7417 - [c53]Jia Ning, Chen Li, Zheng Zhang, Chunyu Wang, Zigang Geng, Qi Dai, Kun He, Han Hu:
All in Tokens: Unifying Output Space of Visual Tasks via Soft Token. ICCV 2023: 19843-19853 - [c52]Shuyuan Tu, Qi Dai, Zuxuan Wu, Zhi-Qi Cheng, Han Hu, Yu-Gang Jiang:
Implicit Temporal Modeling with Learnable Alignment for Video Recognition. ICCV 2023: 19879-19890 - [c51]Kan Wu, Houwen Peng, Zhenghong Zhou, Bin Xiao, Mengchen Liu, Lu Yuan, Hong Xuan, Michael Valenzuela, Xi Stephen Chen, Xinggang Wang, Hongyang Chao, Han Hu:
TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance. ICCV 2023: 21913-21923 - [c50]Zhihang Zhong, Mingxi Cheng, Zhirong Wu, Yuhui Yuan, Yinqiang Zheng, Ji Li, Han Hu, Stephen Lin, Yoichi Sato, Imari Sato:
ClipCrop: Conditioned Cropping Driven by Vision-Language Model. ICCV (Workshops) 2023: 294-304 - [c49]Changho Hwang, Wei Cui, Yifan Xiong, Ziyue Yang, Ze Liu, Han Hu, Zilong Wang, Rafael Salas, Jithin Jose, Prabhat Ram, HoYuen Chau, Peng Cheng, Fan Yang, Mao Yang, Yongqiang Xiong:
Tutel: Adaptive Mixture-of-Experts at Scale. MLSys 2023 - [c48]Zhiwei Hao, Jianyuan Guo, Kai Han, Han Hu, Chang Xu, Yunhe Wang:
Revisit the Power of Vanilla Knowledge Distillation: from Small Scale to Large Scale. NeurIPS 2023 - [c47]Zhiwei Hao, Jianyuan Guo, Kai Han, Yehui Tang, Han Hu, Yunhe Wang, Chang Xu:
One-for-All: Bridge the Gap Between Heterogeneous Architectures in Knowledge Distillation. NeurIPS 2023 - [c46]Yifan Pu, Weicong Liang, Yiduo Hao, Yuhui Yuan, Yukang Yang, Chao Zhang, Han Hu, Gao Huang:
Rank-DETR for High Quality Object Detection. NeurIPS 2023 - [c45]Yasheng Sun, Yifan Yang, Houwen Peng, Yifei Shen, Yuqing Yang, Han Hu, Lili Qiu, Hideki Koike:
ImageBrush: Learning Visual In-Context Instructions for Exemplar-Based Image Manipulation. NeurIPS 2023 - [c44]Yukang Yang, Dongnan Gui, Yuhui Yuan, Weicong Liang, Haisong Ding, Han Hu, Kai Chen:
GlyphControl: Glyph Conditional Control for Visual Text Generation. NeurIPS 2023 - [i77]Sucheng Ren, Fangyun Wei, Zheng Zhang, Han Hu:
TinyMIM: An Empirical Study of Distilling MIM Pre-trained Models. CoRR abs/2301.01296 (2023) - [i76]Jia Ning, Chen Li, Zheng Zhang, Zigang Geng, Qi Dai, Kun He, Han Hu:
All in Tokens: Unifying Output Space of Visual Tasks via Soft Token. CoRR abs/2301.02229 (2023) - [i75]Mengde Xu, Zheng Zhang, Fangyun Wei, Han Hu, Xiang Bai:
Side Adapter Network for Open-Vocabulary Semantic Segmentation. CoRR abs/2302.12242 (2023) - [i74]Sucheng Ren, Fangyun Wei, Samuel Albanie, Zheng Zhang, Han Hu:
DeepMIM: Deep Supervision for Masked Image Modeling. CoRR abs/2303.08817 (2023) - [i73]Tiankai Hang, Shuyang Gu, Chen Li, Jianmin Bao, Dong Chen, Han Hu, Xin Geng, Baining Guo:
Efficient Diffusion Training via Min-SNR Weighting Strategy. CoRR abs/2303.09556 (2023) - [i72]Zigang Geng, Chunyu Wang, Yixuan Wei, Ze Liu, Houqiang Li, Han Hu:
Human Pose as Compositional Tokens. CoRR abs/2303.11638 (2023) - [i71]Shuyuan Tu, Qi Dai, Zuxuan Wu, Zhi-Qi Cheng, Han Hu, Yu-Gang Jiang:
Implicit Temporal Modeling with Learnable Alignment for Video Recognition. CoRR abs/2304.10465 (2023) - [i70]Xinyu Liu, Houwen Peng, Ningxin Zheng, Yuqing Yang, Han Hu, Yixuan Yuan:
EfficientViT: Memory Efficient Vision Transformer with Cascaded Group Attention. CoRR abs/2305.07027 (2023) - [i69]Zhiwei Hao, Jianyuan Guo, Kai Han, Han Hu, Chang Xu, Yunhe Wang:
VanillaKD: Revisit the Power of Vanilla Knowledge Distillation from Small Scale to Large Scale. CoRR abs/2305.15781 (2023) - [i68]Yukang Yang, Dongnan Gui, Yuhui Yuan, Haisong Ding, Han Hu, Kai Chen:
GlyphControl: Glyph Conditional Control for Visual Text Generation. CoRR abs/2305.18259 (2023) - [i67]Yasheng Sun, Yifan Yang, Houwen Peng, Yifei Shen, Yuqing Yang, Han Hu, Lili Qiu, Hideki Koike:
ImageBrush: Learning Visual In-Context Instructions for Exemplar-Based Image Manipulation. CoRR abs/2308.00906 (2023) - [i66]Yutong Lin, Yuhui Yuan, Zheng Zhang, Chen Li, Nanning Zheng, Han Hu:
DETR Doesn't Need Multi-Scale or Locality Design. CoRR abs/2308.01904 (2023) - [i65]Yichao Shen, Zigang Geng, Yuhui Yuan, Yutong Lin, Ze Liu, Chunyu Wang, Han Hu, Nanning Zheng, Baining Guo:
V-DETR: DETR with Vertex Relative Position Encoding for 3D Object Detection. CoRR abs/2308.04409 (2023) - [i64]Zhen Xing, Qi Dai, Han Hu, Zuxuan Wu, Yu-Gang Jiang:
SimDA: Simple Diffusion Adapter for Efficient Video Generation. CoRR abs/2308.09710 (2023) - [i63]Xin Lai, Yuhui Yuan, Ruihang Chu, Yukang Chen, Han Hu, Jiaya Jia:
Mask-Attention-Free Transformer for 3D Instance Segmentation. CoRR abs/2309.01692 (2023) - [i62]Zigang Geng, Binxin Yang, Tiankai Hang, Chen Li, Shuyang Gu, Ting Zhang, Jianmin Bao, Zheng Zhang, Han Hu, Dong Chen, Baining Guo:
InstructDiffusion: A Generalist Modeling Interface for Vision Tasks. CoRR abs/2309.03895 (2023) - [i61]Jindong Gu, Fangyun Wei, Philip H. S. Torr, Han Hu:
Exploring Non-additive Randomness on ViT against Query-Based Black-Box Attacks. CoRR abs/2309.06438 (2023) - [i60]Kan Wu, Houwen Peng, Zhenghong Zhou, Bin Xiao, Mengchen Liu, Lu Yuan, Hong Xuan, Michael Valenzuela, Xi Chen, Xinggang Wang, Hongyang Chao, Han Hu:
TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance. CoRR abs/2309.12314 (2023) - [i59]Yifan Pu, Weicong Liang, Yiduo Hao, Yuhui Yuan, Yukang Yang, Chao Zhang, Han Hu, Gao Huang:
Rank-DETR for High Quality Object Detection. CoRR abs/2310.08854 (2023) - [i58]Zhen Xing, Qijun Feng, Haoran Chen, Qi Dai, Han Hu, Hang Xu, Zuxuan Wu, Yu-Gang Jiang:
A Survey on Video Diffusion Models. CoRR abs/2310.10647 (2023) - [i57]Houwen Peng, Kan Wu, Yixuan Wei, Guoshuai Zhao, Yuxiang Yang, Ze Liu, Yifan Xiong, Ziyue Yang, Bolin Ni, Jingcheng Hu, Ruihang Li, Miaosen Zhang, Chen Li, Jia Ning, Ruizhe Wang, Zheng Zhang, Shuguang Liu, Joe Chau, Han Hu, Peng Cheng:
FP8-LM: Training FP8 Large Language Models. CoRR abs/2310.18313 (2023) - [i56]Zhiwei Hao, Jianyuan Guo, Kai Han, Yehui Tang, Han Hu, Yunhe Wang, Chang Xu:
One-for-All: Bridge the Gap Between Heterogeneous Architectures in Knowledge Distillation. CoRR abs/2310.19444 (2023) - [i55]Ziwei Liao, Jialiang Zhu, Chunyu Wang, Han Hu, Steven L. Waslander:
Multiple View Geometry Transformers for 3D Human Pose Estimation. CoRR abs/2311.10983 (2023) - [i54]Tianyu He, Junliang Guo, Runyi Yu, Yuchi Wang, Jialiang Zhu, Kaikai An, Leyi Li, Xu Tan, Chunyu Wang, Han Hu, HsiangTao Wu, Sheng Zhao, Jiang Bian:
GAIA: Zero-shot Talking Avatar Generation. CoRR abs/2311.15230 (2023) - [i53]Shuyuan Tu, Qi Dai, Zhi-Qi Cheng, Han Hu, Xintong Han, Zuxuan Wu, Yu-Gang Jiang:
MotionEditor: Editing Video Motion via Content-Aware Diffusion. CoRR abs/2311.18830 (2023) - [i52]Zhen Xing, Qi Dai, Zihao Zhang, Hui Zhang, Han Hu, Zuxuan Wu, Yu-Gang Jiang:
VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models. CoRR abs/2311.18837 (2023) - [i51]Xiaoke Huang, Jianfeng Wang, Yansong Tang, Zheng Zhang, Han Hu, Jiwen Lu, Lijuan Wang, Zicheng Liu:
Segment and Caption Anything. CoRR abs/2312.00869 (2023) - 2022
- [c43]Ze Liu, Jia Ning, Yue Cao, Yixuan Wei, Zheng Zhang, Stephen Lin, Han Hu:
Video Swin Transformer. CVPR 2022: 3192-3201 - [c42]Zhenda Xie, Zheng Zhang, Yue Cao, Yutong Lin, Jianmin Bao, Zhuliang Yao, Qi Dai, Han Hu:
SimMIM: a Simple Framework for Masked Image Modeling. CVPR 2022: 9643-9653 - [c41]Ze Liu, Han Hu, Yutong Lin, Zhuliang Yao, Zhenda Xie, Yixuan Wei, Jia Ning, Yue Cao, Zheng Zhang, Li Dong, Furu Wei, Baining Guo:
Swin Transformer V2: Scaling Up Capacity and Resolution. CVPR 2022: 11999-12009 - [c40]Yutong Lin, Chen Li, Yue Cao, Zheng Zhang, Jianfeng Wang, Lijuan Wang, Zicheng Liu, Han Hu:
A Simple Approach and Benchmark for 21,000-Category Object Detection. ECCV (11) 2022: 1-18 - [c39]Haodi He, Yuhui Yuan, Xiangyu Yue, Han Hu:
RankSeg: Adaptive Pixel Classification with Image Category Ranking for Segmentation. ECCV (29) 2022: 682-700 - [c38]Mengde Xu, Zheng Zhang, Fangyun Wei, Yutong Lin, Yue Cao, Han Hu, Xiang Bai:
A Simple Baseline for Open-Vocabulary Semantic Segmentation with Pre-trained Vision-Language Model. ECCV (29) 2022: 736-753 - [c37]Haohai Sun, Shangyi Geng, Jialun Zhong, Han Hu, Kun He:
Graph Hawkes Transformer for Extrapolated Reasoning on Temporal Knowledge Graphs. EMNLP 2022: 7481-7493 - [c36]Zhiwei Hao, Jianyuan Guo, Ding Jia, Kai Han, Yehui Tang, Chao Zhang, Han Hu, Yunhe Wang:
Learning Efficient Vision Transformers via Fine-Grained Manifold Distillation. NeurIPS 2022 - [c35]Weicong Liang, Yuhui Yuan, Henghui Ding, Xiao Luo, Weihong Lin, Ding Jia, Zheng Zhang, Chao Zhang, Han Hu:
Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuning. NeurIPS 2022 - [c34]Yutong Lin, Ze Liu, Zheng Zhang, Han Hu, Nanning Zheng, Stephen Lin, Yue Cao:
Could Giant Pre-trained Image Models Extract Universal Representations? NeurIPS 2022 - [i50]Haodi He, Yuhui Yuan, Xiangyu Yue, Han Hu:
MLSeg: Image and Video Segmentation as Multi-Label Classification and Selected-Label Pixel Classification. CoRR abs/2203.04187 (2022) - [i49]Jiequan Cui, Yuhui Yuan, Zhisheng Zhong, Zhuotao Tian, Han Hu, Stephen Lin, Jiaya Jia:
Region Rebalance for Long-Tailed Semantic Segmentation. CoRR abs/2204.01969 (2022) - [i48]Chao Li, Jia Ning, Han Hu, Kun He:
Enhancing the Robustness, Efficiency, and Diversity of Differentiable Architecture Search. CoRR abs/2204.04681 (2022) - [i47]Yixuan Wei, Yue Cao, Zheng Zhang, Zhuliang Yao, Zhenda Xie, Han Hu, Baining Guo:
iCAR: Bridging Image Classification and Image-text Alignment for Visual Recognition. CoRR abs/2204.10760 (2022) - [i46]Rui Tian, Zuxuan Wu, Qi Dai, Han Hu, Yu-Gang Jiang:
Deeper Insights into ViTs Robustness towards Common Corruptions. CoRR abs/2204.12143 (2022) - [i45]Zhenda Xie, Zigang Geng, Jingcheng Hu, Zheng Zhang, Han Hu, Yue Cao:
Revealing the Dark Secrets of Masked Image Modeling. CoRR abs/2205.13543 (2022) - [i44]Yixuan Wei, Han Hu, Zhenda Xie, Zheng Zhang, Yue Cao, Jianmin Bao, Dong Chen, Baining Guo:
Contrastive Learning Rivals Masked Image Modeling in Fine-tuning via Feature Distillation. CoRR abs/2205.14141 (2022) - [i43]Changho Hwang, Wei Cui, Yifan Xiong, Ziyue Yang, Ze Liu, Han Hu, Zilong Wang, Rafael Salas, Jithin Jose, Prabhat Ram, Joe Chau, Peng Cheng, Fan Yang, Mao Yang, Yongqiang Xiong:
Tutel: Adaptive Mixture-of-Experts at Scale. CoRR abs/2206.03382 (2022) - [i42]Zhenda Xie, Zheng Zhang, Yue Cao, Yutong Lin, Yixuan Wei, Qi Dai, Han Hu:
On Data Scaling in Masked Image Modeling. CoRR abs/2206.04664 (2022) - [i41]Ding Jia, Yuhui Yuan, Haodi He, Xiaopei Wu, Haojun Yu, Weihong Lin, Lei Sun, Chao Zhang, Han Hu:
DETRs with Hybrid Matching. CoRR abs/2207.13080 (2022) - [i40]Weicong Liang, Yuhui Yuan, Henghui Ding, Xiao Luo, Weihong Lin, Ding Jia, Zheng Zhang, Chao Zhang, Han Hu:
Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuning. CoRR abs/2210.01035 (2022) - [i39]Yutong Lin, Ze Liu, Zheng Zhang, Han Hu, Nanning Zheng, Stephen Lin, Yue Cao:
Could Giant Pretrained Image Models Extract Universal Representations? CoRR abs/2211.02043 (2022) - [i38]Zhihang Zhong, Mingxi Cheng, Zhirong Wu, Yuhui Yuan, Yinqiang Zheng, Ji Li, Han Hu, Stephen Lin, Yoichi Sato, Imari Sato:
ClipCrop: Conditioned Cropping Driven by Vision-Language Model. CoRR abs/2211.11492 (2022) - [i37]Zixin Zhu, Yixuan Wei, Jianfeng Wang, Zhe Gan, Zheng Zhang, Le Wang, Gang Hua, Lijuan Wang, Zicheng Liu, Han Hu:
Exploring Discrete Diffusion Models for Image Captioning. CoRR abs/2211.11694 (2022) - [i36]Zhen Xing, Qi Dai, Han Hu, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang:
SVFormer: Semi-supervised Video Transformer for Action Recognition. CoRR abs/2211.13222 (2022) - [i35]Rui Tian, Zuxuan Wu, Qi Dai, Han Hu, Yu Qiao, Yu-Gang Jiang:
ResFormer: Scaling ViTs with Multi-Resolution Training. CoRR abs/2212.00776 (2022) - [i34]Yifan Yang, Weiquan Huang, Yixuan Wei, Houwen Peng, Xinyang Jiang, Huiqiang Jiang, Fangyun Wei, Yin Wang, Han Hu, Lili Qiu, Yuqing Yang:
Attentive Mask CLIP. CoRR abs/2212.08653 (2022) - 2021
- [c33]Xiaosen Wang, Jiadong Lin, Han Hu, Jingdong Wang, Kun He:
Boosting Adversarial Transferability through Enhanced Momentum. BMVC 2021: 272 - [c32]Jindong Gu, Volker Tresp, Han Hu:
Capsule Network Is Not More Robust Than Convolutional Network. CVPR 2021: 14309-14317 - [c31]Zhenda Xie, Yutong Lin, Zheng Zhang, Yue Cao, Stephen Lin, Han Hu:
Propagate Yourself: Exploring Pixel-Level Consistency for Unsupervised Visual Representation Learning. CVPR 2021: 16684-16693 - [c30]Ze Liu, Zheng Zhang, Yue Cao, Han Hu, Xin Tong:
Group-Free 3D Object Detection via Transformers. ICCV 2021: 2929-2938 - [c29]Mengde Xu, Zheng Zhang, Han Hu, Jianfeng Wang, Lijuan Wang, Fangyun Wei, Xiang Bai, Zicheng Liu:
End-to-End Semi-Supervised Object Detection with Soft Teacher. ICCV 2021: 3040-3049 - [c28]Ze Liu, Yutong Lin, Yue Cao, Han Hu, Yixuan Wei, Zheng Zhang, Stephen Lin, Baining Guo:
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows. ICCV 2021: 9992-10002 - [c27]Zhuliang Yao, Yue Cao, Yutong Lin, Ze Liu, Zheng Zhang, Han Hu:
Leveraging Batch Normalization for Vision Transformers. ICCVW 2021: 413-422 - [c26]Mengde Xu, Zheng Zhang, Fangyun Wei, Yutong Lin, Yue Cao, Stephen Lin, Han Hu, Xiang Bai:
Bootstrap Your Object Detector via Mixed Training. NeurIPS 2021: 11315-11325 - [c25]Hanzhe Hu, Fangyun Wei, Han Hu, Qiwei Ye, Jinshi Cui, Liwei Wang:
Semi-Supervised Semantic Segmentation via Adaptive Equalization Learning. NeurIPS 2021: 22106-22118 - [c24]Fangyun Wei, Yue Gao, Zhirong Wu, Han Hu, Stephen Lin:
Aligning Pretraining for Detection via Object-Level Contrastive Learning. NeurIPS 2021: 22682-22694 - [i33]Xiaosen Wang, Jiadong Lin, Han Hu, Jingdong Wang, Kun He:
Boosting Adversarial Transferability through Enhanced Momentum. CoRR abs/2103.10609 (2021) - [i32]Ze Liu, Yutong Lin, Yue Cao, Han Hu, Yixuan Wei, Zheng Zhang, Stephen Lin, Baining Guo:
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows. CoRR abs/2103.14030 (2021) - [i31]Jindong Gu, Volker Tresp, Han Hu:
Capsule Network is Not More Robust than Convolutional Network. CoRR abs/2103.15459 (2021) - [i30]Ze Liu, Zheng Zhang, Yue Cao, Han Hu, Xin Tong:
Group-Free 3D Object Detection via Transformers. CoRR abs/2104.00678 (2021) - [i29]Zhenda Xie, Yutong Lin, Zhuliang Yao, Zheng Zhang, Qi Dai, Yue Cao, Han Hu:
Self-Supervised Learning with Swin Transformers. CoRR abs/2105.04553 (2021) - [i28]Yansong Tang, Zhenyu Jiang, Zhenda Xie, Yue Cao, Zheng Zhang, Philip H. S. Torr, Han Hu:
Breaking Shortcut: Exploring Fully Convolutional Cycle-Consistency for Video Correspondence Learning. CoRR abs/2105.05838 (2021) - [i27]Fangyun Wei, Yue Gao, Zhirong Wu, Han Hu, Stephen Lin:
Aligning Pretraining for Detection via Object-Level Contrastive Learning. CoRR abs/2106.02637 (2021) - [i26]Mengde Xu, Zheng Zhang, Han Hu, Jianfeng Wang, Lijuan Wang, Fangyun Wei, Xiang Bai, Zicheng Liu:
End-to-End Semi-Supervised Object Detection with Soft Teacher. CoRR abs/2106.09018 (2021) - [i25]Ze Liu, Jia Ning, Yue Cao, Yixuan Wei, Zheng Zhang, Stephen Lin, Han Hu:
Video Swin Transformer. CoRR abs/2106.13230 (2021) - [i24]Hanzhe Hu, Fangyun Wei, Han Hu, Qiwei Ye, Jinshi Cui, Liwei Wang:
Semi-Supervised Semantic Segmentation via Adaptive Equalization Learning. CoRR abs/2110.05474 (2021) - [i23]Mengde Xu, Zheng Zhang, Fangyun Wei, Yutong Lin, Yue Cao, Stephen Lin, Han Hu, Xiang Bai:
Bootstrap Your Object Detector via Mixed Training. CoRR abs/2111.03056 (2021) - [i22]Ze Liu, Han Hu, Yutong Lin, Zhuliang Yao, Zhenda Xie, Yixuan Wei, Jia Ning, Yue Cao, Zheng Zhang, Li Dong, Furu Wei, Baining Guo:
Swin Transformer V2: Scaling Up Capacity and Resolution. CoRR abs/2111.09883 (2021) - [i21]Zhenda Xie, Zheng Zhang, Yue Cao, Yutong Lin, Jianmin Bao, Zhuliang Yao, Qi Dai, Han Hu:
SimMIM: A Simple Framework for Masked Image Modeling. CoRR abs/2111.09886 (2021) - [i20]Mengde Xu, Zheng Zhang, Fangyun Wei, Yutong Lin, Yue Cao, Han Hu, Xiang Bai:
A Simple Baseline for Zero-shot Semantic Segmentation with Pre-trained Vision-language Model. CoRR abs/2112.14757 (2021) - 2020
- [c23]Yihong Chen, Yue Cao, Han Hu, Liwei Wang:
Memory Enhanced Global-Local Aggregation for Video Object Detection. CVPR 2020: 10334-10343 - [c22]Minghao Yin, Zhuliang Yao, Yue Cao, Xiu Li, Zheng Zhang, Stephen Lin, Han Hu:
Disentangled Non-local Neural Networks. ECCV (15) 2020: 191-207 - [c21]Ze Yang, Yinghao Xu, Han Xue, Zheng Zhang, Raquel Urtasun, Liwei Wang, Stephen Lin, Han Hu:
Dense RepPoints: Representing Visual Objects with Dense Point Sets. ECCV (21) 2020: 227-244 - [c20]Ze Liu, Han Hu, Yue Cao, Zheng Zhang, Xin Tong:
A Closer Look at Local Aggregation Operators in Point Cloud Analysis. ECCV (23) 2020: 326-342 - [c19]Bin Liu, Yue Cao, Yutong Lin, Qi Li, Zheng Zhang, Mingsheng Long, Han Hu:
Negative Margin Matters: Understanding Margin in Few-Shot Classification. ECCV (4) 2020: 438-455 - [c18]Yue Cao, Zhenda Xie, Bin Liu, Yutong Lin, Zheng Zhang, Han Hu:
Parametric Instance Classification for Unsupervised Visual Feature learning. NeurIPS 2020 - [c17]Yihong Chen, Zheng Zhang, Yue Cao, Liwei Wang, Stephen Lin, Han Hu:
RepPoints v2: Verification Meets Regression for Object Detection. NeurIPS 2020 - [c16]Cheng Chi, Fangyun Wei, Han Hu:
RelationNet++: Bridging Visual Representations for Object Detection via Transformer Decoder. NeurIPS 2020 - [i19]Bin Liu, Yue Cao, Yutong Lin, Qi Li, Zheng Zhang, Mingsheng Long, Han Hu:
Negative Margin Matters: Understanding Margin in Few-shot Classification. CoRR abs/2003.12060 (2020) - [i18]Yihong Chen, Yue Cao, Han Hu, Liwei Wang:
Memory Enhanced Global-Local Aggregation for Video Object Detection. CoRR abs/2003.12063 (2020) - [i17]Minghao Yin, Zhuliang Yao, Yue Cao, Xiu Li, Zheng Zhang, Stephen Lin, Han Hu:
Disentangled Non-Local Neural Networks. CoRR abs/2006.06668 (2020) - [i16]Yue Cao, Zhenda Xie, Bin Liu, Yutong Lin, Zheng Zhang, Han Hu:
Parametric Instance Classification for Unsupervised Visual Feature Learning. CoRR abs/2006.14618 (2020) - [i15]Ze Liu, Han Hu, Yue Cao, Zheng Zhang, Xin Tong:
A Closer Look at Local Aggregation Operators in Point Cloud Analysis. CoRR abs/2007.01294 (2020) - [i14]Yihong Chen, Zheng Zhang, Yue Cao, Liwei Wang, Stephen Lin, Han Hu:
RepPoints V2: Verification Meets Regression for Object Detection. CoRR abs/2007.08508 (2020) - [i13]Cheng Chi, Fangyun Wei, Han Hu:
RelationNet++: Bridging Visual Representations for Object Detection via Transformer Decoder. CoRR abs/2010.15831 (2020) - [i12]Zhenda Xie, Yutong Lin, Zheng Zhang, Yue Cao, Stephen Lin, Han Hu:
Propagate Yourself: Exploring Pixel-Level Consistency for Unsupervised Visual Representation Learning. CoRR abs/2011.10043 (2020) - [i11]Yue Cao, Jiarui Xu, Stephen Lin, Fangyun Wei, Han Hu:
Global Context Networks. CoRR abs/2012.13375 (2020)
2010 – 2019
- 2019
- [c15]Xizhou Zhu, Han Hu, Stephen Lin, Jifeng Dai:
Deformable ConvNets V2: More Deformable, Better Results. CVPR 2019: 9308-9316 - [c14]Han Hu, Zheng Zhang, Zhenda Xie, Stephen Lin:
Local Relation Networks for Image Recognition. ICCV 2019: 3463-3472 - [c13]Jiarui Xu, Yue Cao, Zheng Zhang, Han Hu:
Spatial-Temporal Relation Networks for Multi-Object Tracking. ICCV 2019: 3987-3997 - [c12]Ze Yang, Shaohui Liu, Han Hu, Liwei Wang, Stephen Lin:
RepPoints: Point Set Representation for Object Detection. ICCV 2019: 9656-9665 - [c11]Bin Liu, Zhirong Wu, Han Hu, Stephen Lin:
Deep Metric Transfer for Label Propagation with Limited Annotated Data. ICCV Workshops 2019: 1317-1326 - [c10]Yue Cao, Jiarui Xu, Stephen Lin, Fangyun Wei, Han Hu:
GCNet: Non-Local Networks Meet Squeeze-Excitation Networks and Beyond. ICCV Workshops 2019: 1971-1980 - [i10]Jiarui Xu, Yue Cao, Zheng Zhang, Han Hu:
Spatial-Temporal Relation Networks for Multi-Object Tracking. CoRR abs/1904.11489 (2019) - [i9]Ze Yang, Shaohui Liu, Han Hu, Liwei Wang, Stephen Lin:
RepPoints: Point Set Representation for Object Detection. CoRR abs/1904.11490 (2019) - [i8]Han Hu, Zheng Zhang, Zhenda Xie, Stephen Lin:
Local Relation Networks for Image Recognition. CoRR abs/1904.11491 (2019) - [i7]Yue Cao, Jiarui Xu, Stephen Lin, Fangyun Wei, Han Hu:
GCNet: Non-local Networks Meet Squeeze-Excitation Networks and Beyond. CoRR abs/1904.11492 (2019) - [i6]Ze Yang, Yinghao Xu, Han Xue, Zheng Zhang, Raquel Urtasun, Liwei Wang, Stephen Lin, Han Hu:
Dense RepPoints: Representing Visual Objects with Dense Point Sets. CoRR abs/1912.11473 (2019) - 2018
- [c9]Han Hu, Jiayuan Gu, Zheng Zhang, Jifeng Dai, Yichen Wei:
Relation Networks for Object Detection. CVPR 2018: 3588-3597 - [c8]Jiayuan Gu, Han Hu, Liwei Wang, Yichen Wei, Jifeng Dai:
Learning Region Features for Object Detection. ECCV (12) 2018: 392-406 - [i5]Jiayuan Gu, Han Hu, Liwei Wang, Yichen Wei, Jifeng Dai:
Learning Region Features for Object Detection. CoRR abs/1803.07066 (2018) - [i4]Xizhou Zhu, Han Hu, Stephen Lin, Jifeng Dai:
Deformable ConvNets v2: More Deformable, Better Results. CoRR abs/1811.11168 (2018) - [i3]Bin Liu, Zhirong Wu, Han Hu, Stephen Lin:
Deep Metric Transfer for Label Propagation with Limited Annotated Data. CoRR abs/1812.08781 (2018) - 2017
- [c7]Jifeng Dai, Haozhi Qi, Yuwen Xiong, Yi Li, Guodong Zhang, Han Hu, Yichen Wei:
Deformable Convolutional Networks. ICCV 2017: 764-773 - [i2]Jifeng Dai, Haozhi Qi, Yuwen Xiong, Yi Li, Guodong Zhang, Han Hu, Yichen Wei:
Deformable Convolutional Networks. CoRR abs/1703.06211 (2017) - [i1]Han Hu, Jiayuan Gu, Zheng Zhang, Jifeng Dai, Yichen Wei:
Relation Networks for Object Detection. CoRR abs/1711.11575 (2017) - 2016
- [j4]Kailin Ge, Han Hu, Jianjiang Feng, Jie Zhou:
Depth Estimation Using a Sliding Camera. IEEE Trans. Image Process. 25(2): 726-739 (2016) - 2015
- [j3]Han Hu, Jianjiang Feng, Jie Zhou:
Exploiting Unsupervised and Supervised Constraints for Subspace Clustering. IEEE Trans. Pattern Anal. Mach. Intell. 37(8): 1542-1557 (2015) - [c6]Chuan Yu, Lu Tian, Han Hu, Yueqi Duan, Jie Zhou:
Progressive feature matching via triplet graph. ICIP 2015: 1860-1864 - 2014
- [c5]Han Hu, Zhouchen Lin, Jianjiang Feng, Jie Zhou:
Smooth Representation Clustering. CVPR 2014: 3834-3841 - 2013
- [j2]Han Hu, Jianjiang Feng, Chuan Yu, Jie Zhou:
Multi-Class Constrained Normalized Cut With Hard, Soft, Unary and Pairwise Priors and its Applications to Object Segmentation. IEEE Trans. Image Process. 22(11): 4328-4340 (2013) - 2012
- [c4]Han Hu, Jiahuan Zhou, Jianjiang Feng, Jie Zhou:
Multi-way constrained spectral clustering by nonnegative restriction. ICPR 2012: 1550-1553 - 2011
- [j1]Jie Zhou, Han Hu, Dingrui Wan:
Video Stabilization and Completion Using Two Cameras. IEEE Trans. Circuits Syst. Video Technol. 21(12): 1879-1889 (2011) - 2010
- [c3]Han Hu, Jie Zhou:
Trajectory matching from unsynchronized videos. CVPR 2010: 1347-1354 - [c2]Han Hu, Quanquan Gu, Jie Zhou:
HTF: a novel feature for general crack detection. ICIP 2010: 1633-1636
2000 – 2009
- 2009
- [c1]Han Hu, Quanquan Gu, Lei Deng, Jie Zhou:
Multiframe Motion Segmentation via Penalized MAP Estimation and Linear Programming. BMVC 2009: 1-11
last updated on 2025-01-26 23:50 CET by the dblp team
all metadata released as open data under CC0 1.0 license