Jingdong Wang (王井东), Fellow of IEEE and IAPR |
![]() |
Chief Architect for Computer Vision |
|
CV    Google Scholar    DBLP    ORCID |
28. Transformer does not outperform CNN: On the Connection between Local Attention and Dynamic Depth-wise Convolution. [pdf] [code]. ICLR 2022 spotlight. 3/2022 |
27. People of ACM interview: URL 12/2021. |
26. Elected as Fellow of IEEE, for his contributions to visual content understanding and retrieval, 11/2021. |
25. Code released for our NeurIPS 2021 paper, HRFormer: High-Resolution Transformer for Dense Prediction. [pdf] code. 09/2021 |
24. Code released for our NeurIPS 2021 paper, SPANN: Highly-efficient Billion-scale ApproximateNearest Neighbor Search. [pdf] code. 09/2021 |
23. Code released for our ICCV 2021 paper, Conditional DETR for Fast Training Convergence. [pdf] code. 8/16/2021 |
22. Local Transformer attention is equivalent to inhomogeneous dynamic depth-wise convolution: Demystifying local attention. 7/2021 |
21. Welcome to the large scale approximate nearest search challenge at NeurIPS 2021: Big ANN Benchmark. 5/2021 |
20. HRNet is shipped to Form Recognizer for Table Recognition. 5/2021 |
19. Update object-contextual representation for semantic segmentation (ECCV 2020). We rephrase it as Segmentation Transformer. [pdf] code. 5/4/2021 |
18. Code released for our CVPR 2021 paper, Lite-HRNet: A Lightweight High-Resolution Network. [pdf] code. 4/12/2021 |
17. Code released for our CVPR 2021 paper, Bottom-Up Human Pose Estimation via Disentangled Keypoint Regression. [pdf] code. 4/7/2021 |
16. HRNet: Deep High-Resolution Representation Learning for Visual Recognition. Accepted by TPAMI. [pdf] or [pdf at arXiv]. This is a longer version of the HRNet paper published in CVPR 2019. HRNet is a stronger backbone, and acheives superior performance on human pose estimation, semantic segmentation, object detection, face alignment, and so on. Codes are available. Human pose estimation: ; Semantic segmentation ; Object detection ; Facial landmark detection ; ImageNet classification: . 3/13/2020 |
15. HRNet + OCR + SegFix is ranked 1 on cityscapes segmentation. Cityscapes segmentation leaderboard (January2020). The implementation of HRNet + OCR is available: code |
14. Invited as an area chair of CVPR 2020, ECCV 2020, and IJCAI 2020. |
13. HRNet + OCR is ranked 1 on cityscapes segmentation. Cityscapes segmentation leaderboard (July 2019). |
12. High-Resolution Network (HRNet). A replacement of classification networks for visual recognition. projects page. |
11. Fast neighborhood graph-based approximate nearest neighbor search: code . Bing vector search. TechCrunch. |
10. Invited as an area chair of ICCV 2019, and IJCAI 2019. |
9. Elected as an ACM Distinguished Member, 11/2018. |
8. Gave a keynote talk about approximate nearest neighbor search on 9/29/2018 at JD.com. slides |
7. Second place entry, COCO keypoints detection challenge ECCV 2018. |
6. Appointed as AE of TPAMI, 09/2018. |
5. Elected as Fellow of IAPR 2018. |
4. One paper is accepted by ECCV 2018. |
3. Two papers are accepted by ACM MM 2018. |
2. Three papers are accepted by CVPR 2018. |
1. Appointed as AE of TCSVT, 01/2018. |
1. High-resolution networks (HRNet). A replacement of classification networks for computer vision problems projects. Human pose estimation (CVPR 2019): code . Other applications pdf (short) pdf (long) code: semantic segmentation , object detection , facial landmark detection , and ImageNet classification . |
2. Small convolutional neural networks. Interleaved group convolutions. IGCV1 (ICCV 2017): pdf code | IGCV2 (CVPR 2018): pdf | IGCV3 (BMVC 2018): pdf code |
3. Large-scale indexing for similarity search. Neighborhood graph search (ACM MM 2012): pdf | Neighborhood graph construction (CVPR 2012): pdf | Trinary-projection trees (TPAMI, CVPR 2010): pdf | code |
4. Hashing and quantization. A survey on learning to hash (TPAMI): pdf v2 html v2 tex v2 pdf v1 | Composite quantization (TPAMI, ICML 2014): pdf code |
5. Salient object detection. Discriminative Regional Feature Integration (IJCV, CVPR 2013): pdf (CVPR) pdf (IJCV) c++ code matlab code project | Local context (BMVC 2011): pdf code | Learning to detect a salient object (TPAMI): pdf |
[14]  | HRFormer: High-Resolution Transformer for Dense Prediction. Yuhui Yuan, Rao Fu, Lang Huang, Weihong Lin, Chao Zhang, Xilin Chen, and Jingdong Wang. NeurIPS 2021. [pdf] [code] |
[13]  | SPANN: Highly-efficient Billion-scale ApproximateNearest Neighbor Search. Qi Chen, Bing Zhao, Haidong Wang, Mingqin Li, Chuanjie Liu, Zengzhong Li, Mao Yang, and Jingdong Wang. NeurIPS 2021. [pdf] [code] |
[12]  | Conditional DETR for Fast Training Convergence. Depu Meng, Xiaokang Chen, Zejia Fan, Gang Zeng, Houqiang Li, Yuhui Yuan, Lei Sun, and Jingdong Wang. ICCV 2021. [pdf] [code] |
[11]  | Object-Contextual Representations for Semantic Segmentation. Yuhui Yuan, Xilin Chen, and Jingdong Wang. ECCV 2020. [pdf] [code] |
[10]  | Deep high-resolution representation learning for human pose estimation. Ke Sun, Bin Xiao, Dong Liu and Jingdong Wang. CVPR 2019. [pdf] [code] |
[9]  | Part-Aligned Bilinear Representations for Person Re-identification. Yumin Suh, Jingdong Wang, Siyu Tang, Tao Mei, and Kyoung Mu Lee. ECCV 2018. [pdf] [code] |
[8]  | IGCV3: Interleaved Low-Rank Group Convolutions for Efficient Deep Neural Networks. Ke Sun, Mingjie Li, Dong Liu, and Jingdong Wang BMVC 2018. [pdf] [code] |
[7]  | Composite Quantization. Jingdong Wang, and Ting Zhang. TPAMI 2018. [pdf] [code] |
[6]  | Deep Convolutional Neural Networks with Merge-and-Run Mappings. Liming Zhao, Mingjie Li, Depu Meng, Xi Li, Zhaoxiang Zhang, Yueting Zhuang, Zhuowen Tu, and Jingdong Wang. IJCAI 2018. [pdf] [code] |
[5]  | IGCV2: Interleaved Structured Sparse Convolutional Neural Networks. Guotian Xie, Jingdong Wang, Ting Zhang, Jianhuang Lai, Richang Hong, and Guo-Jun Qi. CVPR 2018. [pdf] [code] |
[4]  | IGCV1: Interleaved Group Convolutions. Ting Zhang, Guo-Jun Qi, Bin Xiao, and Jingdong Wang. ICCV 2017. [pdf] [code] [related papers] [Zhihu] [blog] |
[3]  | Deeply-Learned Part-Aligned Representations for Person Re-Identification. Liming Zhao, Xi Li, Yueting Zhuang, and Jingdong Wang. ICCV 2017. [pdf] [code] |
[2]  | A Survey on Learning to Hash. Jingdong Wang, Ting Zhang, Jingkuan Song, Nicu Sebe, and Heng Tao Shen. TPAMI, Accepted 2017. [pdf v2] [[pdf v1]] |
[1]  | Deeply-Fused Nets. Jingdong Wang, Zhen Wei, and Ting Zhang. arXiv. [pdf] [code] |