Region-based convolutional neural networks for occluded person re-identification

Atiqul Islam; Mark Tee Kit Tsun; Lau Bee Theng; Caslon Chua

doi:10.26555/ijain.v10i1.1125


(1)* Atiqul Islam (Swinburne University of Technology Sarawak Campus, Malaysia)
(2) Mark Tee Kit Tsun (Swinburne University of Technology Sarawak Campus, Malaysia)
(3) Lau Bee Theng (Swinburne University of Technology Sarawak Campus, Malaysia)
(4) Caslon Chua (Swinburne University of Technology, Melbourne, Australia)
* Corresponding author

Abstract

In a variety of applications, including intelligent surveillance systems, targeted tracking, and assistive human-following robots, the ability to accurately identify individuals even when they are partially obscured is imperative. Such continuous person tracking is complicated by the close similarity in appearance between people and by occlusion of the target. This study addresses this challenge by proposing a two-step, detection-first approach that uses a region-based convolutional neural network (R-CNN) as the re-identification (re-ID) solution. The model is specifically trained to detect occluded persons at different levels of occlusion before forwarding the image to the re-ID process. Three occlusion-specific datasets are selected to evaluate the model's effectiveness in detecting occluded people. They contain 379 distinct people in total, each with five images obstructed from different angles. A sample of the data is taken to simulate various environment settings, and new data points are generated with different degrees of occlusion to assess how well the model performs under varying levels of obstruction. The findings demonstrate that the proposed person re-ID model is reliable in most circumstances, achieving re-identification accuracy of 74% at Rank-1 and 90% at Rank-5. Although accuracy decreases as the number of distinct people in the dataset increases, this does not significantly affect tracking performance in the intended applications, which are expected to recognize a single person or a small group of individuals. Future work will explore refining the similarity matching algorithms through more robust image comparison techniques, thereby addressing the challenges presented by occlusions. A critical next step is to assess the model under diverse lighting conditions and to investigate scenarios with multiple individuals in a frame. It would also be beneficial to exploit high-resolution datasets, such as DukeMTMC-reID, and to integrate finer contextual details, such as clothing or carried objects. These collective efforts are essential for optimizing the model's efficacy in practical applications and for advancing person re-ID technologies.
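For readers unfamiliar with the Rank-1 and Rank-5 figures quoted in the abstract, the following is a minimal sketch of how Rank-k accuracy is commonly computed in re-ID evaluation: each query is compared against a gallery, the gallery is sorted by distance, and the query counts as a hit if its true identity appears among the k closest entries. The function names, the use of cosine distance, and the toy feature vectors are assumptions made for illustration; the abstract does not state which distance measure or evaluation code the authors used.

```python
from math import sqrt


def cosine_distance(a, b):
    """Cosine distance between two feature vectors (assumed metric)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = sqrt(sum(x * x for x in a))
    norm_b = sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def rank_k_accuracy(query_feats, query_ids, gallery_feats, gallery_ids, k):
    """Fraction of queries whose true identity is among the k nearest gallery entries."""
    hits = 0
    for feat, pid in zip(query_feats, query_ids):
        # Sort gallery indices from closest to farthest for this query.
        ranked = sorted(
            range(len(gallery_feats)),
            key=lambda i: cosine_distance(feat, gallery_feats[i]),
        )
        top_ids = [gallery_ids[i] for i in ranked[:k]]
        if pid in top_ids:
            hits += 1
    return hits / len(query_feats)


# Hypothetical toy data: three gallery identities, two queries.
gallery_feats = [[1.0, 0.0], [0.0, 1.0], [0.7, 0.7]]
gallery_ids = ["A", "B", "C"]
query_feats = [[0.9, 0.1], [0.6, 0.8]]
query_ids = ["A", "B"]

rank1 = rank_k_accuracy(query_feats, query_ids, gallery_feats, gallery_ids, k=1)
rank2 = rank_k_accuracy(query_feats, query_ids, gallery_feats, gallery_ids, k=2)
print(f"Rank-1: {rank1:.2f}, Rank-2: {rank2:.2f}")
```

In this toy example the second query is matched to the wrong identity first and to the correct one second, which is why the score rises as k grows. The same effect would explain a Rank-5 figure that is higher than the Rank-1 figure.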

Keywords

Occlusion; R-CNN; Re-identification; Region re-ranking

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

___________________________________________________________
International Journal of Advances in Intelligent Informatics
ISSN 2442-6571 (print) | 2548-3161 (online)
Organized by UAD and ASCEE Computer Society
Published by Universitas Ahmad Dahlan
W: http://ijain.org
E: info@ijain.org (paper handling issues)
andri.pranolo.id@ieee.org (publication issues)
