(2) Yihao Chen (Soochow University, Suzhou, China)
(3) Hao Yuan (Soochow University, Suzhou, China)
(4) Cheng Wu (Soochow University, Suzhou, China)
*corresponding author
AbstractOne of the major reasons for the explosion of autonomous driving in recent years is the great development of computer vision. As one of the most fundamental and challenging problems in autonomous driving, environment understanding has been widely studied. It directly determines whether the entire in-vehicle system can effectively identify surrounding objects of vehicles and make correct path planning. Semantic segmentation is the most important means of environment understanding among the many image recognition algorithms used in autonomous driving. However, the success of semantic segmentation models is highly dependent on human expertise in data preparation and hyperparameter optimization, and the tedious process of training is repeated over and over for each new scene. Automated machine learning (AutoML) is a research area for this problem that aims to automate the development of end-to-end ML models. In this paper, we propose an automatic learning method for semantic segmentation based on reinforcement learning (RL), which can realize automatic selection of training data and guide automatic training of semantic segmentation. The results show that our scheme converges faster and has higher accuracy than researchers manually training semantic segmentation models, while requiring no human involvement.
KeywordsAutomated machine learning; Semantic segmentation; Reinforcement learning; Autonomous driving
|
DOIhttps://doi.org/10.26555/ijain.v10i1.1521 |
Article metricsAbstract views : 493 | PDF views : 113 |
Cite |
Full TextDownload |
References
[1] M. Campbell, M. Egerstedt, J. P. How, and R. M. Murray, “Autonomous driving in urban environments: approaches, lessons and challenges,” Philos. Trans. R. Soc. A Math. Phys. Eng. Sci., vol. 368, no. 1928, pp. 4649–4672, Oct. 2010, doi: 10.1098/rsta.2010.0110.
[2] L. Mora, X. Wu, and A. Panori, “Mind the gap: Developments in autonomous driving research and the sustainability challenge,” J. Clean. Prod., vol. 275, p. 124087, Dec. 2020, doi: 10.1016/j.jclepro.2020.124087.
[3] S. Liu, L. Liu, J. Tang, B. Yu, Y. Wang, and W. Shi, “Edge Computing for Autonomous Driving: Opportunities and Challenges,” Proc. IEEE, vol. 107, no. 8, pp. 1697–1716, Aug. 2019, doi: 10.1109/JPROC.2019.2915983.
[4] D. Feng et al., “Deep Multi-Modal Object Detection and Semantic Segmentation for Autonomous Driving: Datasets, Methods, and Challenges,” IEEE Trans. Intell. Transp. Syst., vol. 22, no. 3, pp. 1341–1360, Mar. 2021, doi: 10.1109/TITS.2020.2972974.
[5] K. Zheng and H. A. H. Naji, “Road Scene Segmentation Based on Deep Learning,” IEEE Access, vol. 8, pp. 140964–140971, 2020, doi: 10.1109/ACCESS.2020.3009782.
[6] J. Zhao, J. Liu, L. Yang, B. Ai, and S. Ni, “Future 5G-oriented system for urban rail transit: Opportunities and challenges,” China Commun., vol. 18, no. 2, pp. 1–12, Feb. 2021, doi: 10.23919/JCC.2021.02.001.
[7] P. Singh, M. A. Dulebenets, J. Pasha, E. D. R. S. Gonzalez, Y.-Y. Lau, and R. Kampmann, “Deployment of Autonomous Trains in Rail Transportation: Current Trends and Existing Challenges,” IEEE Access, vol. 9, pp. 91427–91461, 2021, doi: 10.1109/ACCESS.2021.3091550.
[8] M. Siam, S. Elkerdawy, M. Jagersand, and S. Yogamani, “Deep semantic segmentation for automated driving: Taxonomy, roadmap and challenges,” in 2017 IEEE 20th International Conference on Intelligent Transportation Systems (ITSC), Oct. 2017, vol. 2018-March, pp. 1–8, doi: 10.1109/ITSC.2017.8317714.
[9] S. Hao, Y. Zhou, and Y. Guo, “A Brief Survey on Semantic Segmentation with Deep Learning,” Neurocomputing, vol. 406, pp. 302–321, Sep. 2020, doi: 10.1016/j.neucom.2019.11.118.
[10] F. Lateef and Y. Ruichek, “Survey on semantic segmentation using deep learning techniques,” Neurocomputing, vol. 338, pp. 321–348, Apr. 2019, doi: 10.1016/j.neucom.2019.02.003.
[11] S. Chennupati, G. Sistu, S. Yogamani, and S. Rawashdeh, “AuxNet: Auxiliary Tasks Enhanced Semantic Segmentation for Automated Driving,” in Proceedings of the 14th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, 2019, vol. 5, pp. 645–652, doi: 10.5220/0007684106450652.
[12] Q. Yao et al., “Taking Human out of Learning Applications: A Survey on Automated Machine Learning,” in arxiv Artificial Intelligence, p.20, Oct. 2018. [Online]. Available at: https://arxiv.org/abs/1810.13306v4.
[13] X. He, K. Zhao, and X. Chu, “AutoML: A survey of the state-of-the-art,” Knowledge-Based Syst., vol. 212, p. 106622, Jan. 2021, doi: 10.1016/j.knosys.2020.106622.
[14] J. Y. Hesterman, L. Caucci, M. A. Kupinski, H. H. Barrett, and L. R. Furenlid, “Maximum-Likelihood Estimation With a Contracting-Grid Search Algorithm,” IEEE Trans. Nucl. Sci., vol. 57, no. 3, pp. 1077–1084, Jun. 2010, doi: 10.1109/TNS.2010.2045898.
[15] J. Bergstra and Y. Bengio, “Random search for hyper-parameter optimization,” J. Mach. Learn. Res., vol. 13, pp. 281–305, 2012, [Online]. Available at: https://www.jmlr.org/papers/volume13/bergstra12a/bergstra12a.pdf.
[16] B. Shahriari, K. Swersky, Z. Wang, R. P. Adams, and N. de Freitas, “Taking the Human Out of the Loop: A Review of Bayesian Optimization,” Proc. IEEE, vol. 104, no. 1, pp. 148–175, Jan. 2016, doi: 10.1109/JPROC.2015.2494218.
[17] T. Yu and H. Zhu, “Hyper-Parameter Optimization: A Review of Algorithms and Applications,” in arxiv Machine Learning, p. 56, Mar. 2020. [Online]. Available: https://arxiv.org/abs/2003.05689v1.
[18] Y. Bengio, “Gradient-Based Optimization of Hyperparameters,” Neural Comput., vol. 12, no. 8, pp. 1889–1900, Aug. 2000, doi: 10.1162/089976600300015187.
[19] J. Domke, “Generic methods for optimization-based modeling,” J. Mach. Learn. Res., vol. 22, pp. 318–326, 2012, [Online]. Available at: https://proceedings.mlr.press/v22/domke12/domke12.pdf.
[20] L. Franceschi, M. Donini, P. Frasconi, and M. Pontil, “Forward and Reverse Gradient-Based Hyperparameter Optimization.” PMLR, pp. 1165–1173, Jul. 17, 2017. [Online]. Available at: https://proceedings.mlr.press/v70/franceschi17a.html.
[21] D. Maclaurin, D. Duvenaud, and R. P. Adams, “Gradient-based Hyperparameter Optimization through Reversible Learning,” 2015, vol. 37, p. 10, [Online]. Available at: http://arxiv.org/abs/1502.03492.
[22] F. Pedregosa, “Hyperparameter optimization with approximate gradient.” PMLR, pp. 737–746, Jun. 11, 2016. [Online]. Available at: https://proceedings.mlr.press/v48/pedregosa16.html.
[23] K. Chandra, A. Xie, J. Ragan-Kelley, and E. Meijer, “Gradient Descent: The Ultimate Optimizer,” Adv. Neural Inf. Process. Syst., vol. 35, p. 12, Sep. 2019. Available at: https://arxiv.org/abs/1909.13371v2.
[24] D. P. Kingma and J. L. Ba, “Adam: A Method for Stochastic Optimization,” 3rd International Conference on Learning Representations, ICLR 2015 - Conference Track Proceedings. International Conference on Learning Representations, ICLR, pp. 1–15, Dec. 22, 2014. [Online]. Available at: https://arxiv.org/abs/1412.6980.
[25] Y. Zhang, Z. Qiu, J. Liu, T. Yao, D. Liu, and T. Mei, “Customizable Architecture Search for Semantic Segmentation,” in 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Jun. 2019, vol. 2019-June, pp. 11633–11642, doi: 10.1109/CVPR.2019.01191.
[26] V. Nekrasov, H. Chen, C. Shen, and I. Reid, “Fast Neural Architecture Search of Compact Semantic Segmentation Models via Auxiliary Cells,” in 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Jun. 2019, vol. 2019-June, pp. 9118–9127, doi: 10.1109/CVPR.2019.00934.
[27] C. Liu et al., “Auto-DeepLab: Hierarchical Neural Architecture Search for Semantic Image Segmentation,” in 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Jun. 2019, vol. 2019-June, pp. 82–92, doi: 10.1109/CVPR.2019.00017.
[28] H. S. Kim, K. Y. Yoo, and L. H. Kim, “Improved performance of image semantic segmentation using NASNet,” Korean Chem. Eng. Res., vol. 57, no. 2, pp. 274–282, 2019, [Online]. Available at: https://www.cheric.org/research/tech/periodicals/view.php?seq=1716510.
[29] W. Chen, X. Gong, X. Liu, Q. Zhang, Y. Li, and Z. Wang, “FasterSeg: Searching for Faster Real-time Semantic Segmentation,” 8th Int. Conf. Learn. Represent. ICLR 2020, Dec. 2019. [Online]. Available at: https://arxiv.org/abs/1912.10917v2.
[30] X. Yang et al., “Towards Automated Semantic Segmentation in Prenatal Volumetric Ultrasound,” IEEE Trans. Med. Imaging, vol. 38, no. 1, pp. 180–193, Jan. 2019, doi: 10.1109/TMI.2018.2858779.
[31] V. Mnih et al., “Playing Atari with Deep Reinforcement Learning,” arxiv Mach. Learn., pp. 1–9, Dec. 2013, Accessed: Nov. 14, 2023. [Online]. Available at: https://arxiv.org/abs/1312.5602v1.
[32] V. Mnih et al., “Human-level control through deep reinforcement learning,” Nature, vol. 518, no. 7540, pp. 529–533, Feb. 2015, doi: 10.1038/nature14236.
[33] Y. Chen, N. Zhu, Q. Wu, C. Wu, W. Niu, and Y. Wang, “MRSI: A multimodal proximity remote sensing data set for environment perception in rail transit,” Int. J. Intell. Syst., vol. 37, no. 9, pp. 5530–5556, Sep. 2022, doi: 10.1002/int.22801.
[34] C. Yu, J. Wang, C. Peng, C. Gao, G. Yu, and N. Sang, “BiSeNet: Bilateral Segmentation Network for Real-Time Semantic Segmentation,” in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 11217 LNCS, Springer Verlag, 2018, pp. 334–349, doi: 10.1007/978-3-030-01261-8_20.
[35] C. Yu, C. Gao, J. Wang, G. Yu, C. Shen, and N. Sang, “BiSeNet V2: Bilateral Network with Guided Aggregation for Real-Time Semantic Segmentation,” Int. J. Comput. Vis., vol. 129, no. 11, pp. 3051–3068, Nov. 2021, doi: 10.1007/s11263-021-01515-2.
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
___________________________________________________________
International Journal of Advances in Intelligent Informatics
ISSN 2442-6571 (print) | 2548-3161 (online)
Organized by UAD and ASCEE Computer Society
Published by Universitas Ahmad Dahlan
W: http://ijain.org
E: info@ijain.org (paper handling issues)
andri.pranolo.id@ieee.org (publication issues)
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0