Detection of one-horned rhino using multispectral images
Abstract
Animal detection and surveillance is an important field of research to address the needs for protection of endangered species among others. The challenges in animal detection include low-contrast and poor image quality which is commonly observed during night time. Researchers have mostly worked on day-light, low-contrast and thermal images. To handle the challenges of detection during night time, multispectral images in combination with deep architectures may be used for better detection performance. In the present work, one-horned rhino is considered for detection because they are getting endangered for reasons like poaching, natural calamities and diseases. A novel multispectral one-horned rhino dataset is introduced and the multispectral data is obtained by combining the channels of color images and the corresponding thermal images. Instance segmentation based techniques YOLACT and YOLACT++ are used here to detect rhinos with the above multispectral dataset. The performances of the detectors are studied in terms of mAP and FPS.
Full Text:
PDFReferences
S. Choudhury, N. Saikia and A. J. Das, “Recent Trends in Learning Based Techniques for Human and Animal Detection”, in Joint National Conference on Emerging Computing Technologies & its Applications (NCECTA 2019), April, 2019, PSG College of Technology, Coimbatore, Tamil Nadu, India.
S. Choudhury, N. Bharti, N. Saikia and S. Rajbongshi, “Detection of One-horned Rhino from Green Environment Background using Deep Learning”, Journal of Green Engineering, vol. 10, pages 4657-4678, September, 2020.
S. Choudhury, N. Saikia, S. Rajbongshi and A. Das, “Employing generative adversarial network in low light animal detection”, In: Kumar, S., Hiranwal, S., Purohit, S.D., Prasad, M. (eds) Proceedings of International Conference on Communication and Computational Technologies . Algorithms for Intelligent Systems. Springer, Singapore. https://doi.org/10.1007/978-981-19-3951-8_75.
K. He, G. Gkioxari, P. Dollár and R. Girshick, “Mask R-CNN”, 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 2017, pp. 2980-2988, doi: 10.1109/ICCV.2017.322
S. Ren, K. He, R. Girshick, and J. Sun, “Faster R-CNN: towards real-time object detection with region proposal networks”, In Proceedings of the 28th International Conference on Neural Information Processing Systems - Volume 1 (NIPS'15). MIT Press, Cambridge, MA, USA, 91–99, 2015.
H. Chen, K. Sun, Z. Tian, C. Shen, Y. Huang and Y. Yan, "BlendMask: Top-Down Meets Bottom-Up for Instance Segmentation," 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 2020, pp. 8570-8578, doi: 10.1109/CVPR42600.2020.00860.
D. Bolya, C. Zhou, F. Xiao and Y. J. Lee, "YOLACT: Real-Time Instance Segmentation," 2019 IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, Korea (South), 2019, pp. 9156-9165, doi: 10.1109/ICCV.2019.00925.
D. Bolya, C. Zhou, F. Xiao and Y. J. Lee, "YOLACT++ Better Real-Time Instance Segmentation," in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 44, no. 2, pp. 1108-1121, 1 Feb. 2022, doi: 10.1109/TPAMI.2020.3014297.
M.O. Gani, S. Kuiry, A. Das, M. Nasipuri and N. Das, “Multispectral Object Detection with Deep Learning”, arXiv e-prints, 2021. doi:10.48550/arXiv.2102.03115.
P. Gudžiu, O. Kurasova, V. Darulis, “Deep learning-based object recognition in multispectral satellite imagery for real-time applications”, Machine Vision and Applications 32, 98 (2021). https://doi.org/10.1007/s00138-021-01209-2.
J. Shu, J. He, L. Li, and T. Reddy G, “MSIS: Multispectral Instance Segmentation Method for Power Equipment”, In Intell. Neuroscience 2022, https://doi.org/10.1155/2022/2864717.
J. Lopez, J. Schoonmaker and S. Saggese, "Automated detection of marine animals using multispectral imaging," 2014 Oceans - St. John's, St. John's, NL, Canada, 2014, pp. 1-6, doi: 10.1109/OCEANS.2014.7003132.
S. Hwang, H.K. Shin, J.M. Park, “Classification of dog skin diseases using deep learning with images captured from multispectral imaging device”, Mol. Cell. Toxicol. 18, 299–309 (2022). https://doi.org/10.1007/s13273-022-00249-7
B. C. Russell, A. Torralba, K. P. Murphy, “LabelMe: A Database and Web-Based Tool for Image Annotation”, In International Journal Computer Vision, 77, 157–173 (2008). https://doi.org/10.1007/s11263-007-0090-8.
Refbacks
- There are currently no refbacks.
------------------------------------------------------------------------------------------------------------------------
The ADBU Journal of Engineering Technology (AJET)" ISSN:2348-7305
This journal is published under the terms of the Creative Commons Attribution (CC-BY) (http://creativecommons.org/licenses/)