[MVIT Research LAB : Mahidol University] การตรวจจับวัตถุขนาดเล็กอย่างมีประสิทธิภาพด้วย MCBAN, Multi-Convolutional Block Attention Network ความร่วมมืองานวิจัย ระหว่าง MUICT และ UTP

สำรวจ
ลงทุน
คำถาม

มีบัญชีอยู่แล้ว?หรือ

MVIT Research LAB : Mahidol University

•

7 ธ.ค. 2024 เวลา 05:33 • วิทยาศาสตร์ & เทคโนโลยี

มหาวิทยาลัยมหิดล

การตรวจจับวัตถุขนาดเล็กอย่างมีประสิทธิภาพด้วย MCBAN, Multi-Convolutional Block Attention Network

ความร่วมมืองานวิจัย ระหว่าง MUICT และ UTP

โดย Hina Bhanbhro, Yew Kwang Hooi, Mohammad Nordin Bin Zakaria, Worapan Kusakunniran, Zaira Hassan Amur

จาก Computer and Information Science Department, Universiti Teknologi PETRONA, Malaysia

Faculty of ICT, Mahidol University, Thailand

การตรวจจับวัตถุมีความก้าวหน้าอย่างมากในช่วงไม่กี่ปีที่ผ่านมา อย่างไรก็ตาม การตรวจจับวัตถุขนาดเล็กยังคงเป็นปัญหาอย่างมากด้วยเหตุผลหลายประการ เช่น พวกมันมีขนาดเล็กมากและมีความอ่อนไหวต่อการตรวจจับที่พลาดเนื่องจากเสียงรบกวนรอบข้าง นอกจากนี้ ข้อมูลวัตถุขนาดเล็กยังได้รับผลกระทบเนื่องจากการดำเนินการลดการสุ่มตัวอย่าง มีการใช้วิธีการDeep learning-based detection

เพื่อจัดการกับความท้าทายที่เกิดจากการตรวจจับวัตถุขนาดเล็ก ในงานวิจัยนี้ได้เสนอวิธีการใหม่ ในชื่อ Multi-Convolutional Block Attention Network (MCBAN) เพื่อเพิ่มความแม่นยําในการตรวจจับ minute objects ที่มุ่งป้องกันการสูญเสียข้อมูลในระหว่างกระบวนการลดการสุ่มตัวอย่าง ตรวจจับวัตถุขนาดเล็กด้วยความแม่นยําที่สูงขึ้นจากการใช้ multi-convolutional attention block (MCAB) และ channel attention and spatial attention module (SAM)

เพื่อสร้างขึ้นเป็น MCAB ซึ่งได้ทำการทดลองกับ Karlsruhe Institute of Technology and Toyota Technological Institute (KITTI) และPattern Analysis กับ Statical Modeling and Computational Learning (PASCAL) บนฐานข้อมูล Visual Object Classes (VOC) และดำเนินกระบวนการ step-wise process เพื่อวิเคราะห์ผลลัพธ์

ผลการทดลองนี้แสดงให้เห็นว่ามีประสิทธิภาพเพิ่มขึ้นอย่างมีนัยสําคัญ เช่น 97.75% สําหรับ KITTI และ 88.97% สําหรับ PASCAL VOC ผลการวิจัยนี้ยืนยันข้อเท็จจริงที่ชัดเจนว่า MCBAN มีประสิทธิภาพในการตรวจจับวัตถุขนาดเล็กมากกว่าเมื่อเทียบกับวิธีการอื่น ๆ ที่มีอยู่ในปัจจุบัน

Object detection has made a significant leap forward in recent years. However, the detection of small objects continues to be a great difficulty for various reasons, such as they have a very small size and they are susceptible to missed detection due to background noise. Additionally, small object information is affected due to the downsampling operations.

Deep learning-based detection methods have been utilized to address the challenge posed by small objects. In this work, we propose a novel method, the Multi-Convolutional Block Attention Network (MCBAN), to increase the detection accuracy of minute objects aiming to overcome the challenge of information loss during the downsampling process.

The multi-convolutional attention block (MCAB); channel attention and spatial attention module (SAM) that make up MCAB, have been crafted to accomplish small object detection with higher precision. We have carried out the experiments on the Karlsruhe Institute of Technology and Toyota Technological Institute (KITTI) and Pattern Analysis, Statical Modeling and Computational Learning (PASCAL) Visual Object Classes (VOC) datasets and have followed a step-wise process to analyze the results.

These experiment results demonstrate that significant gains in performance are achieved, such as 97.75% for KITTI and 88.97% for PASCAL VOC. The findings of this study assert quite unequivocally the fact that MCBAN is much more efficient in the small object detection domain as compared to other existing approaches.

More info: https://doi.org/10.32604/cmc.2024.052138

ดูเพิ่มเติมในซีรีส์

MVIT Lab Research & Publications

โฆษณา

ดาวน์โหลดแอปพลิเคชัน

การตรวจจับวัตถุขนาดเล็กอย่างมีประสิทธิภาพด้วย MCBAN, Multi-Convolutional Block Attention Network

ดาวน์โหลดแอปพลิเคชัน