Global to multi-scale local architecture with hardwired CNN for 1-ms tomato defect detection

Yuan Li*, Tingting Hu, Ryuji Fuchikami, Takeshi Ikenaga


研究成果: Article査読


A 1 millisecond (1-ms) vision system that guarantees high efficiency and timely response for tomato defect detection is essential for factory automation. Because of various defect appearances, recently many existing researches focus on CNN based defect detection, but few of them attempt to reach high processing speed to adapt to the factorial assembly line. This paper proposes a global to multi-scale local based parallel architecture with hardwired CNN for tomato defect detection. This architecture breaks down image-wise detection into pixel-wise localization and block-wise classification. The pixel-wise localization utilizes tomato-aware information as constraints for localization performance. The block-wise classification uses a fully pipelined network structure to obtain the classification result for each block as the pixel stream moves through the network. The classification network has a six-layer lightweight network structure with quantization for hardwired type implementation on FPGA. The experiment results show that the proposed architecture processes 1000 FPS images with 0.9476 ms/frame delay. And for detection performance, this architecture keeps (Formula presented.) at 80.18%, only 1.31% lower than ResNet50 based detection system.

ジャーナルIET Image Processing
出版ステータスAccepted/In press - 2024

ASJC Scopus subject areas

  • ソフトウェア
  • 信号処理
  • コンピュータ ビジョンおよびパターン認識
  • 電子工学および電気工学


「Global to multi-scale local architecture with hardwired CNN for 1-ms tomato defect detection」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。