Abstract: 【Objective】China is a global leader in fruit production, and fruit picking mainly relies on manual labor, which helps to select fruits according to fruit size and quality to reduce loss in this way, different techniques and tools can be adopted according to the characteristics and picking needs of each fruit tree . However,the present picking field is faced with the problem of decreasing human resources and aging problem.Meanwhile, the traditional manual picking method has become unable to meet the demand for fast and efficient picking. To solve the problem of labor shortage,the research and development of automated fruit picking equipment with integrated computer vision has become the key to solve the problem of labor shortage. It can effectively improve the efficiency and quality of fruit picking. 【Methods】Automatic picking equipment combined with computer vision often uses object detection algorithms to identify objects, and object detection algorithms can be divided into traditional algorithms and deep learning-based object detection algorithms.Traditional algorithms identify the position and bounding box of a specific object in an image or video, usually by preprocessing the image (Scaling, grayscale or normalization), feature extraction (using traditional hand-designed features or automatic learning on machine learning), classification or regression (confirming object class and location), and non-maximum suppression to further optimize and filter detected objects. When traditional fruit detection algorithms process images in complex environments, their limited expression ability and robustness are easily affected by illumination, occlusion and other factors, resulting in a decline in recognition accuracy. Furthermore, with the increase of feature complexity and computation amount, the algorithm processing speed will be reduced. When changing scenes, adding fruit types, and updating features, the feature extractor needs to be redesigned and adjusted, and in special cases, the entire system needs to be retrained. Compared with traditional fruit detection algorithms, the fruit detection algorithm based on deep learning can extract and learn rich features from a large amount of data, and has higher accuracy and robustness when processing noisy data. When changing new environments and adding new categories, the fruit detection algorithm based on deep learning can improve the recognition ability and recognition accuracy of the model through transfer learning, data enhancement, multi-model combination, feature fusion and multi-modal data.【Result】Fruit detection algorithms based on deep learning can be divided into two categories: one-stage target detection algorithm and two-stage target detection algorithm. The one-stage object detection algorithm achieves end-to-end detection by using a single convolutional neural network to directly predict the target location and category. This method achieves fast detection while maintaining high accuracy, transforms the problem of target detection into a regression problem, and completes the location and classification of the target directly. In the training and deployment phase of the algorithm, the first-stage object detection algorithm uses pruning and quantization techniques to reduce the model size, which is suitable for running in mobile devices or embedded systems with limited resources. The two-stage target detection algorithm is called the target detection algorithm based on region of interest or region suggestion, which is usually divided into two stages: 1) generate a large number of candidate regions by selective search, regional suggestion network (RPN) and other methods; 2) through the network processing including classifiers and boundary box regressors, the candidate region is identified and accurately located. Traditional algorithms are effective in simple scenarios, but are often limited by design features in complex environments. Algorithms based on deep learning are more suitable for automated fruit picking due to their high efficiency and accuracy. This paper summarizes the improvement and application of traditional Object detection algorithm and deep learning-based Object detection algorithm.And,This paper summarizes the improvements and applications of traditional spherical fruit detection algorithms and deep learning-based spherical fruit detection algorithms, and analyzes the advantages and disadvantages of these algorithms in different use scenarios.【Conclusion】This paper summarizes the fruit picking recognition algorithm and puts forward the future development trend of the algorithm.With model optimization and lightweight as the starting point, efficient network architecture or model compression technology is adopted to reduce computational complexity and model size, improve model processing speed and adapt to mobile automatic picking equipment. Enhance data processing, improve model generalization by preprocessing and synthesizing data, and optimize model adaptability in changing environments. The accuracy and robustness of model recognition are improved by combining spectral, infrared, laser and other sensor data. The model adaptive adjustment algorithm was developed to adjust strategies and parameters according to real-time feedback and adapt to different fruit picking operations and different picking environmental conditions. In the fruit picking recognition algorithm based on deep learning, YOLO can directly predict the boundary box and category probability of the target in a single forward propagation to achieve near real-time detection, which is very important for the fruit picking robot in the orchard that needs fast response. The end-to-end design of YOLO simplifies the training inspection process, reduces complexity, and enables faster deployment in picking robot systems. In the changeable environment of orchards and groves, YOLO can effectively distinguish between fruit and background, improving the accuracy of detection. With the continuous research of domestic scholars, YOLO algorithm is also continuously iteratively optimized, and its detection ability of objects of different sizes and shapes is significantly improved, which can adapt to the maturity, size and occlusion of fruits, and improve the detection performance in complex environments.
PDF ()