Grasping Complex-Shaped and Thin Objects Using a Generative Grasping Convolutional Neural Network

Kim, Jaeseok; Nocentini, Olivia; Bashir, Muhammad Zain; Cavallo, Filippo

doi:10.3390/robotics12020041

Vision-based pose detection and grasping complex-shaped and thin objects are challenging tasks. We propose an architecture that integrates the Generative Grasping Convolutional Neural Network (GG-CNN) with depth recognition to identify a suitable grasp pose. First, we construct a training dataset with data augmentation to train a GG-CNN with only RGB images. Then, we extract a segment of the tool using a color segmentation method and use it to calculate an average depth. Additionally, we apply and evaluate different encoder–decoder models with a GG-CNN structure using the Intersection Over Union (IOU). Finally, we validate the proposed architecture by performing real-world grasping and pick-and-place experiments. Our framework achieves a success rate of over 85.6% for picking and placing seen surgical tools and 90% for unseen surgical tools. We collected a dataset of surgical tools and validated their pick and place with different GG-CNN architectures. In the future, we aim to expand the dataset of surgical tools and improve the accuracy of the GG-CNN.

Grasping Complex-Shaped and Thin Objects Using a Generative Grasping Convolutional Neural Network / Kim, J., Nocentini, O., Bashir, M.Z., Cavallo, F.. - In: ROBOTICS. - ISSN 2218-6581. - ELETTRONICO. - 12:(2023), pp. 41.1-41.16. [10.3390/robotics12020041]