TY - GEN
T1 - SCT-CNN
T2 - 2021 IEEE International Conference on Robotics and Automation, ICRA 2021
AU - Yan, Gang
AU - Schmitz, Alexander
AU - Funabashi, Satoshi
AU - Somlor, Sophon
AU - Tomo, Tito Pradhono
AU - Sugano, Shigeki
N1 - Funding Information:
This research was supported by the JST Grant-in-Aid No. JPMJMS2031, the JSPS Grant-in-Aid No. 19K14948, No. 19H02116, No. 19H01130, MIC project No. JPJ000595, the JST ACT-I Information and Future No. 50185, the Tateishi Science and Technology Foundation Research Grant (S), and the Research Institute for Science and Engineering, Waseda University.
Publisher Copyright:
© 2021 IEEE
PY - 2021
Y1 - 2021
N2 - Recently, tactile sensing has attracted great interest for robotic manipulation. Predicting if a grasp will be stable or not, i.e. if the grasped object will drop out of the gripper while being lifted, can aid robust robotic grasping. Previous methods paid equal attention to all regions of the tactile data matrix or all time-steps in the tactile sequence, which may include irrelevant or redundant information. In this paper, we propose to equip Convolutional Neural Networks with spatial-channel and temporal attention mechanisms (SCT attention CNN) to predict future grasp stability. To the best of our knowledge, this is the first time to use attention mechanisms for predicting grasp stability only relying on tactile information. We implement our experiments with 52 daily objects. Moreover, we compare different spatio-temporal models and attention mechanisms as an empirical study. We found a significant accuracy improvement of up to 5% when using SCT attention. We believe that attention mechanisms can also improve the performance of other tactile learning tasks in the future, such as slip detection and hardness perception.
AB - Recently, tactile sensing has attracted great interest for robotic manipulation. Predicting if a grasp will be stable or not, i.e. if the grasped object will drop out of the gripper while being lifted, can aid robust robotic grasping. Previous methods paid equal attention to all regions of the tactile data matrix or all time-steps in the tactile sequence, which may include irrelevant or redundant information. In this paper, we propose to equip Convolutional Neural Networks with spatial-channel and temporal attention mechanisms (SCT attention CNN) to predict future grasp stability. To the best of our knowledge, this is the first time to use attention mechanisms for predicting grasp stability only relying on tactile information. We implement our experiments with 52 daily objects. Moreover, we compare different spatio-temporal models and attention mechanisms as an empirical study. We found a significant accuracy improvement of up to 5% when using SCT attention. We believe that attention mechanisms can also improve the performance of other tactile learning tasks in the future, such as slip detection and hardness perception.
UR - http://www.scopus.com/inward/record.url?scp=85125487733&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85125487733&partnerID=8YFLogxK
U2 - 10.1109/ICRA48506.2021.9561397
DO - 10.1109/ICRA48506.2021.9561397
M3 - Conference contribution
AN - SCOPUS:85125487733
T3 - Proceedings - IEEE International Conference on Robotics and Automation
SP - 2627
EP - 2634
BT - 2021 IEEE International Conference on Robotics and Automation, ICRA 2021
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 30 May 2021 through 5 June 2021
ER -