TY - GEN
T1 - One-Shot Object Pose Estimation Method for Smart Logistics
AU - Zhang, Yue
AU - Xiao, Nengfei
AU - Yao, Ziying
AU - Li, Yongwei
AU - Wu, Xinkai
N1 - Publisher Copyright:
© ASCE.
PY - 2024
Y1 - 2024
N2 - Smart logistics is an important branch of intelligent transportation. Recognizing the poses of different goods through vision can help achieve fully automated picking and handling of goods by logistics robots, thereby effectively improving productivity. However, in logistics scenarios, there are often challenges such as diverse types of goods and high annotation costs, which greatly increase the difficulty of the 6-degree-of-freedom pose estimation for the targets. In this paper, we propose a transformer-based one-shot object pose estimation method, which enables fast pose estimation for unknown targets without the need to retrain the network. To obtain sparse 3D point cloud models for targets with few features, we apply the LoftR sparse feature image matching method to the SfM (Structure from Motion) pipeline. We use a similarity network based on attention mechanism to estimate similar poses and further optimize the pose estimation network using the transformer method. Experimental results show that the proposed method outperforms existing one-shot object pose estimation methods in terms of accuracy on the GenMOP, OnePose-Lowtexture data sets, and our custom test data sets. It can be practically applied in smart logistics scenarios.
AB - Smart logistics is an important branch of intelligent transportation. Recognizing the poses of different goods through vision can help achieve fully automated picking and handling of goods by logistics robots, thereby effectively improving productivity. However, in logistics scenarios, there are often challenges such as diverse types of goods and high annotation costs, which greatly increase the difficulty of the 6-degree-of-freedom pose estimation for the targets. In this paper, we propose a transformer-based one-shot object pose estimation method, which enables fast pose estimation for unknown targets without the need to retrain the network. To obtain sparse 3D point cloud models for targets with few features, we apply the LoftR sparse feature image matching method to the SfM (Structure from Motion) pipeline. We use a similarity network based on attention mechanism to estimate similar poses and further optimize the pose estimation network using the transformer method. Experimental results show that the proposed method outperforms existing one-shot object pose estimation methods in terms of accuracy on the GenMOP, OnePose-Lowtexture data sets, and our custom test data sets. It can be practically applied in smart logistics scenarios.
UR - https://www.scopus.com/pages/publications/85214004981
U2 - 10.1061/9780784485484.073
DO - 10.1061/9780784485484.073
M3 - 会议稿件
AN - SCOPUS:85214004981
T3 - CICTP 2024: Resilient, Intelligent, Connected, and Lowcarbon Multimodal Transportation - Proceedings of the 24th COTA International Conference of Transportation Professionals
SP - 758
EP - 769
BT - CICTP 2024
A2 - Ma, Jianming
A2 - Luo, Qin
A2 - Sun, Lijun
A2 - Li, Baicheng
A2 - Chen, Jingjing
A2 - Zhang, Guohui
PB - American Society of Civil Engineers (ASCE)
T2 - 24th COTA International Conference of Transportation Professionals: Resilient, Intelligent, Connected, and Lowcarbon Multimodal Transportation, CICTP 2024
Y2 - 23 July 2024 through 26 July 2024
ER -