TY - JOUR
T1 - Enhanced multi-object tracking via embedded graph matching and differentiable Sinkhorn assignment
T2 - addressing challenges in occlusion and varying object appearances
AU - Zhang, Yajuan
AU - Liang, Yongquan
AU - Wang, Junjie
AU - Zhu, Houying
AU - Wang, Zhihui
PY - 2025/1/2
Y1 - 2025/1/2
N2 - In the realm of computer vision, the duty of multiple objects tracking remains challenging, especially in scenarios involving occlusions and varying object appearances. In this work, we propose an innovative approach leveraging embedded graph matching to address these challenges. The proposed method constructs separate detection and tracklet graphs, to capture contextual relationships and matching constraints. An embedded graph matching network is employed to encode higher-order structural information into vertex features, significantly improving robustness against the cases of occlusions. Incorporating a differentiable Sinkhorn layer enables efficient optimal assignment, enhancing computational efficiency. Our experiments on MOT16, MOT17, and MOT20 datasets demonstrate competitive performance of the proposed method, contributing to smart city surveillance, autonomous driving, and other real-time tracking applications. Here, we achieved a 57.1% MOTA score on MOT17, highlighting the effectiveness of our proposed method.
AB - In the realm of computer vision, the duty of multiple objects tracking remains challenging, especially in scenarios involving occlusions and varying object appearances. In this work, we propose an innovative approach leveraging embedded graph matching to address these challenges. The proposed method constructs separate detection and tracklet graphs, to capture contextual relationships and matching constraints. An embedded graph matching network is employed to encode higher-order structural information into vertex features, significantly improving robustness against the cases of occlusions. Incorporating a differentiable Sinkhorn layer enables efficient optimal assignment, enhancing computational efficiency. Our experiments on MOT16, MOT17, and MOT20 datasets demonstrate competitive performance of the proposed method, contributing to smart city surveillance, autonomous driving, and other real-time tracking applications. Here, we achieved a 57.1% MOTA score on MOT17, highlighting the effectiveness of our proposed method.
KW - Data association
KW - Graph embedding
KW - Graph matching
KW - Multiple object tracking
UR - http://www.scopus.com/inward/record.url?scp=85214034150&partnerID=8YFLogxK
U2 - 10.1007/s00371-024-03772-x
DO - 10.1007/s00371-024-03772-x
M3 - Article
AN - SCOPUS:85214034150
SN - 0178-2789
JO - Visual Computer
JF - Visual Computer
M1 - 103448
ER -