TY - JOUR
T1 - LLA
T2 - Loss-aware label assignment for dense pedestrian detection
AU - Ge, Zheng
AU - Wang, Jianfeng
AU - Huang, Xin
AU - Liu, Songtao
AU - Yoshie, Osamu
N1 - Publisher Copyright:
© 2021 The Authors
PY - 2021/10/28
Y1 - 2021/10/28
N2 - Label assignment has been widely studied in general object detection because of its great impact on detectors’ performance. In the field of dense pedestrian detection, human bodies are often heavily entangled, making label assignment more important. However, none of the existing label assignment method focuses on crowd scenarios. Motivated by this, we propose Loss-aware Label Assignment (LLA) to boost the performance of pedestrian detectors in crowd scenarios. Concretely, LLA first calculates classification (cls) and regression (reg) losses between each anchor and ground-truth (GT) pair. A joint loss is then defined as the weighted summation of cls and reg losses as the assigning indicator. Finally, anchors with top K minimum joint losses for a certain GT box are assigned as its positive anchors. Anchors that are not assigned to any GT box are considered negative. LLA is simple but effective. Experiments on CrowdHuman and CityPersons show that such a simple label assigning strategy can boost MR by 9.53% and 5.47% on two famous one-stage detectors – RetinaNet and FCOS, becoming the first one-stage detector that surpasses Faster R-CNN in crowd scenarios.
AB - Label assignment has been widely studied in general object detection because of its great impact on detectors’ performance. In the field of dense pedestrian detection, human bodies are often heavily entangled, making label assignment more important. However, none of the existing label assignment method focuses on crowd scenarios. Motivated by this, we propose Loss-aware Label Assignment (LLA) to boost the performance of pedestrian detectors in crowd scenarios. Concretely, LLA first calculates classification (cls) and regression (reg) losses between each anchor and ground-truth (GT) pair. A joint loss is then defined as the weighted summation of cls and reg losses as the assigning indicator. Finally, anchors with top K minimum joint losses for a certain GT box are assigned as its positive anchors. Anchors that are not assigned to any GT box are considered negative. LLA is simple but effective. Experiments on CrowdHuman and CityPersons show that such a simple label assigning strategy can boost MR by 9.53% and 5.47% on two famous one-stage detectors – RetinaNet and FCOS, becoming the first one-stage detector that surpasses Faster R-CNN in crowd scenarios.
KW - Label assignment
KW - Occlusion aware
KW - Pedestrain detection
UR - http://www.scopus.com/inward/record.url?scp=85112413361&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85112413361&partnerID=8YFLogxK
U2 - 10.1016/j.neucom.2021.07.094
DO - 10.1016/j.neucom.2021.07.094
M3 - Article
AN - SCOPUS:85112413361
SN - 0925-2312
VL - 462
SP - 272
EP - 281
JO - Neurocomputing
JF - Neurocomputing
ER -