TY - GEN
T1 - Person re-identification by two-stream feature-fusion architecture utilizing a partial body image
AU - Hiroi, Yuki
AU - Kameyama, Wataru
N1 - Publisher Copyright:
© 2020 IEEE.
PY - 2020/10/13
Y1 - 2020/10/13
N2 - Because of the proliferation of surveillance camera and the wide range of its utilization, 'Person Re-identification' technology has been drawing attention. However, the issues such as differences in person's appearances depending on their wearing items, clothes and behaviors still remain. Therefore, in this paper, we propose a two-stream feature-fusion architecture to improve the re-identification accuracy, where spatio-temporal features of partial body images, that we conceive to represent person's individuality robust to such differences, and the corresponding entire images, by applying convolutional LSTM and 3D CNN. The evaluation using the MARS dataset shows that the feet features are most effective among the four horizontally-split partial body images. And the CMS (Cumulative Match Score) by convolutional LSTM applied to the feet features in the proposed architecture is higher than the existing method which applies CNN and temporal pooling only to the entire images. The results show that it is effective to additionally use spatio-temporal features of feet in the MARS dataset.
AB - Because of the proliferation of surveillance camera and the wide range of its utilization, 'Person Re-identification' technology has been drawing attention. However, the issues such as differences in person's appearances depending on their wearing items, clothes and behaviors still remain. Therefore, in this paper, we propose a two-stream feature-fusion architecture to improve the re-identification accuracy, where spatio-temporal features of partial body images, that we conceive to represent person's individuality robust to such differences, and the corresponding entire images, by applying convolutional LSTM and 3D CNN. The evaluation using the MARS dataset shows that the feet features are most effective among the four horizontally-split partial body images. And the CMS (Cumulative Match Score) by convolutional LSTM applied to the feet features in the proposed architecture is higher than the existing method which applies CNN and temporal pooling only to the entire images. The results show that it is effective to additionally use spatio-temporal features of feet in the MARS dataset.
KW - 3D CNN
KW - Convolutional LSTM
KW - Partial Body Image
KW - Person Re-identification
KW - Two-stream Feature-fusion Architecture
UR - http://www.scopus.com/inward/record.url?scp=85099399997&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85099399997&partnerID=8YFLogxK
U2 - 10.1109/GCCE50665.2020.9291887
DO - 10.1109/GCCE50665.2020.9291887
M3 - Conference contribution
AN - SCOPUS:85099399997
T3 - 2020 IEEE 9th Global Conference on Consumer Electronics, GCCE 2020
SP - 399
EP - 400
BT - 2020 IEEE 9th Global Conference on Consumer Electronics, GCCE 2020
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 9th IEEE Global Conference on Consumer Electronics, GCCE 2020
Y2 - 13 October 2020 through 16 October 2020
ER -