Abstract
Effective and accurate detection of clothing categories in streets is imperative for the description and analysis of crowd clothing profile which is crucial for fashion designers and E-commerce. In this paper, we present a real-time tracking system from surveillance videos to detect and track the various clothing categories with a state-of-the-art deep learning approach, which proposes a combinational framework based on deep convolutional neural network (CNN). First, we take advantage of the mechanism of Focal Loss to improve the loss function of the one-stage detector YOLOv2. We adopt CREST as the visual tracker which largely overcomes the difficulties of occlusion and deformation for clothing detection. Additionally, we have collected and preprocessed a dataset including over 1,200,000 still images from previous works and thousands of video fragments for the training and validation of CNNs. Finally, we compare our work with the baseline and previous work, and our framework demonstrates its effectiveness and accuracy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Z. Hu, H. Yan, and X. Lin. Clothing segmentation using foreground and background estimation based on the constrained Delaunay triangulation. Pattern Recognition, vol. 41, no. 5, 2008.
M. Yang, S. Zhu, F. Lv, and K. Yu. Correspondence driven adaptation for human profile recognition. In Proc. CVPR, 2011.
X. Chao, M. J. Huiskes, T. Gritti, and C. Ciuhu. A framework for robust feature selection for real-time fashion style recommendation. In Proc. ACM Multimedia Workshop Interactive Multimedia for Consumer Electronics, October 2009, pp. 35–42.
Ming Yang, Kai Yu. Real-time clothing recognition in surveillance videos. In: IEEE International Conference on Image Processing, pp. 2937–2940, 2011.
A. C. Gallagher and T. Chen. Clothing cosegmentation for recognizing people. In Proc. CVPR, June 2008.
Z. Hu, H. Yan, and X. Lin. Clothing segmentation using foreground and background estimation based on the constrained Delaunay triangulation. Pattern Recognition, vol. 41, no. 5, pp. 1581–1592, May 2008.
D. Anguelov, K. C. Lee, S. B. Gokturk, and B. Sumengen. Contextual identity recognition in personal photo albums. In Proc. CVPR, 2007.
A. C. Gallagher and T. Chen. Clothing cosegmentation for recognizing people. In Proc. CVPR, June 2008.
M. Yang, K. Yu. Real-time clothing recognition in surveillance videos. ICIP, 2011.
M. H. Kiapour, X. Han, S. Lazebnik, A. C. Berg, and T. L. Berg. Where to buy it: Matching street clothing photos in online shops. In ICCV, 2015.
J. Huang, R. S. Feris, Q. Chen, and S. Yan. Cross-domain image retrieval with a dual attribute-aware ranking network. In ICCV, 2015.
Ziwei Liu, Ping Luo, Shi Qiu, Xiaogang Wang, and Xiaoou Tang. Deep Fashion: Powering Robust Clothes Recognition and Retrieval with Rich Annotations. In CVPR, 2016.
M. H. Kiapour, K. Yamaguchi, A. C. Berg, and T. L. Berg. Hipster wars: Discovering elements of fashion styles. In ECCV, pages 472–488. 2014.
Q. Chen, J. Huang, R. Feris, L. M. Brown, J. Dong, and S. Yan. Deep domain adaptation for describing people based on fine-grained clothing attributes. In CVPR, pages 5315–5324, 2015.
E. Simo-Serra, S. Fidler, F. Moreno-Noguer, and R. Urtasun. Neuroaesthetics in fashion: Modeling the perception of beauty. In CVPR, 2015.
Y. Kalantidis, L. Kennedy, and L.-J. Li. Getting the look: clothing recognition and segmentation for automatic product suggestions in everyday photos. In ICMR, 2013.
T. Xiao, T. Xia, Y. Yang, C. Huang, and X. Wang. Learning from massive noisy labeled data for image classification. In CVPR, pages 2691–2699, 2015.
S. Liu, Z. Song, G. Liu, C. Xu, H. Lu, and S. Yan. Street-toshop: Cross-scenario clothing retrieval via parts alignment and auxiliary set. In CVPR, pages 3330–3337, 2012.
A. Veit, B. Kovacs, S. Bell, J. McAuley, K. Bala, and S. Belongie. Learning visual clothing style with heterogeneous dyadic co-occurrences. In ICCV, pages 4642–4650, 2015.
X. Liang, L. Lin, W. Yang, P. Luo, J. Huang, and S. Yan. Clothes co-parsing via joint image segmentation and labeling with application to clothing retrieval. In IEEE Transactions on Multimedia, 2016.
Bewley, Alex and Ge, Zongyuan and Ott, Lionel and Ramos, Fabio and Upcroft, Ben. Simple online and realtime tracking. ICIP, 2016.
Song Yibing, Ma Chao, Gong Lijun, Zhang Jiawei, Lau Rynson and Yang Ming-Hsuan. CREST: Convolutional Residual Learning for Visual Tracking. ICCV, 2017.
Joseph Redmon, Santosh Divvala, Ross Girshick, and Ali Farhadi. You Only Look Once: Unified, Real-Time Object Detection. CVPR, 2016.
Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng-Yang Fu, Alexander C. Berg. SSD: Single Shot MultiBox Detector. ECCV, 2016.
Joseph Redmon, Ali Farhadi. YOLO9000: Better, Faster, Stronger. CVPR, 2017.
Tsung-Yi Lin, Priya Goyal, Ross Girshick, Kaiming He, and Piotr Dollár. Focal Loss for Dense Object Detection. ICCV, 2017.
Acknowledgments
This work is supported by WTU SF-165003/165010 and FRFCU SCUN-CZQ18008.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Huang, J., Wu, X., Zhu, J., He, R. (2019). Real-Time Clothing Detection with Convolutional Neural Network. In: Patnaik, S., Jain, V. (eds) Recent Developments in Intelligent Computing, Communication and Devices. Advances in Intelligent Systems and Computing, vol 752. Springer, Singapore. https://doi.org/10.1007/978-981-10-8944-2_28
Download citation
DOI: https://doi.org/10.1007/978-981-10-8944-2_28
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-8943-5
Online ISBN: 978-981-10-8944-2
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)