Skip to main content

Near-Duplicate Video Cleansing Method Based on Locality Sensitive Hashing and the Sorted Neighborhood Method

  • Conference paper
  • First Online:
2nd EAI International Conference on Robotic Sensor Networks

Part of the book series: EAI/Springer Innovations in Communication and Computing ((EAISICC))

Abstract

With the wide utilization of intelligent video surveillance technology, increasing amounts of near-duplicate video has been generated, which seriously affects the data quality of the video data set. Cleaning this dirty data automatically from the video data set has become an important issue that needs to be urgently resolved. In this chapter, a near-duplicate video cleansing method based on locality sensitive hashing (LSH) and the sorted neighborhood method (SNM) is presented in an attempt to solve the above problem. First, the speeded-up robust feature is extracted from the video and then the sorted candidate set is built by using LSH; on this basis, the near-duplicate videos are cleaned by using the SNM. Finally, the simulation experiments are implemented to show that the presented method in this chapter is effective, which can be used to clean near-duplicate videos automatically and improve video data quality.

Please note that the LNICST Editorial assumes that all authors have used the western naming convention, with given names preceding surnames. This determines the structure of the names in the running heads and the author index.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Wang, W., & Zhang, L. (2013). Application and research of security data mining techniques in coal mine mobile video monitoring system (in Chinese). Coal Technology, 9, 101–103.

    Google Scholar 

  2. Chikkerur, S., Sundaram, V., Reisslein, M., et al. (2011). Objective video quality assessment methods: A classification, review, and performance comparison. IEEE Transactions on Broadcasting, 57(2), 165–182.

    Article  Google Scholar 

  3. Ringler, A. T., Hagerty, M. T., Holland, J., et al. (2015). The data quality analyzer: A quality control program for seismic data. Computers & Geosciences, 76, 96–111.

    Article  Google Scholar 

  4. Kim, W., Choi, B. J., Hong, E. K., et al. (2003). A taxonomy of dirty data. Data Mining and Knowledge Discovery, 7(1), 81–99.

    Article  MathSciNet  Google Scholar 

  5. Wu, X., Ngo, C., Hauptmann, A., et al. (2009). Real-time near-duplicate elimination for web video search with content and context. IEEE Transactions on Multimedia, 11(2), 196–207.

    Article  Google Scholar 

  6. Huang, Z., Shen, H. T., Shao, J., Zhou, X., & Cui, B. (2009). Bounded coordinate system indexing for real time video clip search. ACM Transactions on Information Systems, 27(3), 17–33.

    Article  Google Scholar 

  7. Huang, Z., Hu, B., Cheng, H., Shen, H. T., Liu, H., & Zhou, X. (2010). Mining near-duplicate graph for cluster-based reranking of web video search results. ACM Transactions on Information Systems, 28(4), 22.

    Article  Google Scholar 

  8. Zhou, X., Chen, L., Bouguettaya, A., Xiao, N., & Taylor, J. A. (2009). An efficient near duplicate video shot detection method using shot-based interest points. IEEE Transactions on Multimedia, 11(5), 879–891.

    Article  Google Scholar 

  9. Liu, J., Huang, Z., Cai, H., et al. (2013). Near-duplicate video retrieval: Current research and future trends. ACM Computing Surveys, 45(4), 44–46.

    Article  Google Scholar 

  10. Rahm, E., & Do, H. H. (2000). Data cleaning: Problems and current approach. IEEE Data Engineering Bulletin, 23(4), 3–13.

    Google Scholar 

  11. Minnich, A., Abu-El-Rub, N., Gokhale, M., et al. (2016). Clear view: Data cleaning for online review mining. In IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM) (pp. 555–558). San Francisco: IEEE Press.

    Chapter  Google Scholar 

  12. Zobel, J., & Hoad, T. C. (2006). Detection of video sequences using compact signatures. ACM Transactions on Information Systems, 24(1), 1–50.

    Article  Google Scholar 

  13. Douze, M., Jégou, H., & Schmid, C. (2010). An image-based approach to video copy detection with spatiotemporal post-filtering. IEEE Transactions on Multimedia, 12(4), 257–266.

    Article  Google Scholar 

  14. Liu, S., Zhu, M., & Zheng, Q. (2010). A detection method for near duplicate video clips based on content similarity (in Chinese). Journal of University of Science and Technology of China, 40(11), 1130–1135.

    Google Scholar 

  15. Wang, H., & Liu, X. (2012). Near-duplicate web video detection based on locality sensitive hashing (in Chinese). Application Research of Computers, 29(5), 1954–1958.

    Google Scholar 

  16. Liu, D., & Zhu, M. (2013). A fast algorithm for near-duplicate video detection (in Chinese). Journal of Chinese Computer Systems, 34(6), 1400–1406.

    Google Scholar 

  17. Liu, D., & Zhu, M. (2015). A computationally efficient algorithm for large scale near-duplicate video detection. In International Conference on Multimedia Modeling (MMM 2015) (pp. 481–490). Basel: Springer.

    Google Scholar 

  18. Bay, H., Tuytelaars, T., & Van Gool, L. (2008). Speeded-up robust features (SURF). Computer Vision and Image Understanding, 110(3), 346–359.

    Article  Google Scholar 

Download references

Acknowledgements

This work was supported in part by the Shannxi Provincial Department of Education special scientific research project (No.16JK1505).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ou Ye .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Ye, O., Li, Z., Zhang, Y. (2020). Near-Duplicate Video Cleansing Method Based on Locality Sensitive Hashing and the Sorted Neighborhood Method. In: Lu, H., Yujie, L. (eds) 2nd EAI International Conference on Robotic Sensor Networks. EAI/Springer Innovations in Communication and Computing. Springer, Cham. https://doi.org/10.1007/978-3-030-17763-8_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-17763-8_12

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-17762-1

  • Online ISBN: 978-3-030-17763-8

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics