Automatic comic page segmentation based on polygon detection

Li, Luyuan; Wang, Yongtao; Tang, Zhi; Gao, Liangcai

doi:10.1007/s11042-012-1241-7

Automatic comic page segmentation based on polygon detection

Published: 27 September 2012

Volume 69, pages 171–197, (2014)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Luyuan Li¹,
Yongtao Wang¹,
Zhi Tang^1,2 &
…
Liangcai Gao¹

690 Accesses
20 Citations
6 Altmetric
Explore all metrics

Abstract

Comic page segmentation aims to automatically decompose scanned comic images into storyboards (frames), which is the key technique to produce digital comic documents that are suitable for reading on mobile devices. In this paper, we propose a novel method for comic page segmentation by finding the quadrilateral enclosing box of each storyboard. We first acquire the edge image of the input comic image, and then extract line segments with a heuristic line segment detection algorithm. We perform line clustering to further merge the overlapped line segments and remove the redundancy line segments. Finally, we perform another round of line clustering and post-processing to compose the obtained line segments into complete quadrilateral enclosing boxes of the storyboards. The proposed method is tested on 2,237 comic images from 12 different printed comic series, and the experimental results demonstrate that our method is effective for comic image segmentation and outperforms the existing methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Comic storyboard extraction via edge segment analysis

Article 16 May 2015

Balloon extraction from complex comic books using edge detection and histogram scoring

Article 08 September 2015

Comic Character Recognition (CCR): Extraction of Speech Balloon Context and Character of Interest in Comics

References

Arai K, Tolle H (2010) Automatic e-comic content adaptation. Int J Ubiquit Comput 1(1):1–11
Google Scholar
Ballard DH (1981) Generalizing the Hough transform to detect arbitrary shapes. Pattern Recognit 13(2):111–122
Article MATH Google Scholar
Burns JB, Hanson AR, Riseman EM (1986) Extracting straight lines. IEEE Trans Pattern Anal Mach Intell 8(4):425–455
Article Google Scholar
Canny J (1986) A computational approach to edge detection. IEEE Trans Pattern Anal Mach Intell 8(6):679–698
Article Google Scholar
Chung KL, Lin ZW, Huang ST, Huang YH, Liao HYM (2010) New orientation-based elimination approach for accurate line-detection. Pattern Recognit 31:11–19
Article Google Scholar
Forsyth DA, Ponce J (2002) Computer vision: a modern approach, 1st edn. Prentice Hall 467–490
Gioi RG, Jakubowicz J, Morel JM, Randall G (2010) LSD: a fast line segment detector with a false detection control. IEEE Trans Pattern Anal Mach Intell 32(4):722–732
Article Google Scholar
Grana C, Borghesani D, Cucchiara R (2010) Optimized block-based connected components labeling with decision trees. IEEE Trans Image Process 19(6):1596–1609
Article MathSciNet Google Scholar
Ho CT, Chen LH (1996) A high-speed algorithm for line detection. Pattern Recognit Lett 17:467–473
Article Google Scholar
In Y, Oie T, Higuchi M, Kawasaki S et al (2010) Fast frame decomposition and sorting by contour tracing for mobile phone comic images. Proc. International Conference on Visualization, imaging and simulation (VIS), Wisconsin, 2010:23–28
Ishii D, Watanabe H (2010) A study on frame position detection of digitized comic images. Workshop on Picture Coding and Image Processing (PCSJ), Nagoya, 2010:124–125
Jain AK, Yu B (1998) Document representation and its application to page decomposition. IEEE Trans Pattern Anal Mach Intell 20(12):294–308
Article Google Scholar
Lo RC, Tsai WH (1995) Gray-scale Hough transform for thick line detection in gray-scale images. Pattern Recognit 28(5):647–661
Article Google Scholar
Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2):91–110
Article Google Scholar
Tanaka T, Shoji K, Toyama F, Miyamichi J (2007) Layout analysis of tree-structured scene frames in comic images. Proc. International Joint Conferences on Artificial Intelligence (IJCAI), Hyderabad, January 2007: 2885–2890
Theodoridis S, Koutroumbas K (2008) Pattern recognition, 4th edn. Academic Press 20–50
Viola P, Jones MJ (2004) Robust real-time face detection. Int J Comput Vis 57(2):137–154
Article Google Scholar
Xi J, Hu J, Wu L (2002) Page segmentation of Chinese newspapers. Pattern Recognit 35(12):2695–2704
Article MATH Google Scholar
Yamada M, Budiarto R, Endoo M, Miyazaki S (2004) Comic image decomposition for reading comics on cellular phones. IEICE Trans Inf Syst E87-D(6):1370–1376
Google Scholar

Download references

Acknowledgments

This work is supported by National Basic Research Program of China, also Named “973 Program” (No. 2010CB735908).

Author information

Authors and Affiliations

Institute of Computer Science & Technology, Peking University, Beijing, China
Luyuan Li, Yongtao Wang, Zhi Tang & Liangcai Gao
State Key Laboratory of Digital Publishing Technology, Beijing, China
Zhi Tang

Authors

Luyuan Li
View author publications
You can also search for this author in PubMed Google Scholar
Yongtao Wang
View author publications
You can also search for this author in PubMed Google Scholar
Zhi Tang
View author publications
You can also search for this author in PubMed Google Scholar
Liangcai Gao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Yongtao Wang or Zhi Tang.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Li, L., Wang, Y., Tang, Z. et al. Automatic comic page segmentation based on polygon detection. Multimed Tools Appl 69, 171–197 (2014). https://doi.org/10.1007/s11042-012-1241-7

Download citation

Published: 27 September 2012
Issue Date: March 2014
DOI: https://doi.org/10.1007/s11042-012-1241-7

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Automatic comic page segmentation based on polygon detection

Abstract

Access this article

Similar content being viewed by others

Comic storyboard extraction via edge segment analysis

Balloon extraction from complex comic books using edge detection and histogram scoring

Comic Character Recognition (CCR): Extraction of Speech Balloon Context and Character of Interest in Comics

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding authors

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Automatic comic page segmentation based on polygon detection

Abstract

Access this article

Similar content being viewed by others

Comic storyboard extraction via edge segment analysis

Balloon extraction from complex comic books using edge detection and histogram scoring

Comic Character Recognition (CCR): Extraction of Speech Balloon Context and Character of Interest in Comics

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding authors

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation