A Probabilistic Line Breaking Algorithm

Bouckaert, Remco R.

doi:10.1007/978-3-540-24581-0_33

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 2903)

Included in the following conference series:

Australasian Joint Conference on Artificial Intelligence

Abstract

We show how a probabilistic interpretation of an ill-defined problem, that of finding line breaks in a paragraph, can lead to an efficient new algorithm that performs well. The graphical model that results from this interpretation is easy to tune, a direct benefit of the probabilistic approach. Furthermore, the algorithm optimizes the probability that a break-up is acceptable over the whole paragraph, shows no threshold effects, and allows subtle typographical rules to be incorporated easily. Thanks to the architecture of the Bayesian network, the algorithm runs in time linear in the number of characters in a paragraph. Empirical evidence suggests that its output is closer to the results of desktop publishing than the output of a number of existing systems.
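
As a rough illustration of what optimizing acceptability over the whole paragraph can mean, the sketch below scores each candidate line with a log-probability and uses dynamic programming to pick the set of breaks with the highest total. It is not the Bayesian network described in the paper: the scoring function `line_log_prob`, its Gaussian shape, and the `sigma` parameter are assumptions made purely for this example, and the sketch works on whole words rather than characters.

```python
def line_log_prob(length, width, sigma=4.0, is_last=False):
    """Hypothetical log-probability that a line of `length` characters is acceptable."""
    if length > width:
        return float("-inf")  # overfull lines are treated as unacceptable
    if is_last:
        return 0.0  # assumption: a short final line carries no penalty
    slack = width - length
    return -0.5 * (slack / sigma) ** 2  # smooth score, no hard threshold


def break_lines(text, width, sigma=4.0):
    """Choose breaks that maximize the summed log-probability of all lines."""
    words = text.split()
    if any(len(w) > width for w in words):
        raise ValueError("this sketch assumes every word fits on a line")
    n = len(words)

    # prefix[i] = number of characters in words[:i], spaces excluded
    prefix = [0]
    for w in words:
        prefix.append(prefix[-1] + len(w))

    # best[j] = best score for words[:j]; back[j] = index where its last line starts
    best = [float("-inf")] * (n + 1)
    back = [0] * (n + 1)
    best[0] = 0.0
    for j in range(1, n + 1):
        for i in range(j - 1, -1, -1):
            length = prefix[j] - prefix[i] + (j - i - 1)  # add inter-word spaces
            if length > width:
                break  # starting the line even earlier only makes it longer
            score = best[i] + line_log_prob(length, width, sigma, is_last=(j == n))
            if score > best[j]:
                best[j], back[j] = score, i

    # Walk the back-pointers from the end of the paragraph to recover the lines.
    lines = []
    j = n
    while j > 0:
        i = back[j]
        lines.append(" ".join(words[i:j]))
        j = i
    return lines[::-1]


if __name__ == "__main__":
    for line in break_lines("aaa bb cc ddddd", 6):
        print(line)
```

With a width of 6, a greedy first-fit breaker would produce `aaa bb` / `cc` / `ddddd`, leaving one very loose line. The sketch instead returns `aaa` / `bb cc` / `ddddd`, because two mildly loose lines score higher in total than one perfect line followed by a very loose one.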

Author information

Authors and Affiliations

Xtal Mountain Information Technology and Computer Science Department, University of Waikato, New Zealand
Remco R. Bouckaert

Editor information

Editors and Affiliations

Department of Computer Science, Australian National University, ACT 0200, Acton, Australia
Tamás (Tom) Domonkos Gedeon
Murdoch University,
Lance Chun Che Fung

Cite this paper

Bouckaert, R.R. (2003). A Probabilistic Line Breaking Algorithm. In: Gedeon, T.D., Fung, L.C.C. (eds) AI 2003: Advances in Artificial Intelligence. AI 2003. Lecture Notes in Computer Science, vol 2903. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24581-0_33

DOI: https://doi.org/10.1007/978-3-540-24581-0_33
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20646-0
Online ISBN: 978-3-540-24581-0
eBook Packages: Springer Book Archive
