Computer Assisted Transcription: General Framework

Toselli, Alejandro Héctor; Vidal, Enrique; Casacuberta, Francisco

doi:10.1007/978-0-85729-479-1_2

Abstract

This chapter describes the common foundations of the computer assisted transcription approaches presented in the three subsequent chapters (Chaps. 3, 4 and 5). It also gives a general overview of the features shared by the up-to-date systems we have employed for handwritten text and speech recognition.

A specific mathematical formulation and modeling, suited to the interactive transcription of handwritten text images and speech signals, are derived from a particular instantiation of the general interactive–predictive framework introduced in Sect. 1.3.3. On this basis, and adopting the passive left-to-right interaction protocol described in Sect. 1.4.2, the two basic computer assisted handwriting and speech transcription approaches are developed (detailed in Chaps. 3 and 4, respectively), along with the evaluation measures used to assess their performance.

With contributions by Verónica Romero and Luis Rodriguez.

Author information

Dr. Alejandro Héctor Toselli
Dr. Enrique Vidal
Prof. Francisco Casacuberta

Corresponding author

Correspondence to Alejandro Héctor Toselli.

About this chapter

Cite this chapter

Toselli, A.H., Vidal, E., Casacuberta, F. (2011). Computer Assisted Transcription: General Framework. In: Multimodal Interactive Pattern Recognition and Applications. Springer, London. https://doi.org/10.1007/978-0-85729-479-1_2

DOI: https://doi.org/10.1007/978-0-85729-479-1_2
Publisher Name: Springer, London
Print ISBN: 978-0-85729-478-4
Online ISBN: 978-0-85729-479-1
eBook Packages: Computer Science (R0)