Interactive Pattern Recognition

Vidal, Enrique; Rodríguez, Luis; Casacuberta, Francisco; García-Varea, Ismael

doi:10.1007/978-3-540-78155-4_6

Enrique Vidal²,
Luis Rodríguez¹,
Francisco Casacuberta² &
…
Ismael García-Varea¹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4892))

Included in the following conference series:

International Workshop on Machine Learning for Multimodal Interaction

1073 Accesses
22 Citations

Abstract

Pattern Recognition systems are not error-free. Human intervention is typically needed to verify and/or correct the result of such systems. To formalize this fact, a new framework, which integrates the human activity into the recognition process taking advantage of the user’s feedback, is described. Several applications, involving Interactive Speech Transcription and Multimodal Interactive Machine Translation, have recently been considered under this framework. These applications are reviewed in this paper, and some experiments, showing that the proposed framework can save significant amounts of human effort, are also presented.

This work has been partially supported by the Spanish project iDoc TIN2006-15694-C02-01. Reviewer’s comments have significantly improved the original manuscript.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Brown, P.F., Pietra, S.A.D., Pietra, V.J.D., Mercer, R.L.: The mathematics of statistical machine translation: Parameter estimation. Computational Linguistics 19(2), 263–311 (1993)
Google Scholar
Casacuberta, F., Ney, H., Och, F.J., Vidal, E., Vilar, J.M., Barrachina, S., García-Varea, I., Llorens, D., Martínez, C., Molau, S., Nevado, F., Pastor, M., Picó, D., Sanchis, A., Tillmann, C.: Some approaches to statistical and finite-state speech-to-speech translation. Computer Speech and Language 18, 25–47 (2004)
Article Google Scholar
Casacuberta, F., Vidal, E.: Learning finite-state models for machine translation. Machine Learning 66(1), 69–91 (2007)
Article Google Scholar
Civera, J., Lagarda, A.L., Cubel, E., Casacuberta, F., Vidal, E., Vilar, J.M., Barrachina, S.: Computer-Assisted Translation Tool based on Finite-State Technology. In: Proc. of EAMT 2006, pp. 33–40 (2006)
Google Scholar
Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification, 2nd edn. John Wiley and Sons, New York, NY (2000)
Google Scholar
Frederking, R., Rudnicky, A.I., Hogan, C.: Interactive speech translation in the DIPLOMAT project. In: Procs. of the ACL-97 Spoken Language Translation Workshop, pp. 61–66, Madrid, ACL (1997)
Google Scholar
Jelinek, F.: Statistical Methods for Speech Recognition. The MIT Press, Cambridge, Massachusetts, USA (1998)
Google Scholar
Macklovitch, E.: The contribution of end-users to the transtype2 project (TT2). In: Frederking, R.E., Taylor, K.B. (eds.) AMTA 2004. LNCS (LNAI), vol. 3265, pp. 197–207. Springer, Heidelberg (2004)
Google Scholar
Ney, H., Niessen, S., Och, F.J., Sawaf, H., Tillmann, C., Vogel, S.: Algorithms for statistical translation of spoken language. IEEE Transactions on Speech and Audio Processing 8(1), 24–36 (2000)
Article Google Scholar
Rodríguez, L., Casacuberta, F., Vidal, E.: Computer assisted transcription of speech. In: 3rd Iberian Conference on Pattern Recognition and Image Analysis, Girona (Spain) (June 2007)
Google Scholar
Suhm, B., Myers, B., Waibel, A.: Multimodal error correction for speech user interfaces. ACM Trans. Comput.-Hum. Interact. 8(1), 60–98 (2001)
Article Google Scholar
Tomás, J., Casacuberta, F.: Statistical phrase-based models for interactive computer-assisted translation. In: Proceedings of the Coling/ACL, pp. 835–841, Sydney, Australia (17th-21th July 2006), http://acl.ldc.upenn.edu/P/P06/P06-2107.pdf
Vidal, E., Casacuberta, F., Rodríguez, L., Civera, J., Martínez, C.: Computer-assisted translation using speech recognition. IEEE Transaction on Audio, Speech and Language Processing 14(3), 941–951 (2006)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Departamento de Sistemas Informáticos, Universidad de Castilla La Mancha, Spain
Luis Rodríguez & Ismael García-Varea
Departamento de Sistemas Informáticosy Computación, Universidad Politécnica de Valencia, Spain
Enrique Vidal & Francisco Casacuberta

Authors

Enrique Vidal
View author publications
You can also search for this author in PubMed Google Scholar
Luis Rodríguez
View author publications
You can also search for this author in PubMed Google Scholar
Francisco Casacuberta
View author publications
You can also search for this author in PubMed Google Scholar
Ismael García-Varea
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Andrei Popescu-Belis Steve Renals Hervé Bourlard

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Vidal, E., Rodríguez, L., Casacuberta, F., García-Varea, I. (2008). Interactive Pattern Recognition. In: Popescu-Belis, A., Renals, S., Bourlard, H. (eds) Machine Learning for Multimodal Interaction. MLMI 2007. Lecture Notes in Computer Science, vol 4892. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-78155-4_6

Download citation

DOI: https://doi.org/10.1007/978-3-540-78155-4_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-78154-7
Online ISBN: 978-3-540-78155-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics