Abstract
Pattern Recognition systems are not error-free. Human intervention is typically needed to verify and/or correct the result of such systems. To formalize this fact, a new framework, which integrates the human activity into the recognition process taking advantage of the user’s feedback, is described. Several applications, involving Interactive Speech Transcription and Multimodal Interactive Machine Translation, have recently been considered under this framework. These applications are reviewed in this paper, and some experiments, showing that the proposed framework can save significant amounts of human effort, are also presented.
This work has been partially supported by the Spanish project iDoc TIN2006-15694-C02-01. Reviewer’s comments have significantly improved the original manuscript.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Brown, P.F., Pietra, S.A.D., Pietra, V.J.D., Mercer, R.L.: The mathematics of statistical machine translation: Parameter estimation. Computational Linguistics 19(2), 263–311 (1993)
Casacuberta, F., Ney, H., Och, F.J., Vidal, E., Vilar, J.M., Barrachina, S., García-Varea, I., Llorens, D., Martínez, C., Molau, S., Nevado, F., Pastor, M., Picó, D., Sanchis, A., Tillmann, C.: Some approaches to statistical and finite-state speech-to-speech translation. Computer Speech and Language 18, 25–47 (2004)
Casacuberta, F., Vidal, E.: Learning finite-state models for machine translation. Machine Learning 66(1), 69–91 (2007)
Civera, J., Lagarda, A.L., Cubel, E., Casacuberta, F., Vidal, E., Vilar, J.M., Barrachina, S.: Computer-Assisted Translation Tool based on Finite-State Technology. In: Proc. of EAMT 2006, pp. 33–40 (2006)
Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification, 2nd edn. John Wiley and Sons, New York, NY (2000)
Frederking, R., Rudnicky, A.I., Hogan, C.: Interactive speech translation in the DIPLOMAT project. In: Procs. of the ACL-97 Spoken Language Translation Workshop, pp. 61–66, Madrid, ACL (1997)
Jelinek, F.: Statistical Methods for Speech Recognition. The MIT Press, Cambridge, Massachusetts, USA (1998)
Macklovitch, E.: The contribution of end-users to the transtype2 project (TT2). In: Frederking, R.E., Taylor, K.B. (eds.) AMTA 2004. LNCS (LNAI), vol. 3265, pp. 197–207. Springer, Heidelberg (2004)
Ney, H., Niessen, S., Och, F.J., Sawaf, H., Tillmann, C., Vogel, S.: Algorithms for statistical translation of spoken language. IEEE Transactions on Speech and Audio Processing 8(1), 24–36 (2000)
Rodríguez, L., Casacuberta, F., Vidal, E.: Computer assisted transcription of speech. In: 3rd Iberian Conference on Pattern Recognition and Image Analysis, Girona (Spain) (June 2007)
Suhm, B., Myers, B., Waibel, A.: Multimodal error correction for speech user interfaces. ACM Trans. Comput.-Hum. Interact. 8(1), 60–98 (2001)
Tomás, J., Casacuberta, F.: Statistical phrase-based models for interactive computer-assisted translation. In: Proceedings of the Coling/ACL, pp. 835–841, Sydney, Australia (17th-21th July 2006), http://acl.ldc.upenn.edu/P/P06/P06-2107.pdf
Vidal, E., Casacuberta, F., Rodríguez, L., Civera, J., Martínez, C.: Computer-assisted translation using speech recognition. IEEE Transaction on Audio, Speech and Language Processing 14(3), 941–951 (2006)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Vidal, E., Rodríguez, L., Casacuberta, F., García-Varea, I. (2008). Interactive Pattern Recognition. In: Popescu-Belis, A., Renals, S., Bourlard, H. (eds) Machine Learning for Multimodal Interaction. MLMI 2007. Lecture Notes in Computer Science, vol 4892. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-78155-4_6
Download citation
DOI: https://doi.org/10.1007/978-3-540-78155-4_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-78154-7
Online ISBN: 978-3-540-78155-4
eBook Packages: Computer ScienceComputer Science (R0)