Skip to main content

Interactive Pattern Recognition

  • Conference paper
Machine Learning for Multimodal Interaction (MLMI 2007)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4892))

Included in the following conference series:

Abstract

Pattern Recognition systems are not error-free. Human intervention is typically needed to verify and/or correct the result of such systems. To formalize this fact, a new framework, which integrates the human activity into the recognition process taking advantage of the user’s feedback, is described. Several applications, involving Interactive Speech Transcription and Multimodal Interactive Machine Translation, have recently been considered under this framework. These applications are reviewed in this paper, and some experiments, showing that the proposed framework can save significant amounts of human effort, are also presented.

This work has been partially supported by the Spanish project iDoc TIN2006-15694-C02-01. Reviewer’s comments have significantly improved the original manuscript.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Brown, P.F., Pietra, S.A.D., Pietra, V.J.D., Mercer, R.L.: The mathematics of statistical machine translation: Parameter estimation. Computational Linguistics 19(2), 263–311 (1993)

    Google Scholar 

  2. Casacuberta, F., Ney, H., Och, F.J., Vidal, E., Vilar, J.M., Barrachina, S., García-Varea, I., Llorens, D., Martínez, C., Molau, S., Nevado, F., Pastor, M., Picó, D., Sanchis, A., Tillmann, C.: Some approaches to statistical and finite-state speech-to-speech translation. Computer Speech and Language 18, 25–47 (2004)

    Article  Google Scholar 

  3. Casacuberta, F., Vidal, E.: Learning finite-state models for machine translation. Machine Learning 66(1), 69–91 (2007)

    Article  Google Scholar 

  4. Civera, J., Lagarda, A.L., Cubel, E., Casacuberta, F., Vidal, E., Vilar, J.M., Barrachina, S.: Computer-Assisted Translation Tool based on Finite-State Technology. In: Proc. of EAMT 2006, pp. 33–40 (2006)

    Google Scholar 

  5. Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification, 2nd edn. John Wiley and Sons, New York, NY (2000)

    Google Scholar 

  6. Frederking, R., Rudnicky, A.I., Hogan, C.: Interactive speech translation in the DIPLOMAT project. In: Procs. of the ACL-97 Spoken Language Translation Workshop, pp. 61–66, Madrid, ACL (1997)

    Google Scholar 

  7. Jelinek, F.: Statistical Methods for Speech Recognition. The MIT Press, Cambridge, Massachusetts, USA (1998)

    Google Scholar 

  8. Macklovitch, E.: The contribution of end-users to the transtype2 project (TT2). In: Frederking, R.E., Taylor, K.B. (eds.) AMTA 2004. LNCS (LNAI), vol. 3265, pp. 197–207. Springer, Heidelberg (2004)

    Google Scholar 

  9. Ney, H., Niessen, S., Och, F.J., Sawaf, H., Tillmann, C., Vogel, S.: Algorithms for statistical translation of spoken language. IEEE Transactions on Speech and Audio Processing 8(1), 24–36 (2000)

    Article  Google Scholar 

  10. Rodríguez, L., Casacuberta, F., Vidal, E.: Computer assisted transcription of speech. In: 3rd Iberian Conference on Pattern Recognition and Image Analysis, Girona (Spain) (June 2007)

    Google Scholar 

  11. Suhm, B., Myers, B., Waibel, A.: Multimodal error correction for speech user interfaces. ACM Trans. Comput.-Hum. Interact. 8(1), 60–98 (2001)

    Article  Google Scholar 

  12. Tomás, J., Casacuberta, F.: Statistical phrase-based models for interactive computer-assisted translation. In: Proceedings of the Coling/ACL, pp. 835–841, Sydney, Australia (17th-21th July 2006), http://acl.ldc.upenn.edu/P/P06/P06-2107.pdf

  13. Vidal, E., Casacuberta, F., Rodríguez, L., Civera, J., Martínez, C.: Computer-assisted translation using speech recognition. IEEE Transaction on Audio, Speech and Language Processing 14(3), 941–951 (2006)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Andrei Popescu-Belis Steve Renals Hervé Bourlard

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Vidal, E., Rodríguez, L., Casacuberta, F., García-Varea, I. (2008). Interactive Pattern Recognition. In: Popescu-Belis, A., Renals, S., Bourlard, H. (eds) Machine Learning for Multimodal Interaction. MLMI 2007. Lecture Notes in Computer Science, vol 4892. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-78155-4_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-78155-4_6

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-78154-7

  • Online ISBN: 978-3-540-78155-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics