Image Processing and Pattern Recognition Tools for the Automatic Image Transcription

  • Conference paper
Computers Helping People with Special Needs (ICCHP 2016)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 9758)

Abstract

The main objective of this work is to automate the conversion of pedagogical images into information that blind and visually impaired people can easily understand. The method automatically determines the areas of interest in the image, then identifies each region by assigning it a texture that can, for example, be rendered in relief. Text present in the image is also detected, recognized, and converted into an accessible form (Braille text or a spoken message). The resulting tool attempts to find automatically the principal information conveyed by the image and to transmit it to blind people.
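
The region step described above can be illustrated with a small sketch. It is not the authors' method: the abstract does not name the segmentation algorithm or the textures used, so simple 4-connected region growing on grey levels stands in for the segmentation, and the `TEXTURES` list, the `tolerance` parameter, and both function names are hypothetical choices made only for illustration.

```python
from collections import deque

# Hypothetical texture names; the abstract does not list the textures used.
TEXTURES = ["dots", "horizontal lines", "vertical lines", "crosshatch"]


def label_regions(image, tolerance=10):
    """Label 4-connected regions of similar grey level.

    `image` is a list of rows of grey values. A pixel joins a region when
    its value differs from the region's seed pixel by at most `tolerance`
    (an assumed, illustrative criterion).
    """
    height, width = len(image), len(image[0])
    labels = [[-1] * width for _ in range(height)]
    region_count = 0
    for y in range(height):
        for x in range(width):
            if labels[y][x] != -1:
                continue  # pixel already belongs to a region
            seed = image[y][x]
            labels[y][x] = region_count
            queue = deque([(y, x)])
            while queue:
                cy, cx = queue.popleft()
                for ny, nx in ((cy - 1, cx), (cy + 1, cx), (cy, cx - 1), (cy, cx + 1)):
                    inside = 0 <= ny < height and 0 <= nx < width
                    if inside and labels[ny][nx] == -1 and abs(image[ny][nx] - seed) <= tolerance:
                        labels[ny][nx] = region_count
                        queue.append((ny, nx))
            region_count += 1
    return labels, region_count


def assign_textures(region_count):
    """Map each region label to a texture name, cycling through TEXTURES."""
    return {region: TEXTURES[region % len(TEXTURES)] for region in range(region_count)}


# A tiny synthetic "image" with three flat areas.
image = [
    [20, 20, 200, 200],
    [20, 20, 200, 200],
    [90, 90, 90, 90],
]
labels, region_count = label_regions(image)
textures = assign_textures(region_count)
```

Cycling through a fixed texture list means two regions can receive the same texture once there are more regions than textures; a real tool would need a rule that keeps neighbouring regions distinguishable by touch.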

Acknowledgements

This work was supported by PICRI CARTASAM no. 13020599.

Author information

Corresponding author

Correspondence to Zehira Haddad.

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Haddad, Z., Chen, Y., Krahe, J.L. (2016). Image Processing and Pattern Recognition Tools for the Automatic Image Transcription. In: Miesenberger, K., Bühler, C., Penaz, P. (eds) Computers Helping People with Special Needs. ICCHP 2016. Lecture Notes in Computer Science, vol 9758. Springer, Cham. https://doi.org/10.1007/978-3-319-41264-1_26

  • DOI: https://doi.org/10.1007/978-3-319-41264-1_26

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-41263-4

  • Online ISBN: 978-3-319-41264-1

  • eBook Packages: Computer Science, Computer Science (R0)
