Skip to main content

Geometric Transformations

  • Chapter
  • First Online:
  • 2253 Accesses

Abstract

In this chapter, techniques for remapping pixel locations are described. These techniques are known as geometric transformations of the images, necessary in different applications of the artificial vision, both to correct any geometric distortions or to introduce desired visual effects. Geometric correction is usually introduced during the acquisition, for example, images acquired while the objects or the sensors are moving, as in the case of satellite and/or aerial acquisitions. In all cases, the geometrical operator must be able to reproduce as accurately as possible the image with the same initial information content through the image resampling process. The class of affine transformations that covers different transformations is presented: translation, rotation, resizing, and shearing. Elementary transformations can be combined in different ways by multiplying the matrices in different orders, producing a different number of transformations. Furthermore, projective transformations are described, which are known as a subclass of warping techniques. The perspective transformation (already considered in the chapter of the image formation process) which projects 3D points on a plane is also described from an algebraic-geometric point of view. Perspective transformations play a central role in image processing because they provide an approximation to the way an image is formed by looking at a 3D world. The projective and perspective transformation essentially changes the geometric and radiometric characteristics with which the image was acquired. This leads to the need for image resampling which is the process of regaining a discrete image from a set of pixels geometrically transformed into another discrete set of pixel coordinates. Finally, the method of resampling images with the various methods of interpolation and image sampling is described.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   189.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   249.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   249.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    From now on we will assume \( \Delta x = \Delta y = 1\).

  2. 2.

    For simplicity, we will use the interchangeable term of image and function in the 2D case.

  3. 3.

    In essence, RMSE compares the predicted value (in this case the intensity of the pixels of the original image) and the observed value (in this case the intensity of the pixels of the processed image) defined as follows:

    $$\begin{aligned} RMSE=\sqrt{\frac{\sum _{i=1}^{N} (predval_{i}-obsval_{i})^{2})}{N}} \end{aligned}$$

    where N indicates the number of pixels in the image.

References

  1. A. Ben-Israel, T.N.E. Greville, Generalized Inverses. Theory and Applications, 2nd edn. (Springer, New York, 2003). ISBN 0-387-00293-6

    Google Scholar 

  2. D.C. Brown, Decentering distortion of lenses. Photogramm. Eng. 32(3), 444–462 (1966)

    Google Scholar 

  3. M.J. Burge, W. Burger, Principles of Digital Image Processing: Core Algorithms, 1st edn. (Springer, Berlin, 2009). ISBN 978-1-84800-194-7

    Google Scholar 

  4. E. Catmull, A.R. Smith, 3-d transformations of images in scanline order. Comput. Graph. 14(3), 279–285 (1980)

    Article  Google Scholar 

  5. R. Hartley, A. Zisserman, Multiple View Geometry in Computer Vision, 2nd edn. (Cambridge University Press, Cambridge, 2003)

    Google Scholar 

  6. H.S. Hou, H.C. Andrews, Cubic splines for image interpolation and digital filtering. IEEE Trans. Acoust. Speech Signal Process. 26, 508–517 (1978)

    Google Scholar 

  7. A.J. Jerri, The Gibbs Phenomenon in Fourier Analysis, Splines, and Wavelet Application, 1st edn. (Kluwer Academic, New York, 1998)

    MATH  Google Scholar 

  8. R. Keys, Cubic convolution interpolation for digital image processing. IEEE Trans. Acoust. Speech Signal Process. 29(6), 1153–1160 (1981)

    Article  MathSciNet  Google Scholar 

  9. S.S. Rifman, D.M. McKinnon, Evaluation of digital correction techniques for ERTS images. Technical Report TRW 20634-6003-TU-OO (TRW Corporation, 1974)

    Google Scholar 

  10. I.J. Schoenberg, Cardinal Spline Interpolation. Regional Conference Series in Applied Mathematics, vol. 12 (1973)

    Google Scholar 

  11. Z. Zhang, A flexible new technique for camera calibration. IEEE Trans. Pattern Anal. Mach. Intell. 22(11), 1330–1334 (2000)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Arcangelo Distante .

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Distante, A., Distante, C. (2020). Geometric Transformations. In: Handbook of Image Processing and Computer Vision. Springer, Cham. https://doi.org/10.1007/978-3-030-42374-2_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-42374-2_3

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-42373-5

  • Online ISBN: 978-3-030-42374-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics