Skip to main content

CloudCV: Large-Scale Distributed Computer Vision as a Cloud Service

  • Chapter
  • First Online:
Mobile Cloud Visual Media Computing

Abstract

We are witnessing a proliferation of massive visual data. Unfortunately, scaling existing computer vision algorithms to large datasets leaves researchers repeatedly solving the same algorithmic, logistical, and infrastructural problems. Our goal is to democratize computer vision; one should not have to be a computer vision, big data, and distributed computing expert to have access to state-of-the-art distributed computer vision algorithms. We present CloudCV, a comprehensive system to provide access to state-of-the-art distributed computer vision algorithms as a cloud service through a web interface and APIs.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 54.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Big data, big impact: New possibilities for international development. World Economic Forum Report (2012) http://www.weforum.org/reports/big-data-big-impact-new-possibilities-international-development

  2. Lohr, S.: The age of big data. New York Times (20102) http://www.nytimes.com/2012/02/12/sunday-review/big-datas-impact-in-the-world.html?pagewanted=all

  3. Berriman, G.B., Groom, S.L.: How will astronomy archives survive the data tsunami? Queue 9(10), 21:20–21:27 (2011)

    Google Scholar 

  4. Kvilekval, K., Fedorov, D., Obara, B., Singh, A., Manjunath, B.: Bisque: a platform for bioimage analysis and management. Bioinformatics 26(4), 544–552 (2010)

    Article  Google Scholar 

  5. Strickland, N.H.: Pacs (picture archiving and communication systems): filmless radiology. Arch. Dis. Child. 83(1), 82–86 (2000)

    Article  MathSciNet  Google Scholar 

  6. Le, Q., Ranzato, M., Monga, R., Devin, M., Chen, K., Corrado, G., Dean, J., Ng, A.: Building high-level features using large scale unsupervised learning. In: International Conference in Machine Learning (2012)

    Google Scholar 

  7. Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., Fei-Fei, L.: ImageNet: a large-scale hierarchical image database. CVPR (2009)

    Google Scholar 

  8. Shvachko, K., Kuang, H., Radia, S., Chansler, R.: The hadoop distributed file system. In: Proceedings of the 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies, pp. 1–10 (2010)

    Google Scholar 

  9. Bradski, G., Kaehler, A.: Learning OpenCV: Computer Vision with the OpenCV Library. O’Reilly (2008). http://opencv.org

  10. Integrating Vision Toolkit. http://ivt.sourceforge.net/

  11. The Vision-something-Libraries. http://vxl.sourceforge.net/

  12. Bolme, D.S., O’Hara, S.: Pyvision—computer vision toolkit (2008) http://pyvision.sourceforge.net

  13. AForge.NET Image Processing Lab. http://www.aforgenet.com/

  14. Bouguet J.Y.: Camera calibration toolbox for Matlab (2008) http://www.vision.caltech.edu/bouguetj/calib_doc/

  15. Furukawa, Y.: Clustering Views for Multi-view Stereo (CMVS). http://grail.cs.washington.edu/software/cmvs/

  16. Snavely, N.: Bundler: Structure from Motion (SfM) for Unordered Image Collections. http://phototour.cs.washington.edu/bundler/

  17. Wu, C.: VisualSFM : a visual structure from motion system. http://www.cs.washington.edu/homes/ccwu/vsfm/

  18. Vedaldi, A., Fulkerson, B.: VLFeat: an open and portable library of computer vision algorithms (2008) http://www.vlfeat.org/

  19. Jia, Y., Shelhamer, E., Donahue, J., Karayev, S., Long, J., Girshick, R., Guadarrama, S., Darrell, T.: Caffe: convolutional architecture for fast feature embedding (2014) arXiv preprint arXiv:1408.5093

  20. Bastien, F., Lamblin, P., Pascanu, R., Bergstra, J., Goodfellow, I.J., Bergeron, A., Bouchard, N., Bengio, Y.: Theano: new features and speed improvements. In: Deep Learning and Unsupervised Feature Learning NIPS 2012 Workshop (2012)

    Google Scholar 

  21. Bergstra, J., Breuleux, O., Bastien, F., Lamblin, P., Pascanu, R., Desjardins, G., Turian, J., Warde-Farley, D., Bengio, Y.: Theano: a CPU and GPU math expression compiler. In: Proceedings of the Python for Scientific Computing Conference (SciPy) (2010)

    Google Scholar 

  22. Torch:A scientific computing framework for LUAJIT. http://torch.ch/

  23. Felzenszwalb, P.F., Girshick, R.B., McAllester, D.: Discriminatively trained deformable part models, release 4. http://www.cs.brown.edu/~pff/latent-release4/

  24. Yang, Y., Ramanan, D.: Articulated pose estimation with flexible mixtures-of-parts. CVPR, pp. 1385–1392 (2011)

    Google Scholar 

  25. Kolmogorov, V., Zabih, R.: What energy functions can be minimized via graph cuts? PAMI 26(2), 147–159 (2004)

    Article  MATH  Google Scholar 

  26. Amazon elastic compute cloud (amazon ec2). http://aws.amazon.com/ec2/

  27. Orbeus rekognition. https://rekognition.com/

  28. Clarifai. http://www.clarifai.com/

  29. vision.ai. http://vision.ai/

  30. Django: the web framework for perfectionists with deadlines. https://www.djangoproject.com/

  31. Node.js. https://nodejs.org/

  32. Socket.IO. http://socket.io/

  33. Redis. http://redis.io/

  34. Celery: distributed task queue. http://www.celeryproject.org/

  35. javaScript Object Notation. http://torch.ch/

  36. Advanced Message Queueing Protocol. https://www.amqp.org

  37. Low, Y., Gonzalez, J., Kyrola, A., Bickson, D., Guestrin, C., Hellerstein, J.M.: Graphlab: a new parallel framework for machine learning. In: UAI (2010)

    Google Scholar 

  38. The Graphlab Computer Vision Toolkit. http://graphlab.org/toolkits/computer-vision/

  39. Krizhevsky, A., Sutskever, I., Hinton, G.: Imagenet classification with deep convolutional neural networks. NIPS (2012)

    Google Scholar 

  40. Donahue, J., Jia, Y., Vinyals, O., Hoffman, J., Zhang, N., Tzeng, E., Darrell, T.: Decaf: a deep convolutional activation feature for generic visual recognition. ICML (2014)

    Google Scholar 

  41. Razavian, A.S., Azizpour, H., Sullivan, J., Carlsson, S.: CNN features off-the-shelf: an astounding baseline for recognition (2014) arXiv preprint arXiv:1403.6382

  42. Mathialagan, C.S., Batra, D., Gallagher, A.C.: Vip: finding important people in group images. In: Computer Vision and Pattern Recognition (2015)

    Google Scholar 

  43. SkyBiometry. https://www.skybiometry.com/

  44. Bradski, G.: The OpenCV library (2000)

    Google Scholar 

  45. Brown, M., Lowe, D.: Automatic panoramic image stitching using invariant features. IJCV 74(1), 59–73 (2007)

    Article  Google Scholar 

  46. Bay, H., Ess, A., Tuytelaars, T., Van Gool, L.: Speeded-up robust features (surf). Comput. Vis. Image Underst. 110(3), 346–359 (2008)

    Article  Google Scholar 

  47. Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vision 60(2), 91–110 (2004)

    Article  Google Scholar 

  48. Triggs, B., Mclauchlan, P., Hartley, R., Fitzgibbon, A.: Bundle adjustment—a modern synthesis. Vision Algorithms: Theory and Practice. Lecture Notes in Computer Science, vol. 1883, pp. 298–372. Springer, Berlin (1999)

    Chapter  Google Scholar 

  49. NVIDIA DIGITS interactive deep learning gpu training system. https://developer.nvidia.com/digits. Accessed 1 June 2015

  50. Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)

    Google Scholar 

  51. Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., Duchesnay, E.: Scikit-learn: machine learning in python. J. Mach. Learn. Res. 12, 2825–2830 (2011)

    MathSciNet  MATH  Google Scholar 

Download references

Acknowledgments

This work was partially supported by the Virginia Tech ICTAS JFC Award, and the National Science Foundation CAREER award IIS-1350553. The views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies or endorsements, either expressed or implied, of the U.S. Government or any sponsor.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Harsh Agrawal .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this chapter

Cite this chapter

Agrawal, H. et al. (2015). CloudCV: Large-Scale Distributed Computer Vision as a Cloud Service. In: Hua, G., Hua, XS. (eds) Mobile Cloud Visual Media Computing. Springer, Cham. https://doi.org/10.1007/978-3-319-24702-1_11

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-24702-1_11

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-24700-7

  • Online ISBN: 978-3-319-24702-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics