CloudCV: Large-Scale Distributed Computer Vision as a Cloud Service

Agrawal, Harsh; Mathialagan, Clint Solomon; Goyal, Yash; Chavali, Neelima; Banik, Prakriti; Mohapatra, Akrit; Osman, Ahmed; Batra, Dhruv

doi:10.1007/978-3-319-24702-1_11

Harsh Agrawal³,
Clint Solomon Mathialagan³,
Yash Goyal³,
Neelima Chavali³,
Prakriti Banik³,
Akrit Mohapatra³,
Ahmed Osman⁴ &
…
Dhruv Batra³

1045 Accesses
19 Citations

Abstract

We are witnessing a proliferation of massive visual data. Unfortunately, scaling existing computer vision algorithms to large datasets leaves researchers repeatedly solving the same algorithmic, logistical, and infrastructural problems. Our goal is to democratize computer vision; one should not have to be a computer vision, big data, and distributed computing expert to have access to state-of-the-art distributed computer vision algorithms. We present CloudCV, a comprehensive system to provide access to state-of-the-art distributed computer vision algorithms as a cloud service through a web interface and APIs.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Hardcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Big data, big impact: New possibilities for international development. World Economic Forum Report (2012) http://www.weforum.org/reports/big-data-big-impact-new-possibilities-international-development
Lohr, S.: The age of big data. New York Times (20102) http://www.nytimes.com/2012/02/12/sunday-review/big-datas-impact-in-the-world.html?pagewanted=all
Berriman, G.B., Groom, S.L.: How will astronomy archives survive the data tsunami? Queue 9(10), 21:20–21:27 (2011)
Google Scholar
Kvilekval, K., Fedorov, D., Obara, B., Singh, A., Manjunath, B.: Bisque: a platform for bioimage analysis and management. Bioinformatics 26(4), 544–552 (2010)
Article Google Scholar
Strickland, N.H.: Pacs (picture archiving and communication systems): filmless radiology. Arch. Dis. Child. 83(1), 82–86 (2000)
Article MathSciNet Google Scholar
Le, Q., Ranzato, M., Monga, R., Devin, M., Chen, K., Corrado, G., Dean, J., Ng, A.: Building high-level features using large scale unsupervised learning. In: International Conference in Machine Learning (2012)
Google Scholar
Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., Fei-Fei, L.: ImageNet: a large-scale hierarchical image database. CVPR (2009)
Google Scholar
Shvachko, K., Kuang, H., Radia, S., Chansler, R.: The hadoop distributed file system. In: Proceedings of the 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies, pp. 1–10 (2010)
Google Scholar
Bradski, G., Kaehler, A.: Learning OpenCV: Computer Vision with the OpenCV Library. O’Reilly (2008). http://opencv.org
Integrating Vision Toolkit. http://ivt.sourceforge.net/
The Vision-something-Libraries. http://vxl.sourceforge.net/
Bolme, D.S., O’Hara, S.: Pyvision—computer vision toolkit (2008) http://pyvision.sourceforge.net
AForge.NET Image Processing Lab. http://www.aforgenet.com/
Bouguet J.Y.: Camera calibration toolbox for Matlab (2008) http://www.vision.caltech.edu/bouguetj/calib_doc/
Furukawa, Y.: Clustering Views for Multi-view Stereo (CMVS). http://grail.cs.washington.edu/software/cmvs/
Snavely, N.: Bundler: Structure from Motion (SfM) for Unordered Image Collections. http://phototour.cs.washington.edu/bundler/
Wu, C.: VisualSFM : a visual structure from motion system. http://www.cs.washington.edu/homes/ccwu/vsfm/
Vedaldi, A., Fulkerson, B.: VLFeat: an open and portable library of computer vision algorithms (2008) http://www.vlfeat.org/
Jia, Y., Shelhamer, E., Donahue, J., Karayev, S., Long, J., Girshick, R., Guadarrama, S., Darrell, T.: Caffe: convolutional architecture for fast feature embedding (2014) arXiv preprint arXiv:1408.5093
Bastien, F., Lamblin, P., Pascanu, R., Bergstra, J., Goodfellow, I.J., Bergeron, A., Bouchard, N., Bengio, Y.: Theano: new features and speed improvements. In: Deep Learning and Unsupervised Feature Learning NIPS 2012 Workshop (2012)
Google Scholar
Bergstra, J., Breuleux, O., Bastien, F., Lamblin, P., Pascanu, R., Desjardins, G., Turian, J., Warde-Farley, D., Bengio, Y.: Theano: a CPU and GPU math expression compiler. In: Proceedings of the Python for Scientific Computing Conference (SciPy) (2010)
Google Scholar
Torch:A scientific computing framework for LUAJIT. http://torch.ch/
Felzenszwalb, P.F., Girshick, R.B., McAllester, D.: Discriminatively trained deformable part models, release 4. http://www.cs.brown.edu/~pff/latent-release4/
Yang, Y., Ramanan, D.: Articulated pose estimation with flexible mixtures-of-parts. CVPR, pp. 1385–1392 (2011)
Google Scholar
Kolmogorov, V., Zabih, R.: What energy functions can be minimized via graph cuts? PAMI 26(2), 147–159 (2004)
Article MATH Google Scholar
Amazon elastic compute cloud (amazon ec2). http://aws.amazon.com/ec2/
Orbeus rekognition. https://rekognition.com/
Clarifai. http://www.clarifai.com/
vision.ai. http://vision.ai/
Django: the web framework for perfectionists with deadlines. https://www.djangoproject.com/
Node.js. https://nodejs.org/
Socket.IO. http://socket.io/
Redis. http://redis.io/
Celery: distributed task queue. http://www.celeryproject.org/
javaScript Object Notation. http://torch.ch/
Advanced Message Queueing Protocol. https://www.amqp.org
Low, Y., Gonzalez, J., Kyrola, A., Bickson, D., Guestrin, C., Hellerstein, J.M.: Graphlab: a new parallel framework for machine learning. In: UAI (2010)
Google Scholar
The Graphlab Computer Vision Toolkit. http://graphlab.org/toolkits/computer-vision/
Krizhevsky, A., Sutskever, I., Hinton, G.: Imagenet classification with deep convolutional neural networks. NIPS (2012)
Google Scholar
Donahue, J., Jia, Y., Vinyals, O., Hoffman, J., Zhang, N., Tzeng, E., Darrell, T.: Decaf: a deep convolutional activation feature for generic visual recognition. ICML (2014)
Google Scholar
Razavian, A.S., Azizpour, H., Sullivan, J., Carlsson, S.: CNN features off-the-shelf: an astounding baseline for recognition (2014) arXiv preprint arXiv:1403.6382
Mathialagan, C.S., Batra, D., Gallagher, A.C.: Vip: finding important people in group images. In: Computer Vision and Pattern Recognition (2015)
Google Scholar
SkyBiometry. https://www.skybiometry.com/
Bradski, G.: The OpenCV library (2000)
Google Scholar
Brown, M., Lowe, D.: Automatic panoramic image stitching using invariant features. IJCV 74(1), 59–73 (2007)
Article Google Scholar
Bay, H., Ess, A., Tuytelaars, T., Van Gool, L.: Speeded-up robust features (surf). Comput. Vis. Image Underst. 110(3), 346–359 (2008)
Article Google Scholar
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vision 60(2), 91–110 (2004)
Article Google Scholar
Triggs, B., Mclauchlan, P., Hartley, R., Fitzgibbon, A.: Bundle adjustment—a modern synthesis. Vision Algorithms: Theory and Practice. Lecture Notes in Computer Science, vol. 1883, pp. 298–372. Springer, Berlin (1999)
Chapter Google Scholar
NVIDIA DIGITS interactive deep learning gpu training system. https://developer.nvidia.com/digits. Accessed 1 June 2015
Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)
Google Scholar
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., Duchesnay, E.: Scikit-learn: machine learning in python. J. Mach. Learn. Res. 12, 2825–2830 (2011)
MathSciNet MATH Google Scholar

Download references

Acknowledgments

This work was partially supported by the Virginia Tech ICTAS JFC Award, and the National Science Foundation CAREER award IIS-1350553. The views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies or endorsements, either expressed or implied, of the U.S. Government or any sponsor.

Author information

Authors and Affiliations

Virginia Tech, Blacksburg, VA, USA
Harsh Agrawal, Clint Solomon Mathialagan, Yash Goyal, Neelima Chavali, Prakriti Banik, Akrit Mohapatra & Dhruv Batra
Imperial College London, London, UK
Ahmed Osman

Authors

Harsh Agrawal
View author publications
You can also search for this author in PubMed Google Scholar
Clint Solomon Mathialagan
View author publications
You can also search for this author in PubMed Google Scholar
Yash Goyal
View author publications
You can also search for this author in PubMed Google Scholar
Neelima Chavali
View author publications
You can also search for this author in PubMed Google Scholar
Prakriti Banik
View author publications
You can also search for this author in PubMed Google Scholar
Akrit Mohapatra
View author publications
You can also search for this author in PubMed Google Scholar
Ahmed Osman
View author publications
You can also search for this author in PubMed Google Scholar
Dhruv Batra
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Harsh Agrawal .

Editor information

Editors and Affiliations

Visual Computing Group, Microsoft Research Asia, Beijing, Beijing, China
Gang Hua
Alibaba Group, Hangzhou, Zhejiang, China
Xian-Sheng Hua

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Agrawal, H. et al. (2015). CloudCV: Large-Scale Distributed Computer Vision as a Cloud Service. In: Hua, G., Hua, XS. (eds) Mobile Cloud Visual Media Computing. Springer, Cham. https://doi.org/10.1007/978-3-319-24702-1_11

Download citation

DOI: https://doi.org/10.1007/978-3-319-24702-1_11
Published: 24 November 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-24700-7
Online ISBN: 978-3-319-24702-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics