Abstract
We are witnessing a proliferation of massive visual data. Unfortunately, scaling existing computer vision algorithms to large datasets leaves researchers repeatedly solving the same algorithmic, logistical, and infrastructural problems. Our goal is to democratize computer vision; one should not have to be a computer vision, big data, and distributed computing expert to have access to state-of-the-art distributed computer vision algorithms. We present CloudCV, a comprehensive system to provide access to state-of-the-art distributed computer vision algorithms as a cloud service through a web interface and APIs.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Big data, big impact: New possibilities for international development. World Economic Forum Report (2012) http://www.weforum.org/reports/big-data-big-impact-new-possibilities-international-development
Lohr, S.: The age of big data. New York Times (20102) http://www.nytimes.com/2012/02/12/sunday-review/big-datas-impact-in-the-world.html?pagewanted=all
Berriman, G.B., Groom, S.L.: How will astronomy archives survive the data tsunami? Queue 9(10), 21:20–21:27 (2011)
Kvilekval, K., Fedorov, D., Obara, B., Singh, A., Manjunath, B.: Bisque: a platform for bioimage analysis and management. Bioinformatics 26(4), 544–552 (2010)
Strickland, N.H.: Pacs (picture archiving and communication systems): filmless radiology. Arch. Dis. Child. 83(1), 82–86 (2000)
Le, Q., Ranzato, M., Monga, R., Devin, M., Chen, K., Corrado, G., Dean, J., Ng, A.: Building high-level features using large scale unsupervised learning. In: International Conference in Machine Learning (2012)
Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., Fei-Fei, L.: ImageNet: a large-scale hierarchical image database. CVPR (2009)
Shvachko, K., Kuang, H., Radia, S., Chansler, R.: The hadoop distributed file system. In: Proceedings of the 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies, pp. 1–10 (2010)
Bradski, G., Kaehler, A.: Learning OpenCV: Computer Vision with the OpenCV Library. O’Reilly (2008). http://opencv.org
Integrating Vision Toolkit. http://ivt.sourceforge.net/
The Vision-something-Libraries. http://vxl.sourceforge.net/
Bolme, D.S., O’Hara, S.: Pyvision—computer vision toolkit (2008) http://pyvision.sourceforge.net
AForge.NET Image Processing Lab. http://www.aforgenet.com/
Bouguet J.Y.: Camera calibration toolbox for Matlab (2008) http://www.vision.caltech.edu/bouguetj/calib_doc/
Furukawa, Y.: Clustering Views for Multi-view Stereo (CMVS). http://grail.cs.washington.edu/software/cmvs/
Snavely, N.: Bundler: Structure from Motion (SfM) for Unordered Image Collections. http://phototour.cs.washington.edu/bundler/
Wu, C.: VisualSFM : a visual structure from motion system. http://www.cs.washington.edu/homes/ccwu/vsfm/
Vedaldi, A., Fulkerson, B.: VLFeat: an open and portable library of computer vision algorithms (2008) http://www.vlfeat.org/
Jia, Y., Shelhamer, E., Donahue, J., Karayev, S., Long, J., Girshick, R., Guadarrama, S., Darrell, T.: Caffe: convolutional architecture for fast feature embedding (2014) arXiv preprint arXiv:1408.5093
Bastien, F., Lamblin, P., Pascanu, R., Bergstra, J., Goodfellow, I.J., Bergeron, A., Bouchard, N., Bengio, Y.: Theano: new features and speed improvements. In: Deep Learning and Unsupervised Feature Learning NIPS 2012 Workshop (2012)
Bergstra, J., Breuleux, O., Bastien, F., Lamblin, P., Pascanu, R., Desjardins, G., Turian, J., Warde-Farley, D., Bengio, Y.: Theano: a CPU and GPU math expression compiler. In: Proceedings of the Python for Scientific Computing Conference (SciPy) (2010)
Torch:A scientific computing framework for LUAJIT. http://torch.ch/
Felzenszwalb, P.F., Girshick, R.B., McAllester, D.: Discriminatively trained deformable part models, release 4. http://www.cs.brown.edu/~pff/latent-release4/
Yang, Y., Ramanan, D.: Articulated pose estimation with flexible mixtures-of-parts. CVPR, pp. 1385–1392 (2011)
Kolmogorov, V., Zabih, R.: What energy functions can be minimized via graph cuts? PAMI 26(2), 147–159 (2004)
Amazon elastic compute cloud (amazon ec2). http://aws.amazon.com/ec2/
Orbeus rekognition. https://rekognition.com/
Clarifai. http://www.clarifai.com/
vision.ai. http://vision.ai/
Django: the web framework for perfectionists with deadlines. https://www.djangoproject.com/
Node.js. https://nodejs.org/
Socket.IO. http://socket.io/
Redis. http://redis.io/
Celery: distributed task queue. http://www.celeryproject.org/
javaScript Object Notation. http://torch.ch/
Advanced Message Queueing Protocol. https://www.amqp.org
Low, Y., Gonzalez, J., Kyrola, A., Bickson, D., Guestrin, C., Hellerstein, J.M.: Graphlab: a new parallel framework for machine learning. In: UAI (2010)
The Graphlab Computer Vision Toolkit. http://graphlab.org/toolkits/computer-vision/
Krizhevsky, A., Sutskever, I., Hinton, G.: Imagenet classification with deep convolutional neural networks. NIPS (2012)
Donahue, J., Jia, Y., Vinyals, O., Hoffman, J., Zhang, N., Tzeng, E., Darrell, T.: Decaf: a deep convolutional activation feature for generic visual recognition. ICML (2014)
Razavian, A.S., Azizpour, H., Sullivan, J., Carlsson, S.: CNN features off-the-shelf: an astounding baseline for recognition (2014) arXiv preprint arXiv:1403.6382
Mathialagan, C.S., Batra, D., Gallagher, A.C.: Vip: finding important people in group images. In: Computer Vision and Pattern Recognition (2015)
SkyBiometry. https://www.skybiometry.com/
Bradski, G.: The OpenCV library (2000)
Brown, M., Lowe, D.: Automatic panoramic image stitching using invariant features. IJCV 74(1), 59–73 (2007)
Bay, H., Ess, A., Tuytelaars, T., Van Gool, L.: Speeded-up robust features (surf). Comput. Vis. Image Underst. 110(3), 346–359 (2008)
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vision 60(2), 91–110 (2004)
Triggs, B., Mclauchlan, P., Hartley, R., Fitzgibbon, A.: Bundle adjustment—a modern synthesis. Vision Algorithms: Theory and Practice. Lecture Notes in Computer Science, vol. 1883, pp. 298–372. Springer, Berlin (1999)
NVIDIA DIGITS interactive deep learning gpu training system. https://developer.nvidia.com/digits. Accessed 1 June 2015
Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., Duchesnay, E.: Scikit-learn: machine learning in python. J. Mach. Learn. Res. 12, 2825–2830 (2011)
Acknowledgments
This work was partially supported by the Virginia Tech ICTAS JFC Award, and the National Science Foundation CAREER award IIS-1350553. The views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies or endorsements, either expressed or implied, of the U.S. Government or any sponsor.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Agrawal, H. et al. (2015). CloudCV: Large-Scale Distributed Computer Vision as a Cloud Service. In: Hua, G., Hua, XS. (eds) Mobile Cloud Visual Media Computing. Springer, Cham. https://doi.org/10.1007/978-3-319-24702-1_11
Download citation
DOI: https://doi.org/10.1007/978-3-319-24702-1_11
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-24700-7
Online ISBN: 978-3-319-24702-1
eBook Packages: Computer ScienceComputer Science (R0)