Food Category Recognition Using SURF and MSER Local Feature Representation

Razali, Mohd Norhisham; Manshor, Noridayu; Halin, Alfian Abdul; Yaakob, Razali; Mustapha, Norwati

doi:10.1007/978-3-319-70010-6_20

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 10645)

Included in the following conference series:

International Visual Informatics Conference

Abstract

Food object recognition has gained popularity in recent years, perhaps because of its potential applications in fields such as nutrition and fitness. Recognizing food images is challenging, however, because foods come in many shapes and sizes. Besides exhibiting unexpected deformations and textures, food images are captured under differing lighting conditions and camera viewpoints. From a computer vision perspective, using global image features to train a supervised classifier might be unsuitable because of the complex nature of food images. Local features, on the other hand, seem the better alternative since they can capture fine details such as interest points and other intricate information. In this paper, two local features, SURF (Speeded-Up Robust Features) and MSER (Maximally Stable Extremal Regions), are investigated for food object recognition. Both features are computationally inexpensive and have been shown to be effective local descriptors for complex images. Each feature is first evaluated separately. This is followed by feature fusion to observe whether a combined representation could better represent food images. Experimental evaluations using a Support Vector Machine classifier show that feature fusion yields better recognition accuracy, at 86.6%.
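
The abstract does not say how the two features are fused. As a minimal sketch, assuming early fusion by concatenation of two fixed-length per-image vectors (for example, visual-word histograms built separately from SURF and MSER descriptors), the combined representation could look like the following. The function names and sample values are hypothetical and are not taken from the paper.

```python
import math
from typing import List, Sequence


def l2_normalize(vec: Sequence[float]) -> List[float]:
    """Scale a vector to unit L2 norm; an all-zero vector is returned unchanged."""
    norm = math.sqrt(sum(v * v for v in vec))
    if norm == 0.0:
        return [float(v) for v in vec]
    return [v / norm for v in vec]


def fuse_features(surf_vec: Sequence[float], mser_vec: Sequence[float]) -> List[float]:
    """Early fusion (assumed): normalize each per-image vector, then concatenate.

    Normalizing first keeps the feature with larger raw counts from
    dominating the fused vector.
    """
    return l2_normalize(surf_vec) + l2_normalize(mser_vec)


# Made-up per-image histograms, for illustration only.
surf_hist = [3.0, 4.0, 0.0]
mser_hist = [0.0, 5.0]

fused = fuse_features(surf_hist, mser_hist)
print(len(fused))  # length is the sum of the two input lengths
```

Under this assumption, one fused vector per image would be the input to the Support Vector Machine classifier, in place of either single-feature vector.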

Author information

Authors and Affiliations

Faculty of Computer Science and Information Technology, Universiti Putra Malaysia, 43300, Serdang, Selangor, Malaysia
Mohd Norhisham Razali, Noridayu Manshor, Alfian Abdul Halin, Razali Yaakob & Norwati Mustapha
Faculty of Computing and Informatics, Universiti Malaysia Sabah, 88400, Kota Kinabalu, Sabah, Malaysia
Mohd Norhisham Razali

Corresponding author

Correspondence to Noridayu Manshor.

Editor information

Editors and Affiliations

Universiti Kebangsaan Malaysia, Bangi, Selangor, Malaysia
Halimah Badioze Zaman
University of Cambridge, Cambridge, United Kingdom
Peter Robinson
Dublin City University, Dublin, Ireland
Alan F. Smeaton
National Central University, Jhongli, Taiwan
Timothy K. Shih
Carlos III University of Madrid, Madrid, Spain
Sergio Velastin
Toyo University, Kawagoe, Japan
Tada Terutoshi
Universiti Kebangsaan Malaysia, Bangi, Selangor, Malaysia
Azizah Jaafar
Universiti Kebangsaan Malaysia, Bangi, Selangor, Malaysia
Nazlena Mohamad Ali

About this paper

Cite this paper

Razali, M.N., Manshor, N., Halin, A.A., Yaakob, R., Mustapha, N. (2017). Food Category Recognition Using SURF and MSER Local Feature Representation. In: Badioze Zaman, H., et al. Advances in Visual Informatics. IVIC 2017. Lecture Notes in Computer Science, vol 10645. Springer, Cham. https://doi.org/10.1007/978-3-319-70010-6_20

DOI: https://doi.org/10.1007/978-3-319-70010-6_20
Published: 29 October 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-70009-0
Online ISBN: 978-3-319-70010-6
eBook Packages: Computer Science, Computer Science (R0)