Yoda: An Accurate and Scalable Web-Based Recommendation System

  • Cyrus Shahabi
  • Farnoush Banaei-Kashani
  • Yi-Shin Chen
  • Dennis McLeod
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2172)


Recommendation systems are applied to personalize and customize the Web environment.We have developed a recommendation system, termed Yoda, that is designed to support large-scale Web-based applications requiring highly accurate recommendations in real-time. With Yoda, we introduce a hybrid approach that combines collaborative filtering (CF) and content-based querying to achieve higher accuracy. Yoda is structured as a tunable model that is trained off-line and employed for real-time recommendation on-line. The on-line process benefits from an optimized aggregation function with low complexity that allows realtime weighted aggregation of the soft classification of active users to predefined recommendation sets. Leveraging on localized distribution of the recommendable items, the same aggregation function is further optimized for the off-line process to reduce the time complexity of constructing the pre-defined recommendation sets of the model. To make the off-line process scalable furthermore, we also propose a filtering mechanism, FLSH, that extends the Locality Sensitive Hashing technique by incorporating a novel distance measure that satisfies specific requirements of our application. Our end-to-end experiments show while Yoda’s complexity is low and remains constant as the number of users and/or items grow, its accuracy surpasses that of the basic nearest-neighbor method by a wide margin (in most cases more than 100%).


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Sarwar B., G. Karypis, J. Konstan, and J. Riedl: Analysis of Recommendation Algorithms for E-Commerce, Proceedings of E-Commerce Coneference, 17-20 October, Minneapolis, Minnesota, 2000. 419Google Scholar
  2. 2.
    Fagin R.: Combining Fuzzy Information from Multiple Systems, Proceedings of Fifteenth ACM Symposyum on Principles of Database Systems, Montreal, pp. 216–226, 1996. 420, 424, 425Google Scholar
  3. 3.
    Shahabi C., and Y. Chen: Efficient Support of Soft Query in Image Retrieval Systems, Proceedings of SSGRR 2000 Computer and eBusiness Conference, Rome, Italy, Aguest, 2000.Google Scholar
  4. 4.
    Balabanovi M., and Y. Shoham: Fab, content-based, collaborative recommendation, Communications of the ACM, Vol 40(3), pp. 66–72, 1997. 421CrossRefGoogle Scholar
  5. 5.
    Resnick P., N. Iacovou, M. Suchak, P. Bergstrom, and J. Riedl: GroupLens, An Open Architecture for Collaborative Filtering of Netnews, Proceedings of ACM conference on Cumputer-Supported Cooperative Work, Chapel Hill, NC, pp. 175–186, 1994. 419, 420Google Scholar
  6. 6.
    Sarwar B., G. Karypis, J. Konstan, and J. Riedl: Analysis of Recommendation Algorithms for e-Commerce, Proceedings of ACM e-Commerce 2000 Conference, Minneapolis, MN, 2000. 421Google Scholar
  7. 7.
    Good N., J. Schafer, J. Konstan, A. Borchers, B. Sarwar, J. Herlocker, and J. Riedl: Combining Collaborative Filtering with Personal Agents for Better Recommendations, Proceedings of the 1999 Conference of the American Association of Artifical Intelligence, pp. 439–446, 1999. 421Google Scholar
  8. 8.
    Kitts B., David Freed, and Martin Vrieze: Cross-sell, a fast promotion-tunable customer-item recommendation method based on conditionally independent probabilities, Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, Boston, MA, pp. 437–446, August, 2000. 419, 421Google Scholar
  9. 9.
    Breese J., D. Heckerman, and C. Kadie: Empirical Analysis of Predictive Algorithms for Collaborative Filtering, Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence, Madison, WI, pp. 43–52, July, 1998. 419, 421Google Scholar
  10. 10.
    Sarwar B., G. Karypis, J. Konstan, and J. Riedl: Application of Dimensionality Reduction in Recommender System-A Case Study, ACM WebKDD 2000 Web Mining for e-Commerce Workshop, 2000. 421, 428Google Scholar
  11. 11.
    Konstan, J., B. Miller, D. Maltz, J. Herlocker, L. Gordon, and J. Riedl: Applying Collaborative Filtering to Usenet News, Communications of the ACM (40) 3, 1997. 421CrossRefGoogle Scholar
  12. 12.
    Shahabi C., A. M. Zarkesh, J. Adibi, and V. Shah: Knowledge Discovery from Users Web Page Navigation, Proceedings of the IEEE RIDE97 Workshop, April, 1997. 421, 422Google Scholar
  13. 13.
    Shahabi C., F. Banaei-Kashani, J. Faruque, and A. Faisal: Feature Matrices: A Model for Efficient and Anonymous Web Usage Mining, EC-Web 2001, Germany, September 2001. 419, 422, 423Google Scholar
  14. 14.
    Shahabi C., and Y. Chen: A Unified FrameWork to Incorporate Soft Query into Image Retrieval Systems, International Conference on Enterprise Information Systems, Setubal, Portugal, July 2001. 423, 425Google Scholar
  15. 15.
    Gionis A., P. Indyk, and R. Motwani: Similarity search in high dimensions via hashing, Proceedings of the 25th International Conference on Very Large Databases, Edinburgh, Scotland, 1999. 420, 426Google Scholar
  16. 16.
    Weber R., H. Schek and S. Blott: A quantitative analysis and performance study for Similarity Search Methods in High Dimensional Spaces, Proceedings of the 24th International Conference on Very Large Data bases, 1999. 426Google Scholar
  17. 17.
    Karnik N., and J. Mendel: Operations on Type-2 Fuzzy Sets, International Journal on Fuzzy Sets and Systems, 2000. 430Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2001

Authors and Affiliations

  • Cyrus Shahabi
    • 1
  • Farnoush Banaei-Kashani
    • 1
  • Yi-Shin Chen
    • 1
  • Dennis McLeod
    • 1
  1. 1.Department of Computer Science, Integrated Media Systems CenterUniversity of Southern CaliforniaLos AngelesUSA

Personalised recommendations