Abstract
Sina Weibo is currently the microblogging web service with the highest number of registered users in China. As in any large social network, the relationship data is so large that executing queries over the network is a very challenging problem. The WISE 2012 conference proposed a challenge based on Sina Weibo with two tracks: performance testing and repost prediction. This paper focuses on the first track, whose goal is to implement 19 queries with the highest throughput and the lowest latency, using a scalable parallel paradigm. The input database contains 265 million relations among more than 60 million users, and more than 400 million sent messages. This paper formalizes the logical model of the relationship in order to present the queries in a precise and simple manner. Some optimization techniques are also proposed, such as the aggregate-rank-delete procedures, which can be applied to some of the queries to improve performance. The proposed model and optimizations were implemented in a highly scalable parallel system, and the experimental results show that our solution obtains high throughput and low latency for most of the queries.
This research has been partially supported by the CNPq and FAPDF grants.
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
De Sandes, E.F.O., Weigang, L., de Melo, A.C.M.A. (2012). Logical Model of Relationship for Online Social Networks and Performance Optimizing of Queries. In: Wang, X.S., Cruz, I., Delis, A., Huang, G. (eds) Web Information Systems Engineering - WISE 2012. WISE 2012. Lecture Notes in Computer Science, vol 7651. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35063-4_59
DOI: https://doi.org/10.1007/978-3-642-35063-4_59
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35062-7
Online ISBN: 978-3-642-35063-4
eBook Packages: Computer Science, Computer Science (R0)