Abstract
Recent advances in information technology have turned out World Wide Web to be the main platform for interactions where participants—users and corresponding events—are triggered. Although the participants vary in accordance with scenarios, a considerable size of data will be generated. This phenomenon indeed causes the complexity in information retrieval, management, and resuse, and meanwhile, turns down the value of this data. In this research, we attempt to achieve efficient management of user-generated data and its derivative contexts (i.e., social ad hoc data) for human supports. The correlations among data, contexts, and their hybridization are specifically concentrated. An intelligent state machine is proposed to outline the relations of data and contexts, and applied to further identify their usage scenarios. The performance and feasibility can be revealed by the experiments that were conducted on the data collected from open social networks (e.g., Facebook, Twitter, etc.) in the past few years with size around 500 users and 8,000,000 shared contents from them.
Similar content being viewed by others
Notes
The contexts derived from the user-generated data are considered as separate data in this study. However, dependency exists especially when one of them is triggered.
A scenario of information searching is implemented as a service that facilitates the search input and data reuse/revisit. Although several features, such as search guidance, multi-factors enhanced search process, and etc., exist, the search process is specially focused in this study.
A marking can be considered as a state of ISM because the marking will be updated when a transition is fired. That is the reason why the outcome event of a transition may affect the obtained attribute(s).
All the collected data from open social networks are verified with existing semantic analysis tool [30] that supports identify whether the data itself and/or its derived contexts possess implicit meaning(s) to specific event(s). The specific event here is considered a form of social ad hoc data set.
References
Aarts F, Jonsson B, Uijen J (2010) Generating models of infinite-state communication protocols using regular inference with abstraction. Test Softw Syst 6435:188–204
Bose I, Mahapatra RK (2001) Business data mining—a machine learning perspective. Inf Manag 39(3):211–225
Carpineto C, Osinski S, Romano G, Weiss D (2009) A survey of Web clustering engines. ACM Comput Surv 41(3):17
Cavalli A, Gervy C, Prokopenko S (2003) New approaches for passive testing using an extended finite state machine specification. Inf Softw Technol 45(12):837–852
Chen Y, Dong G, Han J, Wah BW, Wang J (2002) Multi-dimensional regression analysis of time-series data streams. Proceedings of the 28th international conference on Very Large Data Bases, 323–334
Cheng KT, Krishnakumar AS (1996) Automatic generation of functional vectors using the extended finite state machine model. ACM Trans Des Autom Electron Syst 1(1):57–79
Culotta A, Bekkerman R, McCallum A (2004) Extracting social networks and contact information from email and the Web. In: Proceedings of CEAS-1
Erickson T, Kellogg WA (2000) Social translucence: an approach to designing systems that support social processes. ACM Trans Comput-Hum Interact 7(1):59–83
Esling P, Agon C (2012) Time-series data mining. ACM Comput Surv 45(1):12
Fagni T, Perego R, Silvestri F, Orlando S (2006) Boosting the performance of Web search engines: caching and prefetching query results by exploiting historical usage data. ACM Trans Inf Syst 24(1):51–78
Faloutsos C, McCurley KS, Tomkins A (2004) Fast discovery of connection subgraphs. In: Proc. ACM SIGKDD 2004
Folstad A, Hornbaek K, Ulleberg P (2013) Social design feedback: evaluations with users in online ad-hoc groups. Human-Centric Comput Inf Sci 3:18
Gaber MM, Zaslavsky A, Krishnaswamy S (2005) Mining data streams: a review. ACM SIGMOD Rec 34(2):18–26
Glynn Mangold W, Faulds DJ (2009) Social media: the new hybrid element of the promotion mix. Bus Horiz 52(4):357–365
Guralnik V, Srivastave J (1999) Event detection from time series data. Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining, 33–42
Harada M, Sato Sh, Kazama K (2004) Finding authoritative people from the web. In: Proc. Joint Conference on Digital Libraries (JCDL2004)
Hoheisel A, Alt M (2007) Petri nets. In: Workflows for e-Science. pp 190–207
Hong JE, Bae DH (2000) Software modeling and analysis using a hierarchical object-oriented Petri net. Inf Sci 130(1–4):133–164
Jensen K, Kristensen LM, Wells L (2007) Coloured Petri nets and CPN tools for modelling and validation of concurrent systems. Int J Softw Tools Technol Transfer 9(3):213–254
Joachims T, Granka L, Pan B, Hembrooke H, Gay G (2005) Accurately interpreting clickthrough data as implicit feedback. Proceeding of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 154–161
Karsai M, Kivela M, Pan RK, Kaski K, Kertesz J, Barabasi A-L, Saramaki J (2011) Small but slow world: how network topology and burstiness slow down spreading. Phys Rev E 83(2):025102
Keogh E, Kasetty S (2003) On the need for time series data mining benchmarks: a survey and empirical demonstration. Data Min Knowl Disc 7(4):349–371
Kozłowski T, Dagless E, Saul J, Adamski M, Szajna J (1995) Parallel controller synthesis using Petri nets. IEE Proc Comput Digit Tech 142(4):263–271
Lee K, Agrawal A, Choudhary A (2013) Real-time disease surveillance using Twitter data: demonstration on flu and cancer. Proceeding of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 1474–1477
Li L, Hadjiicostis CN, Sreenivas RS (2008) Designs of bisimilar Petri net controllers with fault tolerance capabilities. IEEE Trans Syst Man Cybern Syst Hum 38(1):207–217
Liu B, Liu YK (2002) Expected value of fuzzy variable and fuzzy expected value models. IEEE Trans Fuzzy Syst 10(4):445–450
Mandal SN, Choudhury JP, Chaudhuri SRB, De D (2008) Soft computing approach in prediction of a time series data. J Theor Appl Inf Technol 4(12):1131–1141
Mika P (2005) Ontologies are us: a unified model of social networks and semantics. In: Proc. ISWC2005
Milanovic N, Malek M (2004) Current solutions for web service composition. IEEE Internet Comput 8(6):51–59
Miller GA (1995) WordNet: a lexical database for English. Commun ACM 38(11):39–41
Mitra S, Pal SK, Mitra P (2002) Data mining in soft computing framework: a survey. IEEE Trans Neural Netw 13(1):3–14
Pais R, Gomes L, Paulo Barros J (2011) From UML state machines to Petri nets: history atribute translation strategies. The 37th Annual Conference on IEEE Industrial Electronics Society, 3776–3781
Salimifard K, Wright M (2001) Petri net-based modeling of workflow systems: an overview. Eur J Oper Res 134(3):664–676
Schadt EE, Linderman MD, Soreson J, Lee L, Nolan GP (2010) Computational solutions to large-scale data management and analysis. Nat Rev Genet 11:647–657
Shtykh RY, Jin Q (2011) A human-centric integrated approach to web information search and sharing. Human-Centric Comput Inf Sci 1:2
Steyvers M, Tenenbaum JB (2005) The large-scale structure of semantic networks: statistical analyses and a model of semantic growth. Cogn Sci 29(1):41–78
Tay FEH, Cao L (2001) Application of support vector machines in financial time series forecasting. Omega 29(4):309–317
Thelwall M (2001) A web crawler design for data mining. J Inf Sci 27(5):319–325
van der Aalst WMP, Song M (2004) Mining social networks: uncovering interaction patterns in business processes. Bus Process Manag LNCS 3080:244–260
Yen NY, Shih TK, Jin Q (2013) LONET: an interactive search network for intelligent path generation. ACM Trans Intell Syst Technol 4(2):30
Zhang J, Chang CK, Chung JY, Kim SW (2004) WS-Net: a Petri-net based specification model for Web services. IEEE International Conference on Web Services, 420–427
Acknowledgments
This research was supported by the MSIP (Ministry of Science, ICT and Future Planning), Korea, under the ITRC (Information Technology Research Center) support program (NIPA-2014-H0301-14-4007) supervised by the NIPA(National IT Industry Promotion Agency).
Author information
Authors and Affiliations
Corresponding authors
Rights and permissions
About this article
Cite this article
Yen, N.Y., Jin, Q., Tsai, J.C. et al. Intelligent state machine for social ad hoc data management and reuse. Multimed Tools Appl 74, 3521–3541 (2015). https://doi.org/10.1007/s11042-014-1941-2
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-014-1941-2