Skip to main content
Log in

Intelligent state machine for social ad hoc data management and reuse

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Recent advances in information technology have turned out World Wide Web to be the main platform for interactions where participants—users and corresponding events—are triggered. Although the participants vary in accordance with scenarios, a considerable size of data will be generated. This phenomenon indeed causes the complexity in information retrieval, management, and resuse, and meanwhile, turns down the value of this data. In this research, we attempt to achieve efficient management of user-generated data and its derivative contexts (i.e., social ad hoc data) for human supports. The correlations among data, contexts, and their hybridization are specifically concentrated. An intelligent state machine is proposed to outline the relations of data and contexts, and applied to further identify their usage scenarios. The performance and feasibility can be revealed by the experiments that were conducted on the data collected from open social networks (e.g., Facebook, Twitter, etc.) in the past few years with size around 500 users and 8,000,000 shared contents from them.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

Notes

  1. The contexts derived from the user-generated data are considered as separate data in this study. However, dependency exists especially when one of them is triggered.

  2. A scenario of information searching is implemented as a service that facilitates the search input and data reuse/revisit. Although several features, such as search guidance, multi-factors enhanced search process, and etc., exist, the search process is specially focused in this study.

  3. A marking can be considered as a state of ISM because the marking will be updated when a transition is fired. That is the reason why the outcome event of a transition may affect the obtained attribute(s).

  4. All the collected data from open social networks are verified with existing semantic analysis tool [30] that supports identify whether the data itself and/or its derived contexts possess implicit meaning(s) to specific event(s). The specific event here is considered a form of social ad hoc data set.

References

  1. Aarts F, Jonsson B, Uijen J (2010) Generating models of infinite-state communication protocols using regular inference with abstraction. Test Softw Syst 6435:188–204

    Article  Google Scholar 

  2. Bose I, Mahapatra RK (2001) Business data mining—a machine learning perspective. Inf Manag 39(3):211–225

    Article  Google Scholar 

  3. Carpineto C, Osinski S, Romano G, Weiss D (2009) A survey of Web clustering engines. ACM Comput Surv 41(3):17

    Article  Google Scholar 

  4. Cavalli A, Gervy C, Prokopenko S (2003) New approaches for passive testing using an extended finite state machine specification. Inf Softw Technol 45(12):837–852

    Article  Google Scholar 

  5. Chen Y, Dong G, Han J, Wah BW, Wang J (2002) Multi-dimensional regression analysis of time-series data streams. Proceedings of the 28th international conference on Very Large Data Bases, 323–334

  6. Cheng KT, Krishnakumar AS (1996) Automatic generation of functional vectors using the extended finite state machine model. ACM Trans Des Autom Electron Syst 1(1):57–79

    Article  Google Scholar 

  7. Culotta A, Bekkerman R, McCallum A (2004) Extracting social networks and contact information from email and the Web. In: Proceedings of CEAS-1

  8. Erickson T, Kellogg WA (2000) Social translucence: an approach to designing systems that support social processes. ACM Trans Comput-Hum Interact 7(1):59–83

    Article  Google Scholar 

  9. Esling P, Agon C (2012) Time-series data mining. ACM Comput Surv 45(1):12

    Article  Google Scholar 

  10. Fagni T, Perego R, Silvestri F, Orlando S (2006) Boosting the performance of Web search engines: caching and prefetching query results by exploiting historical usage data. ACM Trans Inf Syst 24(1):51–78

    Article  Google Scholar 

  11. Faloutsos C, McCurley KS, Tomkins A (2004) Fast discovery of connection subgraphs. In: Proc. ACM SIGKDD 2004

  12. Folstad A, Hornbaek K, Ulleberg P (2013) Social design feedback: evaluations with users in online ad-hoc groups. Human-Centric Comput Inf Sci 3:18

    Article  Google Scholar 

  13. Gaber MM, Zaslavsky A, Krishnaswamy S (2005) Mining data streams: a review. ACM SIGMOD Rec 34(2):18–26

    Article  Google Scholar 

  14. Glynn Mangold W, Faulds DJ (2009) Social media: the new hybrid element of the promotion mix. Bus Horiz 52(4):357–365

    Article  Google Scholar 

  15. Guralnik V, Srivastave J (1999) Event detection from time series data. Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining, 33–42

  16. Harada M, Sato Sh, Kazama K (2004) Finding authoritative people from the web. In: Proc. Joint Conference on Digital Libraries (JCDL2004)

  17. Hoheisel A, Alt M (2007) Petri nets. In: Workflows for e-Science. pp 190–207

  18. Hong JE, Bae DH (2000) Software modeling and analysis using a hierarchical object-oriented Petri net. Inf Sci 130(1–4):133–164

    Article  MATH  Google Scholar 

  19. Jensen K, Kristensen LM, Wells L (2007) Coloured Petri nets and CPN tools for modelling and validation of concurrent systems. Int J Softw Tools Technol Transfer 9(3):213–254

    Article  Google Scholar 

  20. Joachims T, Granka L, Pan B, Hembrooke H, Gay G (2005) Accurately interpreting clickthrough data as implicit feedback. Proceeding of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 154–161

  21. Karsai M, Kivela M, Pan RK, Kaski K, Kertesz J, Barabasi A-L, Saramaki J (2011) Small but slow world: how network topology and burstiness slow down spreading. Phys Rev E 83(2):025102

    Google Scholar 

  22. Keogh E, Kasetty S (2003) On the need for time series data mining benchmarks: a survey and empirical demonstration. Data Min Knowl Disc 7(4):349–371

    Article  MathSciNet  Google Scholar 

  23. Kozłowski T, Dagless E, Saul J, Adamski M, Szajna J (1995) Parallel controller synthesis using Petri nets. IEE Proc Comput Digit Tech 142(4):263–271

    Article  Google Scholar 

  24. Lee K, Agrawal A, Choudhary A (2013) Real-time disease surveillance using Twitter data: demonstration on flu and cancer. Proceeding of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 1474–1477

  25. Li L, Hadjiicostis CN, Sreenivas RS (2008) Designs of bisimilar Petri net controllers with fault tolerance capabilities. IEEE Trans Syst Man Cybern Syst Hum 38(1):207–217

    Article  Google Scholar 

  26. Liu B, Liu YK (2002) Expected value of fuzzy variable and fuzzy expected value models. IEEE Trans Fuzzy Syst 10(4):445–450

    Article  Google Scholar 

  27. Mandal SN, Choudhury JP, Chaudhuri SRB, De D (2008) Soft computing approach in prediction of a time series data. J Theor Appl Inf Technol 4(12):1131–1141

    Google Scholar 

  28. Mika P (2005) Ontologies are us: a unified model of social networks and semantics. In: Proc. ISWC2005

  29. Milanovic N, Malek M (2004) Current solutions for web service composition. IEEE Internet Comput 8(6):51–59

    Article  Google Scholar 

  30. Miller GA (1995) WordNet: a lexical database for English. Commun ACM 38(11):39–41

    Article  Google Scholar 

  31. Mitra S, Pal SK, Mitra P (2002) Data mining in soft computing framework: a survey. IEEE Trans Neural Netw 13(1):3–14

    Article  Google Scholar 

  32. Pais R, Gomes L, Paulo Barros J (2011) From UML state machines to Petri nets: history atribute translation strategies. The 37th Annual Conference on IEEE Industrial Electronics Society, 3776–3781

  33. Salimifard K, Wright M (2001) Petri net-based modeling of workflow systems: an overview. Eur J Oper Res 134(3):664–676

    Article  MATH  Google Scholar 

  34. Schadt EE, Linderman MD, Soreson J, Lee L, Nolan GP (2010) Computational solutions to large-scale data management and analysis. Nat Rev Genet 11:647–657

    Article  Google Scholar 

  35. Shtykh RY, Jin Q (2011) A human-centric integrated approach to web information search and sharing. Human-Centric Comput Inf Sci 1:2

    Article  Google Scholar 

  36. Steyvers M, Tenenbaum JB (2005) The large-scale structure of semantic networks: statistical analyses and a model of semantic growth. Cogn Sci 29(1):41–78

    Article  Google Scholar 

  37. Tay FEH, Cao L (2001) Application of support vector machines in financial time series forecasting. Omega 29(4):309–317

    Article  Google Scholar 

  38. Thelwall M (2001) A web crawler design for data mining. J Inf Sci 27(5):319–325

    Article  Google Scholar 

  39. van der Aalst WMP, Song M (2004) Mining social networks: uncovering interaction patterns in business processes. Bus Process Manag LNCS 3080:244–260

    Google Scholar 

  40. Yen NY, Shih TK, Jin Q (2013) LONET: an interactive search network for intelligent path generation. ACM Trans Intell Syst Technol 4(2):30

    Article  Google Scholar 

  41. Zhang J, Chang CK, Chung JY, Kim SW (2004) WS-Net: a Petri-net based specification model for Web services. IEEE International Conference on Web Services, 420–427

Download references

Acknowledgments

This research was supported by the MSIP (Ministry of Science, ICT and Future Planning), Korea, under the ITRC (Information Technology Research Center) support program (NIPA-2014-H0301-14-4007) supervised by the NIPA(National IT Industry Promotion Agency).

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Qun Jin or James J. Park.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Yen, N.Y., Jin, Q., Tsai, J.C. et al. Intelligent state machine for social ad hoc data management and reuse. Multimed Tools Appl 74, 3521–3541 (2015). https://doi.org/10.1007/s11042-014-1941-2

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-014-1941-2

Keywords

Navigation