Web Intelligence Linked Open Data for Website Design Reuse

  • Maxim Bakaev
  • Vladimir Khvorostov
  • Sebastian Heil
  • Martin Gaedke
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10360)


Code and design reuse are as old as the software engineering industry itself, yet they remain a current trend as ever more software products and websites are created. Domain-specific design reuse on the web has especially high potential: it can save work effort for thousands of developers and encourage better interaction quality for millions of Internet users. In this paper we perform pilot feature engineering for finding similar solutions (website designs) within the Domain, Task, and User UI models, supplemented by Quality aspects. To obtain the feature values, we propose extracting website-relevant data from global online services (DMOZ, Alexa, SimilarWeb, etc.), considered as linked open data sources, using a specially developed web intelligence data miner. A preliminary investigation with 21 websites and 82 human annotators showed reasonable accuracy of the data sources and suggests that the approach is potentially feasible.
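The idea of finding similar website designs from per-model features can be illustrated with a minimal sketch. It is not the authors' method: the `SiteFeatures` record, the use of Jaccard similarity, and the weights are all assumptions made for illustration, and the Quality aspects mentioned above are left out for brevity.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class SiteFeatures:
    """Hypothetical feature record for one website (field names are illustrative)."""
    url: str
    domain: frozenset  # e.g. directory categories, standing in for the Domain model
    task: frozenset    # e.g. functionality tags, standing in for the Task model
    user: frozenset    # e.g. audience segments, standing in for the User model

def jaccard(a, b):
    """Jaccard similarity of two sets; defined here as 0.0 when both are empty."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)

# Assumed weights per model; the source does not specify any weighting.
WEIGHTS = {"domain": 0.5, "task": 0.3, "user": 0.2}

def similarity(x, y, weights=WEIGHTS):
    """Weighted average of the per-model Jaccard similarities, in [0, 1]."""
    total = sum(weights.values())
    score = sum(w * jaccard(getattr(x, name), getattr(y, name))
                for name, w in weights.items())
    return score / total

def most_similar(target, candidates, k=3):
    """Return up to k candidates ranked by descending similarity to target."""
    ranked = sorted(candidates, key=lambda c: similarity(target, c), reverse=True)
    return ranked[:k]
```

In such a setup, the feature sets would be filled from the external data sources, and `most_similar` would return candidate designs to consider for reuse.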


Linked data quality, Software reuse, Web design patterns, Data mining, Model-driven development



The reported study was funded by RFBR according to the research project No. 16-37-60060 mol_a_dk. The authors also thank S. Firmenich and J.M. Rivero from LIFIA (Argentina) who contributed to the discussion of the paper topics.



Copyright information

© Springer International Publishing AG 2017

Authors and Affiliations

  • Maxim Bakaev (1, corresponding author)
  • Vladimir Khvorostov (1)
  • Sebastian Heil (2)
  • Martin Gaedke (2)
  1. Novosibirsk State Technical University, Novosibirsk, Russia
  2. Technische Universität Chemnitz, Chemnitz, Germany
