Journal of the Korean Physical Society, Volume 74, Issue 7, pp 731–743

Analysis and Correction of Web Documents’ Non-Compliance with Web Standards

  • So-Young Chae
  • Chang-Yong Lee


Motivated by the need for equal accessibility to the World Wide Web (Web for short), we analyzed the non-compliance of collected web documents with web standards through a statistical physics approach. The web documents were examined with a validator that classified the non-compliance into errors and warnings of different types. We found that the frequency distributions of errors and warnings in a web document followed a power-law distribution and that a strong correlation existed between the numbers of errors and warnings. In addition, some types of errors or warnings were identified much more frequently than others, a pattern that could be modeled by a geometric distribution. By utilizing these properties, we proposed a scheme to correct non-compliance that focused on the most frequently occurring errors and warnings. We empirically tested the proposed method against the collected web documents and showed that it effectively corrected about 47% of errors and 85% of warnings. We also used network theory to analyze correlations within and between different errors and warnings in the correction results and found that some types of errors and/or warnings affected each other in the correction. In this paper, the correction results of the proposed method are compared with those of Tidy, and the different characteristics of the two correction methods are discussed.
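The link between a geometric distribution over types and a correction scheme that targets only the most frequent types can be illustrated with a short sketch. It is not the authors' procedure: it assumes that types are ranked by frequency and that rank r occurs with probability p(1 - p)^(r - 1), and the function names and type labels are hypothetical. Under that assumption, the m most frequent types account for a share 1 - (1 - p)^m of all occurrences.

```python
from collections import Counter


def rank_types(observations):
    """Return type labels ordered from most to least frequent."""
    counts = Counter(observations)
    return [label for label, _ in counts.most_common()]


def geometric_mle(observations):
    """Maximum-likelihood estimate of p for ranks 1, 2, 3, ...

    Assumes rank r occurs with probability p * (1 - p) ** (r - 1),
    for which the estimate is (number of observations) / (sum of ranks).
    """
    order = rank_types(observations)
    rank_of = {label: i + 1 for i, label in enumerate(order)}
    total_rank = sum(rank_of[label] for label in observations)
    return len(observations) / total_rank


def top_m_coverage(p, m):
    """Share of occurrences covered by the m most frequent types."""
    return 1.0 - (1.0 - p) ** m


# Hypothetical type labels, for illustration only.
sample = ["type_a"] * 4 + ["type_b"] * 2 + ["type_c"] + ["type_d"]
p_hat = geometric_mle(sample)
print(p_hat, top_m_coverage(p_hat, 2))
```

If the geometric model holds with a reasonably large p, a small m already covers most occurrences, which is the rationale for concentrating correction effort on the few most frequent types.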


World Wide Web, Web standards, Validator, Non-compliance, Power-law distribution





This work was supported by the Korea Research Foundation Grant funded by the Korean Government (MOEHRD) (NRF-2018R1D1A3B07042338).




Copyright information

© The Korean Physical Society 2019

Authors and Affiliations

  1. Department of Industrial and Systems Engineering, Kongju National University, Cheonan, Korea
