Analysis and Correction of Web Documents’ Non-Compliance with Web Standards
Motivated by the case for equal accessibility of the World Wide Web (Web for short), we analyzed the non-compliance of collected web documents with web standards through a statistical physics approach. The web documents were examined by using a validator that classified the non-compliance into errors and warnings of different types. We found that the frequency distributions of errors and warnings in a web document followed a power-law distribution and that a strong correlation existed between the numbers of errors and warnings. In addition, some types of errors or warnings were identified much more frequently than others, a pattern that could be modeled by a geometric distribution. Utilizing these properties, we proposed a scheme to correct non-compliance that focused on the most frequently occurring errors and warnings. We empirically tested the proposed method against the collected web documents and showed that it effectively corrected about 47% of errors and 85% of warnings. We also used network theory to analyze correlations within and between different errors and warnings in the correction results and found that some types of errors and/or warnings affected each other during correction. In this paper, the correction results of the proposed method are compared with those of Tidy, and the differing characteristics of the two correction methods are discussed.
Keywords: World wide web, Web standards, Validator, Non-compliance, Power-law distribution
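The idea of focusing correction on the most frequently occurring error and warning types can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `coverage_of_top_types`, the error-type labels, and the counts below are all hypothetical, chosen only to show how a skewed frequency distribution lets a few types account for most occurrences.

```python
from collections import Counter


def coverage_of_top_types(type_counts, k):
    """Return the fraction of all occurrences covered by the k most frequent types."""
    total = sum(type_counts.values())
    if total == 0:
        return 0.0
    # most_common(k) ranks the types by descending count and keeps the first k.
    top = Counter(type_counts).most_common(k)
    return sum(count for _, count in top) / total


# Hypothetical counts per validator error type (illustrative, not data from the paper).
counts = {
    "missing_alt": 50,
    "unclosed_tag": 25,
    "bad_attribute": 15,
    "no_doctype": 10,
}

# Share of all error occurrences addressed by handling only the two most frequent types.
print(coverage_of_top_types(counts, 2))  # 0.75
```

With these made-up counts, handling two of the four types covers three quarters of all occurrences, which is the kind of skew a frequency-ranked correction scheme relies on.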
This work was supported by the Korea Research Foundation Grant funded by the Korean Government (MOEHRD) (NRF-2018R1D1A3B07042338).