Skip to main content

Web Page Clustering for More Efficient Website Accessibility Evaluations

  • Conference paper
  • First Online:
Computers Helping People with Special Needs (ICCHP 2016)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9758))

Abstract

Despite advances in the legal framework to assure web accessibility people with disabilities still find barriers hindering websites access. The European Internet Inclusion Initiative (EIII) has delivered methods and tools to carry out large scale evaluations of websites. The tools have been used to carry out 180 million tests on 540, 000 web pages to check 1065 websites at a rate of about 7 sites per hour. This paper outlines an approach to reduce the number of web pages needed to compute accessibility scores. The suggested approach relies on machine learning to cluster the web pages according to the barriers detected and to select representative pages for the score calculation. Analysis of the experimental results has confirmed the validity of the accessibility test result as a new feature for clustering web pages, which is planned to be implemented in the EIII website checker tools.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Chakrabarti, D., Mehta, R.: The paths more taken: matching DOM trees to search logs for accurate webpage clustering. In: Proceedings of the 19th International Conference on World Wide Web, pp. 211–220. ACM (2010)

    Google Scholar 

  2. Harper, S., Moon, A.A., Vigo, M., Brajnik, G., Yesilada, Y.: DOM block clustering for enhanced sampling and evaluation. In: Proceedings of the 12th Web for All Conference, Florence, Italy, pp. 15:1–15:10. ACM (2015)

    Google Scholar 

  3. Crescenzi, V., Merialdo, P., Missier, P.: Clustering web pages based on their structure. Elsevier Sci. Publishers B.V. Data Knowl. Eng. 54, 279–299 (2004)

    Google Scholar 

  4. Brajnik, G., Mulas, A., Pitton, C.: Effects of sampling methods on web accessibility evaluations. In: Proceedings of the 9th International ACM SIGACCESS Conference on Computers and Accessibility, Tempe, Arizona, USA, pp. 59–66. ACM (2007)

    Google Scholar 

  5. Hernández, I., Rivero, C.R., Ruiz, D., Corchuelo, R.: CALA: an unsupervised URL-based web page classification system. Elsevier Sci. Publishers B.V. Know.-Based Syst. 57, 168–180 (2014)

    Google Scholar 

  6. Alpaydin, E.: Introduction to Machine Learning. Adaptive Computation and Machine Learning. MIT Press, Cambridge (2014). icchp: fraley02

    Google Scholar 

  7. Fraley, C., Raftery, A.E.: Model-based clustering, discriminant analysis, and density estimation. J. Am. Stat. Assoc. 97(458), 611–631 (2002)

    Article  MathSciNet  MATH  Google Scholar 

  8. Hernández, I., Rivero, C.R., Ruiz, D., Corchuelo, R.: A statistical approach to URL-based web page clustering. In: Proceedings of the 21st International Conference on World Wide Web, WWW 2012 Companion, Lyon, France, pp. 525–526. ACM (2012)

    Google Scholar 

  9. Zhang, M., Wang, C., Bu, J., Yu, Z., Zhou, Y., Chen, C.: A sampling method based on URL clustering for fast web accessibility evaluation. Zhejiang Univ. Press Front. Inf. Technol. Electron. Eng. 16, 449–456 (2015)

    Article  Google Scholar 

  10. Nietzio, A., Berker, F.: D4.1 Integration Methodology (EIII project final report) (2015). https://box.tingtun.no/d/75ca8eab85/files/?p=/EIII-submitted-deliverables/D4.1-Integration-Methodology-v1.0.pdf. Accessed 15 Mar 2016

  11. European Internet Inclusion Initiative (EIII): EIII Page Checker HTML Tests (2016). http://checkers.eiii.eu/en/tests/. Accessed 10 Mar 2016

  12. Web Content Accessibility Guidelines Working Group (WCAG WG) and the User Agent Accessibility Guidelines Working Group (UAWG): Mobile Accessibility Task Force (Mobile A11Y TF) (2014). https://www.w3.org/WAI/GL/mobile-a11y-tf/. Accessed 23 Mar 2016

  13. Mobile Accessibility Task Force (Mobile A11Y TF): Mobile Accessibility: How WCAG 2.0 and Other W3C/WAI Guidelines Apply to Mobile (2016). http://w3c.github.io/Mobile-A11y-TF-Note/. Accessed 23 Mar 2016

Download references

Acknowledgements

The EIII project (http://eiii.eu) was co-funded by the European Commission under FP7 Project no.: 609667. The results presented in this paper build on the outcomes of several previous projects and the collaboration of the EIII team with researchers, practitioners and users.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mikael Snaprud .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Mucha, J., Snaprud, M., Nietzio, A. (2016). Web Page Clustering for More Efficient Website Accessibility Evaluations. In: Miesenberger, K., Bühler, C., Penaz, P. (eds) Computers Helping People with Special Needs. ICCHP 2016. Lecture Notes in Computer Science(), vol 9758. Springer, Cham. https://doi.org/10.1007/978-3-319-41264-1_35

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-41264-1_35

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-41263-4

  • Online ISBN: 978-3-319-41264-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics