Automatic Retrieval of Web Pages with Standards of Ethics and Trustworthiness Within a Medical Portal: What a Page Name Tells Us

Gaudinat, Arnaud; Grabar, Natalia; Boyer, Célia

doi:10.1007/978-3-540-73599-1_24

Conference paper

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4594)

Abstract

The ever-increasing volume of online health information, coupled with its uneven reliability and quality, may have considerable implications for citizens. To address this issue, we propose using, within a general or specialised search engine, standards for identifying the reliability of online documents. The standards used relate to the ethics and trustworthiness of websites. In this research, they are detected from the URL names of Web pages by applying machine learning algorithms. Depending on the algorithm and the principle, our straightforward approach reaches up to 93% precision and 91% recall, although a few principles remain difficult to recognize.
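
The idea of reading a page's URL name as classifier input, and of scoring the result with precision and recall, can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the features, the algorithms, or the principle labels, so the tokenizer, the keyword set `PRIVACY_KEYWORDS`, the rule `mentions_privacy` (a simple stand-in for a trained machine learning model), and the example URL are all assumptions made for illustration.

```python
import re
from urllib.parse import urlparse

# Hypothetical keyword set for one principle; an assumption, not from the paper.
PRIVACY_KEYWORDS = {"privacy", "confidentiality"}


def url_tokens(url):
    """Split the path and query of a URL into lowercase word tokens."""
    parsed = urlparse(url)
    text = f"{parsed.path} {parsed.query}".lower()
    return [tok for tok in re.split(r"[^a-z0-9]+", text) if tok]


def mentions_privacy(url):
    """Keyword rule standing in for a trained classifier (illustration only)."""
    return any(tok in PRIVACY_KEYWORDS for tok in url_tokens(url))


def precision_recall(predicted, actual):
    """Compute precision and recall from parallel lists of booleans."""
    pairs = list(zip(predicted, actual))
    tp = sum(1 for p, a in pairs if p and a)      # true positives
    fp = sum(1 for p, a in pairs if p and not a)  # false positives
    fn = sum(1 for p, a in pairs if not p and a)  # false negatives
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall
```

For example, `url_tokens("http://www.example.org/about/privacy-policy.html?lang=en")` yields the tokens `about`, `privacy`, `policy`, `html`, `lang`, `en`, which a learning algorithm could then use as features. Precision is the share of pages flagged for a principle that truly belong to it; recall is the share of pages belonging to the principle that were flagged.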

Author information

Authors and Affiliations

Health on the Net Foundation, SIM/HUG, Geneva, Switzerland
Arnaud Gaudinat, Natalia Grabar & Célia Boyer

Editor information

Riccardo Bellazzi Ameen Abu-Hanna Jim Hunter

About this paper

Cite this paper

Gaudinat, A., Grabar, N., Boyer, C. (2007). Automatic Retrieval of Web Pages with Standards of Ethics and Trustworthiness Within a Medical Portal: What a Page Name Tells Us. In: Bellazzi, R., Abu-Hanna, A., Hunter, J. (eds) Artificial Intelligence in Medicine. AIME 2007. Lecture Notes in Computer Science, vol 4594. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73599-1_24

DOI: https://doi.org/10.1007/978-3-540-73599-1_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-73598-4
Online ISBN: 978-3-540-73599-1
eBook Packages: Computer Science, Computer Science (R0)