Advertisement

Scholarly Information Network

  • Paul Ginsparg
Part III Information Networks & Social Networks
Part of the Lecture Notes in Physics book series (LNP, volume 650)

Abstract

I review the background and some recent trends of a particular scholarly information network, arXiv.org, and discuss some of its implications for new scholarly publication models. If we were to start from scratch today to design a quality-controlled archive and distribution system for scientific and technical information, it could take a very different form from what has evolved in the past decade from pre-existing print infrastructure. Near-term advances in automated classification systems, authoring tools, and document formats will facilitate efficient datamining and long-term archival stability, and I discuss how these could provide not only more efficient means of accessing and navigating the information, but also more cost-effective means of authentication and quality control. Finally, I illustrate the use of machine learning techniques to analyze, structure, maintain, and evolve a large online corpus of academic literature. An emerging field of research can be identified as part of an existing corpus, permitting the implementation of a more coherent community structure for its network of practitioners.

Keywords

Support Vector Machine Peer Review Process Electronic Publishing Expert Reader Coauthorship Network 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1. P. Ginsparg. Winners and losers in the global research village. In Sir R. Elliot and D. Shaw, editors, Electronic Publishing in Science I, Proceedings of joint ICSU Press/UNESCO conference, Paris, 1996 (copy at http://arXiv.org/blurb/pg96unesco.html). URL: http://users.ox.ac.uk/ icsuinfo/ginsparg.htm.Google Scholar
  2. 2. P. Ginsparg. Creating a global knowledge network. In Sir R. Elliot and D. Shaw, editors, Electronic Publishing in Science II, proceedings of joint ICSU Press/UNESCO conference, Paris. ICSU Press, 2001 (copy at http://arXiv.org/blurb/pg01unesco.html). URL: http://users.ox.ac.uk/ icsuinfo/ginspargfin.htm.Google Scholar
  3. 3. P. Ginsparg. Can peer review be better focused?, 2003, Science & Technology Libraries, to appear. URL: http://arXiv.org/blurb/pg02pr.html.Google Scholar
  4. 4. Brinkman Outlines Priorities, Challenges for APS in 2002. APS News, January 2002. URL: http://www.aps.org/apsnews/0102/010208.html.Google Scholar
  5. 5. M. E. J. Newman, Who Is the Best Connected Scientist? A Study of Scientific Coauthorship Networks, Lect. Notes Phys. 650, 337–370 (2004).Google Scholar
  6. 6. R.S. Berry. Is electronic publishing being used in the best interests of science? the scientist’s view. In Sir R. Elliot and D. Shaw, editors, Electronic Publishing in Science II, proceedings of joint ICSU Press/UNESCO conference, Paris, 2001. URL: http://users.ox.ac.uk/~icsuinfo/berryfin.htm.Google Scholar
  7. 7. S. Bachrach, R.S. Berry, M. Blume, T. von Foerster, A. Fowler, P. Ginsparg, S. Heller, N. Kestner, A. Odlyzko, A. Okerson, R. Wigington, and A. Moffat. Who should own scientific papers? Science, 281:1459–1460, 1998.URL: http://www.sciencemag.org/cgi/content/full/281/5382/1459.Google Scholar
  8. 8. S. Rogers and C. Hurt. How scholarly communication should work in the 21st century. The Chronicle of Higher Education, October 18:A56, 1989.Google Scholar
  9. 9. P. Ginsparg. First steps towards electronic research communication. Computers in Physics, 8(4 (Jul/Aug)), 1994.Google Scholar
  10. 10. S. Gass. Transforming scientific communication for the 21st century. Science and Technology Libraries, 19(3/4):3–18, 2001.Google Scholar
  11. 11. D. Stern. eprint moderator model, 1999, version dated Jan 25, 1999.URL: http://www.library.yale.edu/scilib/modmodexplain.html.Google Scholar
  12. 12. R. Kling, L. Spector, and G. McKim. Locally controlled scholarly publishing via the internet: The guild model. The Journal of Electronic Publishing, August, 2002. URL: http://www.press.umich.edu/jep/08-01/kling.html.Google Scholar
  13. 13. J. Willinsky. Copyright contradictions in scholarly publishing. First Monday, 7(11 (November)), 2002. URL: http://firstmonday.org/issues/issue7_11/willinsky/index.html.Google Scholar
  14. 14. J. Kleinberg. Bursty and hierarchical structure in streams. In Proceedings of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2002. URL: http://www.cs.cornell.edu/home/kleinber/kleinber.html.Google Scholar
  15. 15. H. B. O’Connell. Physicists thriving with paperless publishing. HEP Lib. Web., 6:3, arXiv:physics/0007040, 2002. URL: http://arXiv.org/physics/0007040.Google Scholar
  16. 16. M. J. Kurtz, G. Eichhorn, A. Accomazzi, C. Grant, M. Demleitner, S. S. Murray, N. Martimbeau, and B. Elwell. The NASA astrophysics data system: Sociology, bibliometrics and impact. 2003. URL: http://cfa-www.harvard.edu/~kurtz/jasist-submitted.pdf.Google Scholar
  17. 17. P. Ginsparg, P. Houle, T. Joachims, and J.-H. Sul. Mapping subsets of scholarly information. PNAS, to appear, 2004. URL: http://arXiv.org/cs.IR/0312018.Google Scholar
  18. 18. P. Hayes and S. Weinstein. CONSTRUE/TIS: a system for content-based indexing of a database of news stories. In Annual Conference on Innovative Applications of AI, 1990.Google Scholar
  19. 19. G. Salton and C. Buckley. Term weighting approaches in automatic text retrieval. Information Processing and Management, 24(5):513–523, 1988.Google Scholar
  20. 20. M. Porter. An algorithm for suffix stripping. Program (Automated Library and Information Systems), 14(3):130–137, July 1980.Google Scholar
  21. 21. V. Vapnik. Statistical Learning Theory. Wiley, Chichester, GB, 1998.Google Scholar
  22. 22. T. Joachims. Text categorization with support vector machines: Learning with many relevant features. In Proceedings of the European Conference on Machine Learning, pages 137 – 142, Berlin, 1998. Springer. URL: http://www-ai.cs.uni-dortmund.de/DOKUMENTE/joachims_98a.ps.gz.Google Scholar
  23. 23. S. Dumais, J. Platt, D. Heckerman, and M. Sahami. Inductive learning algorithms and representations for text categorization. In Proceedings of ACM-CIKM98, November 1998.Google Scholar
  24. 24. T. Joachims. Learning to Classify Text Using Support Vector Machines – Methods, Theory, and Algorithms. Kluwer, 2002.Google Scholar
  25. 25. C. Burges. A tutorial on support vector machines for pattern recognition. Data Mining and Knowledge Discovery, 2(2):121–167, 1998.Google Scholar
  26. 26. N. Cristianini and J. Shawe-Taylor. An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods. Cambridge University Press, 2000.Google Scholar
  27. 27. I. Guyon, N. Matic, and V. Vapnik. Discovering informative patterns and data cleaning. In Advances in Knowledge Discovery and Data Mining, pages 181–203, 1996.Google Scholar

Authors and Affiliations

  • Paul Ginsparg
    • 1
  1. 1.Departments of Physics and Computing & Information Science, Cornell University, Ithaca, NY 14853USA

Personalised recommendations