Skip to main content

Scalable Digital Libraries Based on NCSTRL/Dienst

  • Conference paper
  • First Online:
Research and Advanced Technology for Digital Libraries (ECDL 2000)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1923))

Included in the following conference series:

Abstract

NCSTRL (The Networked Computer Science Technical Report Library) is a successful digital library for scientific and technical information. It uses the Dienst protocol that was developed by ARPA-funded CS-TR project. We encountered several problems while implementing NCSTRL based largescale libraries: UPS for Los Alamos and JDL for JTASC. The document collection for these libraries can range from several hundred thousands to few millions. The first problem we found that the native Dienst implementation does not scale beyond approximately 30,000 records. Secondly we found that the implementation is tightly coupled to the Unix platform. Finally, for a large number of hits the NCSTRL search interface support is limited in terms of usability. To address these problems, we replaced the Dienst repository service implementation with an Oracle-based implementation using servlet technology. The Oracle database stores the index information (metadata) and is partitioned horizontally to speed searching through different archives. Furthermore, indexes were built in order to speed the search by different key items such as the author name, the title and the abstract. Our implementation significantly reduced the average wait time for a user for searches that resulted in a large number of hits. In addition, we get all the other benefits of using servlet technology such as efficiency and portability. In this paper, we present the performance results of the new implementation and compare it with that of the implementation of the Dienst protocol in NCSTRL.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Accomazi, A., Eichhorn, G., Kurtz, M. J., Grant, C. S., Murray, S. S.,: Astronomical information Discovery and Access: Design and Implementation of the ADS Bibliographic Services. Astronomical data Analysis Software and Systems VI, 125, (1997), 357–360

    Google Scholar 

  2. Browne, S., Dongarra, J., Horner, J., McMahan, P., Wells, S.: National HPCC Software Exchange (NHSE): Uniting the High Performance Computing and Communications Community. D-Lib Magazine, May (1998), http://www.dlib.org/dlib/may98/browne/05browne.html

  3. Davis, J.R., Kraft, D. B., Lagoze, C.: Dienst: Building a Production Technical Report Server. Advances in Digital Libraries. Springer-Verlag, (1995), 211–222

    Google Scholar 

  4. Davis, J. R., Lagoze, C.: The Networked Computer Science Technical Report Library. Cornell CS TR96-1595, July, (1996)

    Google Scholar 

  5. Dushay, N., French, J. C., Lagoze, C.: A Characterization Study of NCSTRL Distributed Searching. Cornell CS TR99-1725, January (1999)

    Google Scholar 

  6. Entlich, R., Garson, L., Lesk, M., Normore, L., Olsen, J., Weibel, S.: Making a Digital Library: The Contents of the CORE Project. ACM Transactions on Information Systems, 15(2), (1997), 103–123

    Article  Google Scholar 

  7. Hunter, J, and Crawfor, W.: Java Servlet Programming. O’Reilly and Associates, October1998.

    Google Scholar 

  8. Leiner, B.M.: The NCSTRL Approach to Open Architecture for the Confederated Digital Library. D-Lib Magazine, December (1998)

    Google Scholar 

  9. Maly, K., French, J., Fox, E., Salman, A.: Wide Area Technical Report Service-Technical Reports Online. Communications of the ACM, p. 45, April (1995)

    Google Scholar 

  10. Maly, K., Nelson, M. L, Zubair, M..: Smart Objects, Dumb Archives A User-Centric, Layered Digital Library Framework. D-Lib Magazine, March (1999), Volume 5 Issue 3

    Google Scholar 

  11. Maly, K., Nelson, M. L., Shen, S. N. T, Zubair M..: Buckets: Aggregative, Intelligent Agents for Publishing. WebNet Journal, Vol. 1, No. 1, March (1999), 58–65

    Google Scholar 

  12. Nelson, M. L., Maly, K., Shen, S. N. T., Zubair, M.: NCSTRL+: Adding Multi-Discipline and Multi-Genre Support to the Dienst Protocol Using Clusters and Buckets. Proceeding of Advances in Digital Libraries 98, Santa Barbara, CA, April 22-24 (1998)

    Google Scholar 

  13. Schatz, B., Chen, H.: Building Large-Scale Digital Libraries. IEEE Computer, 29(5), (1996), 22–26

    Article  Google Scholar 

  14. Schatz, B., Mischo, W. H., Cole, T. W., Hardin, J. B., Bishop, A. P., Chen, H.: Federating Diverse Collections of Scientific Literature. IEEE Computer, 29(5), (1996), 28–36

    Article  Google Scholar 

  15. Sompel, H. V., Nelson, M.L., Lyapunov, V.M., Zubair, M., Liu, X., Krichel, T., Hochestenbach, P., Maly, K., Kholief, M., O’Connell, H.:The UPS Prototype project: exploring the obstacles in creating a cross e-print archive end-user service. D-Lib Magazine, February (2000), Volume 6 Issue 2

    Google Scholar 

  16. Sompel, H. V., Lagoze, C.: The Santa Fe Convention of the Open Archives Initiative. DLib Magazine, February (2000)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2000 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Maly, K., Zubair, M., Anan, H., Tan, D., Zhang, Y. (2000). Scalable Digital Libraries Based on NCSTRL/Dienst. In: Borbinha, J., Baker, T. (eds) Research and Advanced Technology for Digital Libraries. ECDL 2000. Lecture Notes in Computer Science, vol 1923. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45268-0_16

Download citation

  • DOI: https://doi.org/10.1007/3-540-45268-0_16

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-41023-2

  • Online ISBN: 978-3-540-45268-3

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics