Abstract
Exceptionally large amounts of both distributed data and computational resources are becoming available through the Grid. This will enable efficient exchange and processing of very large amounts of data combined with CPU intensive computations, as required by many scientific applications. We propose a customizable Grid-based query processor built on top of an established Grid infrastructure, NorduGrid. It allows users to submit queries wrapping user-defined long running operations that filter and transform distributed customized data. Limitations imposed by the used Grid infrastructure influence the architecture. For example, resource requirements have to be specified before Grid jobs are started and delays may occur based on the availability of required resources for a job. We are developing a fully functional prototype to investigate the viability of the approach and its applicability. Our first application area is Particle Physics where scientists analyze huge amount of data produced by a collider or simulators to identify particles.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
ATLAS Data Challenges (2001), [Online], http://atlas.web.cern.ch/Atlas/GROUPS/SOFTWARE/DC/doc/AtlasDCs.pdf
ATLAS Data Challenge 1, Draft 3.0 (2003), [Online], http://atlas.web.cern.ch/Atlas/GROUPS/SOFTWARE/DC/DC1/DC1-Report -V3[1].0-22092003.pdf
Bethke, S., Calvetti, M., Hoffmann, H.F., Jacobs, D., Kasemann, M., Linglin, D.: Report of the Steering Group of the LHC Computing Review, CERN/LHCC/2001–004 (2001), Available at http://lhcb-comp.web.cern.ch/lhcb-comp/Reviews/LHCComputing2000/Report_final.pdf
Bukhres, O., Elmagarmid, A. (eds.): Object-Oriented Multidatabase Systems: a Solution for Advanced Applications. Prentice Hall, Englewood Cliffs (1996)
Bouganim, L., Fabret, F., Porto, F., Valduriez, P.: Processing Queries with Expensive Functions and Large Objects in Distributed Mediator Systems. In: Proc. of Int. Conf. on Data Engineering, ICDE (2001)
Bisset, M., Moortgat, F., Moretti, S.: Trilepton+top signal from chargino-neutralino decays of MSSM charged Higgs bosons at the LHC. Eur.Phys.J. C30, 419–434 (2003)
Brun, R., Rademakers, F.: ROOT – An Object Oriented Data Analysis Framework. In: Proceedings AIHENP 1996 Workshop, Lausanne (September 1996); Nucl. Inst. & Meth. in Phys. Res. A 389, 81–86 (1997)
Condor-G System, [Online], http://www.cs.wisc.edu/condor/condorg/
Czajkowski, K., Foster, I., Karonis, N., Kesselman, C., Martin, S., Smith, W., Tuecke, S.: A Resource Management Architecture for Metacomputing Systems. In: Proc. IPPS/SPDP 1998 Workshop on Job Scheduling Strategies for Parallel Processing, pp. 62–82 (1998)
Directed Acyclic Graph Manager, [Online], http://www.cs.wisc.edu/condor/dagman/
Eerola, P., Ekelof, T., Ellert, M., Hansen, J.R., Hellman, S., Konstantinov, A., Kónya, B., Myklebust, T., Nielsen, J.L., Ould-Saada, F., Smirnova, O., Wäänänen, A.: Atlas Data- Challenge 1 on NorduGrid, ECONF C0303241:MOCT011 (2003)
Elmasri, R., Navathe, S.B.: Enhanced Entity-Relationship and UML Modeling. In: Fundamentals of Database Systems, 4th edn., pp. 85–121. Addison-Wesley, Reading (2004)
Flodin, S., Hansson, M., Josifovski, V., Katchaounov, T., Risch, T., Sköld, M.: Amos II Release 6 User’s Manual, [Online] (2004), http://user.it.uu.se/~udbl/amos/doc/amos_users_guide.html
Foster, I., Kesselman, C.: Globus: A Toolkit-Based Grid Architecture. In: The Grid: Blueprint for a New Computing Infrastructure, pp. 259–278. Morgan Kaufmann, San Francisco (1999)
Fahl, G., Risch, T.: Query Processing over Object Views of Relational Data. The VLDB Journal 6(4), 261–281 (1997), Available at http://www.dis.uu.se/~udbl/publ/vldbj97.pdf
Frey, J., Tannenbaum, T., Foster, I., Livny, M., Tuecke, S.: Condor-G: A Computation Management Agent for Multi-Institutional Grids. In: Proceedings of the Tenth IEEE Symposium on High Performance Distributed Computing (HPDC), San Francisco, California, pp. 7–9 (2001)
Globus I/O API, [Online], http://www-unix.globus.org/api/c-globus-3.2/globus_io/html/
Gounaris, A., Paton, N.W., Fernances, A.A.A., Sakellariou, R.: Adaptive Query Processing: A Survey. In: Proceedings of the 19th British National Conference on Databases: Advances in Databases (2002)
Garcia-Molina, H., Papakonstantinou, Y., Quass, D., Rajaraman, A., Sagiv, Y., Ullman, J., Vassalos, V., Widom, J.: The TSIMMIS Approach to Mediation: Data Models and Languages. Journal of Intelligent Information Systems (JIIS) 8(2), 117–132 (1997)
Garcia-Molina, H., Salem, K.: Main Memory Database Systems: An Overview. IEEE Transactions on Knowledge and Data Engineering 4(6), 509–516 (1992)
Haas, L., Kossmann, D., Wimmers, E.L., Yang, J.: Optimizing Queries across Diverse Data Sources. In: 23rd Intl. Conf. on Very Large Databases (VLDB 1997), pp. 276–285 (1997)
Josifovski, V., Risch, T.: Query Decomposition for a Distributed Object-Oriented Mediator System. Distributed and Parallel Databases, J. Kluwer, 307–336 (2002), Available at http://user.it.uu.se/~torer/publ/dpdb.pdf
Karlsson, J.S., Lal, A., Leung, C., Pham, T.: IBM DB2 Everyplace: A Small Footprint Relational Database System. In: 17th International Conference on Data Engineering (2001)
Konstantinov, A.: The NorduGrid Grid Manager and GridFTP Server, Description and Administrator’s Manual, [Online] (2004), http://www.nordugrid.org/papers.html
Kónya, B.: The NorduGrid Information System, [Online] (2002), http://www.nordugrid.org/documents/ng-infosys.pdf
The Large Hadron Collider, [Online], http://lhc-new-homepage.web.cern.ch/lhc-new-homepage/
Liu, L., Pu, C.: An Adaptive Object-Oriented Approach to Integration and Access of Heterogeneous Information Sources. Distributed and Parallel Databases 5(2), 167–205 (1997)
Lin, H., Risch, T., Katchanounov, T.: Adaptive data mediation over XML data. In: special issue on Web Information Systems Applications. Journal of Applied System Studies (JASS), Cambridge International Science Publishing, 3(2) (2002), Available at http://user.it.uu.se/~torer/publ/jass01.pdf
The Message Passing Interface (MPI) Standard, [Online], http://www-unix.mcs.anl.gov/mpi/
The NorduGrid User Guide, [Online] (2004), http://www.nordugrid.org/documents/userguide.pdf
NorduGrid middleware, the Advance Resource Connector, [Online], http://www.nordugrid.org/middleware/
ORACLE Inc.: Oracle8i Lite: The Internet Platform for Mobile Computing
Özsu, M.T., Valduriez, P.: Principles of Distributed Database Systems, 2nd edn. Prentice Hall, Englewood Cliffs (1999)
Quass, D., Rajaraman, A., Sagiv, Y., Ullman, J., Widom, J.: Querying Semistructured Heterogeneous Information. In: Ling, T.-W., Vieille, L., Mendelzon, A.O. (eds.) DOOD 1995. LNCS, vol. 1013, pp. 319–344. Springer, Heidelberg (1995)
Risch, T., Josifovski, V.: Distributed Data Integration by Object-Oriented Mediator Servers. Concurrency and Computation: Practice and Experience J. 13(11), John Wiley & Sons (September 2001), Available at http://user.it.uu.se/~udbl/publ/ddiooms.pdf
ROOT: An Object-Oriented Data Analysis Framework – User’s Guide, [Online] (2004), http://root.cern.ch/root/doc/RootDoc.html
Swegrid, [Online], www.swegrid.se
SYBASE Inc.: SQL Anywhere Studio, http://www.sybase.com/mobile/
Smirnova, O.: Extended Resource Specification Language, [Online] (2003), http://www.nordugrid.org/documents/xrsl.pdf
Stonebraker, M., Brown, P.: Object-Relational DBMSs: Tracking the Next Great Wave, 2nd edn. Morgan Kaufmann Publishers, San Francisco (1999)
Smirnova, O., Eerola, P., Ekelöf, T., Ellert, M., Hansen, J.R., Konstantinov, A., Kónya, B., Nielsen, J.L., Ould-Saada, F., Wäänänen, A.: The NorduGrid Architecture and Middleware for Scientific Applications. In: International Conference on Computational Science, pp. 264–273 (2003)
Smith, J., Gounaris, A., Watson, P., Paton, N.W., Fernandes, A.A.A., Sakellariou, R.: Distributed Query Processing on the Grid. In: Parashar, M. (ed.) GRID 2002. LNCS, vol. 2536, pp. 279–290. Springer, Heidelberg (2002)
Tomasic, L., Raschid, P.: Valduriez: Scaling Access to Heterogeneous Data Sources with DISCO. IEEE Transactions on Knowledge and Date Engineering 10(5), 808–823 (1998)
Wiederhold, G.: Mediators in the architecture of future information systems. IEEE Computer 25(3), 38–49 (1992)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Fomkin, R., Risch, T. (2004). Managing Long Running Queries in Grid Environment. In: Meersman, R., Tari, Z., Corsaro, A. (eds) On the Move to Meaningful Internet Systems 2004: OTM 2004 Workshops. OTM 2004. Lecture Notes in Computer Science, vol 3292. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30470-8_28
Download citation
DOI: https://doi.org/10.1007/978-3-540-30470-8_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23664-1
Online ISBN: 978-3-540-30470-8
eBook Packages: Springer Book Archive