Active File Systems for Data Mining and Multimedia
Data mining and multimedia applications require huge amounts of storage and are also compute-intensive. Active disks use the computational power available in the disk to reduce storage traffic. Many of the file system proposals for active disks work at the block level. In this paper we argue that filtering is needed at the application level. We propose two file systems for active disks: the active file system (ACFS), which binds files and filters at the file system level, and the active network file system (ANFS), which extends ACFS over networks. These file systems preserve the familiar Unix file system semantics to a large extent. We present an implementation of the file systems that makes minimal changes to the existing file system code in Linux.
Keywords: File System, System Call, Block Level, Read Request, Input Queue