About these proceedings
Understanding the mechanisms involved in life (e. g. , discovering the biological functionofasetofproteins,inferringtheevolutionofasetofspecies)isbecoming increasinglydependent onprogressmade inmathematics,computer science,and molecular engineering. For the past 30 years, new high-throughput technologies have been developed generating large amounts of data, distributed across many data sources on the Web, with a high degree of semantic heterogeneity and di?erentlevelsofquality. However,onesuchdatasetisnot,byitself,su?cientfor scienti?c discovery. Instead, it must be combined with other data and processed by bioinformatics tools for patterns, similarities, and unusual occurrences to be observed. Both data integration and data mining are thus of paramount importance in life science. DILS 2007 was the fourth in a workshop series that aims at fostering d- cussion, exchange, and innovation in research and development in the areas of data integration and data management for the life sciences. Each previous DILS workshop attracted around 100 researchers from all over the world. This year, the number of submitted papers again increased. The Program Committee - lected 19 papers out of 52 full submissions. The DILS 2007 papers cover a wide spectrum of theoretical and practical issues including scienti?c work?ows, - notation in data integration, mapping and matching techniques, and modeling of life science data. Among the papers, we distinguished 13 papers presenting research on new models, methods, or algorithms and 6 papers presenting imp- mentation of systems or experience with systems in practice. In addition to the presented papers, DILS 2007 featured two keynote talks by Kenneth H. Buetow, National Cancer Institute, and Junhyong Kim, University of Pennsylvania.
Annotation In silico architecture bioinformatics biology biomedical data analysis clustering data analysis databases genome indexing instance-based matching interoperability life sciences medical