L-SME: A System for Mining Loosely Structured Motifs

Fassetti, Fabio; Greco, Gianluigi; Terracina, Giorgio

doi:10.1007/978-3-642-23808-6_42

Fabio Fassetti²³,
Gianluigi Greco²⁴ &
Giorgio Terracina²⁴

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6913))

Included in the following conference series:

Joint European Conference on Machine Learning and Knowledge Discovery in Databases

5504 Accesses

Abstract

We present L-SME, a system to efficiently identify loosely structured motifs in genome-wide applications. L-SME is innovative in three aspects. Firstly, it handles wider classes of motifs than earlier motif discovery systems, by supporting boxes swaps and skips in the motifs structure as well as various kinds of similarity functions. Secondly, in addition to the standard exact search, it supports search via randomization in which guarantees on the quality of the results can be given a-priori based on user-definable resource (time and space) constraints. Finally, L-SME comes equipped with an intuitive graphical interface through which the structure for the motifs of interest can be defined, the discovery method can be selected, and results can be visualized. The tool is flexible and scalable, by allowing genome-wide searches for very complex motifs and is freely accessible at http://siloe.deis.unical.it/l-sme. A detailed description of the algorithms underlying L-SME is available in [1].

Download to read the full chapter text

Chapter PDF

SimpLiSMS: A Simple, Lightweight and Fast Approach for Structured Motifs Searching

Motif-based analysis of large nucleotide data sets using MEME-ChIP

Article 22 May 2014

MoTeX-II: structured MoTif eXtraction from large-scale datasets

Article Open access 08 July 2014

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Fassetti, F., Greco, G., Terracina, G.: Mining loosely structured motifs from biological data. IEEE Transaction on Knowledge and Data Engineering 20(11), 1472–1489 (2008)
Article Google Scholar
Hughes, J.D., Estep, P.W., Tavazoie, S., Church, G.M.: Computational identification of cis-regulatory elements associated with groups of functionally related genes in saccharomyces cerevisiae. Journal of Molecular Biology 296(5), 1205–1214 (2000)
Article Google Scholar
Marsan, L., Sagot, M.-F.: Algorithms for extracting structured motifs using a suffix tree with an application to promoter and regulatory site consensus identification. Journal of Computational Biology 7(3-4), 345–362 (2000)
Article Google Scholar
Osanai, M., Takahashi, H., Kojima, K.K., Hamada, M., Fujiwara, H.: Essential motifs in the 3’ untranslated region required for retrotransposition and the precise start of reverse transcription in non-long-terminal-repeat retrotransposon SART1. Mol. Cell. Biol. 24(19), 7902–7913 (2004)
Article Google Scholar
Sandve, G.K., Drabls, F.: A survey of motif discovery methods in an integrated framework. Biology Direct 1(11), 1–16 (2006)
Google Scholar
Sinha, S., Tompa, M.: YMF: A program for discovery of novel transcription factor binding sites by statistical overrepresentation. Nucleic Acid Research 31(13), 3586–3588 (2003)
Article Google Scholar
Tu, Z., Li, S., Mao, C.: The changing tails of a novel short interspersed element in aedes aegypti: genomic evidence for slippage retrotransposition and the relationship between 3’ tandem repeats and the poly(da) tail. Genetics 168(4), 2037–2047 (2004)
Article Google Scholar

Download references

Author information

Authors and Affiliations

ICAR-CNR, Italy
Fabio Fassetti
Dep. of Mathematics, Via P. Bucci, 87036, Rende, CS, Italy
Gianluigi Greco & Giorgio Terracina

Authors

Fabio Fassetti
View author publications
You can also search for this author in PubMed Google Scholar
Gianluigi Greco
View author publications
You can also search for this author in PubMed Google Scholar
Giorgio Terracina
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Informatics and Telecommunications, University of Athens, Panepistimioupolis, Ilisia, 15784, Athens, Greece
Dimitrios Gunopulos
Google Switzerland GmbH, Brandschenkestrasse 110, 8002, Zurich, Switzerland
Thomas Hofmann
Department of Computer Science, University of Bari “Aldo Moro”, via Orabona 4, 70125, Bari, Italy
Donato Malerba
Deptartment of Informatics, Athens University of Economics and Business, Patision 76, 10434, Athens, Greece
Michalis Vazirgiannis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Fassetti, F., Greco, G., Terracina, G. (2011). L-SME: A System for Mining Loosely Structured Motifs. In: Gunopulos, D., Hofmann, T., Malerba, D., Vazirgiannis, M. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2011. Lecture Notes in Computer Science(), vol 6913. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23808-6_42

Download citation

DOI: https://doi.org/10.1007/978-3-642-23808-6_42
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23807-9
Online ISBN: 978-3-642-23808-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

L-SME: A System for Mining Loosely Structured Motifs

Abstract

Chapter PDF

Similar content being viewed by others

SimpLiSMS: A Simple, Lightweight and Fast Approach for Structured Motifs Searching

Motif-based analysis of large nucleotide data sets using MEME-ChIP

MoTeX-II: structured MoTif eXtraction from large-scale datasets

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

L-SME: A System for Mining Loosely Structured Motifs

Abstract

Chapter PDF

Similar content being viewed by others

SimpLiSMS: A Simple, Lightweight and Fast Approach for Structured Motifs Searching

Motif-based analysis of large nucleotide data sets using MEME-ChIP

MoTeX-II: structured MoTif eXtraction from large-scale datasets

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation