Skip to main content

SeqinR 1.0-2: A Contributed Package to the R Project for Statistical Computing Devoted to Biological Sequences Retrieval and Analysis

  • Chapter

Part of the book series: Biological and Medical Physics, Biomedical Engineering ((BIOMEDICAL))

The seqinR package for the R environment is a library of utilities to retrieve and analyze biological sequences. It provides an interface between: (i) the R language and environment for statistical computing and graphics, and (ii) the ACNUC sequence retrieval system for nucleotide and protein sequence databases such as GenBank, EMBL, SWISS-PROT. ACNUC is very efficient in providing direct access to subsequences of biological interest (e.g., protein coding regions, tRNA, or rRNA coding regions) present in GenBank and in EMBL. Thanks to a simple query language, it is then easy under R to select sequences of interest and then use all the power of the R environment to analyze them. The ACNUC databases can be locally installed but they are more conveniently accessed through a web server to take advantage of centralized daily updates. The aim of this chapter is to provide a handout on basic sequence analyses under seqinR with a special focus on multivariate methods.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. R. Ihaka, R.Gentleman, J. Comp. Graph. Stat. 3, 299 (1996)

    Google Scholar 

  2. R Development Core Team, R: A language and environment for statistical computing (ISBN 3-900051-00-3, 2004) http://www.R-project.org

  3. F. Leisch, Proceedings in Computational Statistics, 575 (2002) (ISBN 3-7908-1517-9)

    Google Scholar 

  4. K. Hornik, The R FAQ (ISBN 3-900051-08-9, 2005) http://CRAN.R-project.org/doc/FAQ/

  5. J. Keogh, Australian Patent Office application number AU 2001100012 A4 (2001). www.ipmenu.com/archive/AUI_2001100012.eps

  6. J.R. Lobry, N. Sueoka, Genome Biology3, research0058.1(2002) http://genomebiology.com/2002/3/10/research/0058

  7. J. Buckheit, D.L. Donoho, in Wavelets and Statistics, ed. by A. Antoniadis (Springer, Berlin, New York, 1995)

    Google Scholar 

  8. D. Charif, J. Thioulouse, J.R. Lobry, G. Perrière, Bioinformatics 21, 545 (2005); http://pbil.univ-lyon1.fr/members/lobry/repro/bioinfo04/

  9. R. Rudner, J.D. Karkas, E. Chargaff, Proc. Natl. Acad. Sci. USA 63, 152 (1969)

    Article  ADS  Google Scholar 

  10. J.R. Lobry, Lecture Notes Comput. Sci. 3039, 679 (2004). http://pbil. univ-lyon1.fr/members/lobry/repro/lncs04/

  11. A.C. Frank, J.R. Lobry, Bioinformatics 16, 560 (2000)

    Article  Google Scholar 

  12. P. Mackiewicz, J. Zakrzewska-Czerwinska, A. Zawilak, M.R. Dudek, S. Cebrat, Nucleic Acids Res. 32, 3781 (2004)

    Article  Google Scholar 

  13. P. Legendre, Y. Desdevises, E. Bazin, Syst. Biol. 51, 217 (2002)

    Article  Google Scholar 

  14. N. Saitou, M. Nei, Mol. Biol. Evol. 4, 406 (1984)

    Google Scholar 

  15. T.H. Jukes, C.R. Cantor, in Mammalian Protein Metabolism, ed. by H.N. Munro (Academic, New York, 1969) pp. 21-132

    Google Scholar 

  16. M. Kimura, J. Mol. Evol. 16, 111 (1980)

    Article  Google Scholar 

  17. G. Perrière, J. Thioulouse, Nucleic Acids Res. 30, 4548 (2002)

    Article  Google Scholar 

  18. . C. Gautier, Ph.D. Thesis (1987), Université Claude Bernard - Lyon I

    Google Scholar 

  19. . J.R. Lobry, C. Gautier, Nucleic Acids Res. 22, 3174 (1994). http://pbil.univ-lyon1.fr/members/lobry/repro/nar94/

  20. . J.R. Lobry, D. Chessel, J. Appl. Genet.44, 235(2003). http://jay.au.poznan.pl/html1/JAG/pdfy/lobry.eps

  21. W.-H. Li, J. Mol. Evol. 36, 96 (1993)

    Article  Google Scholar 

  22. L.D. Hurst, Trends Genet. 18, 486 (2002)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Charif, D., Lobry, J.R. (2007). SeqinR 1.0-2: A Contributed Package to the R Project for Statistical Computing Devoted to Biological Sequences Retrieval and Analysis. In: Bastolla, U., Porto, M., Roman, H.E., Vendruscolo, M. (eds) Structural Approaches to Sequence Evolution. Biological and Medical Physics, Biomedical Engineering. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-35306-5_10

Download citation

Publish with us

Policies and ethics