Skip to main content

Using the Object-Oriented PowerShell for Simple Proteomics Data Analysis

  • Protocol
  • First Online:
Mass Spectrometry Data Analysis in Proteomics

Part of the book series: Methods in Molecular Biology ((MIMB,volume 2051))

Abstract

Scripting languages such as Python and Bash are appreciated for solving simple, everyday tasks in bioinformatics. A more recent, object-oriented command shell and scripting language, PowerShell, has many attractive features: an object-oriented interactive command line, fluent navigation and manipulation of XML files, ability to explore and consume Web services from the command line, consistent syntax and grammar, rich regular expressions, and advanced output formatting. The key difference between classical command shells and scripting languages, such as bash, and object-oriented ones, such as PowerShell, is that in the latter the result of a command is a structured object with inherited properties and methods rather than a simple stream of characters. Conveniently, PowerShell is included in all new releases of Microsoft Windows and is available for Linux and macOS, making any data processing script portable. In this chapter we demonstrate how PowerShell in particular allows easy interaction with mass spectrometry data in XML formats, connection to Web services for tools such as BLAST, and presentation of results as formatted text or graphics. These features make PowerShell much more than “yet another scripting language.”

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Protocol
USD 49.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 199.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Stajich JE (2007) An introduction to BioPerl. Methods Mol Biol 406:535–548

    CAS  PubMed  Google Scholar 

  2. Cock PJ et al (2009) Biopython: freely available Python tools for computational molecular biology and bioinformatics. Bioinformatics 25(11):1422–1423

    Article  CAS  Google Scholar 

  3. Reimers M, Carey VJ (2006) Bioconductor: an open source framework for bioinformatics and computational biology. Methods Enzymol 411:119–134

    Article  CAS  Google Scholar 

  4. Schmitt T et al (2011) Letter to the editor: SeqXML and OrthoXML: standards for sequence and orthology information. Brief Bioinform 12(5):485–488

    Article  Google Scholar 

  5. NeXML - phylogenetic data as xml (2012). http://www.nexml.org

  6. Han MV, Zmasek CM (2009) phyloXML: XML for evolutionary biology and comparative genomics. BMC Bioinformatics 10:356

    Article  Google Scholar 

  7. Spellman PT et al (2002) Design and implementation of microarray gene expression markup language (MAGE-ML). Genome Biol 3(9):RESEARCH0046

    Article  Google Scholar 

  8. Picardi E et al (2011) ExpEdit: a webserver to explore human RNA editing in RNA-Seq experiments. Bioinformatics 27(9):1311–1312

    Article  CAS  Google Scholar 

  9. SRA Format (2012). http://www.ebi.ac.uk/ena/about/sra_format

  10. Pedrioli PG et al (2004) A common open representation of mass spectrometry data and its application to proteomics research. Nat Biotechnol 22(11):1459–1466

    Article  CAS  Google Scholar 

  11. Deutsch E (2008) mzML: a single, unifying data format for mass spectrometer output. Proteomics 8(14):2776–2777

    Article  CAS  Google Scholar 

  12. Martens L et al (2011) mzML—a community standard for mass spectrometry data. Mol Cell Proteomics 10(1):R110.000133

    Article  Google Scholar 

  13. Keller A et al (2005) A uniform proteomics MS/MS analysis platform utilizing open XML file formats. Mol Syst Biol 1:2005.0017

    Article  Google Scholar 

  14. Eisenacher M (2011) mzIdentML: an open community-built standard format for the results of proteomics spectrum identification algorithms. Methods Mol Biol 696:161–177

    Article  CAS  Google Scholar 

  15. Microsoft Powershell Documentation (2018). https://docs.microsoft.com/en-us/powershell/

  16. GitHub Powershell Project (2018). https://github.com/PowerShell/PowerShell

  17. Fourment M, Gillings MR (2008) A comparison of common programming languages used in bioinformatics. BMC Bioinformatics 9:82

    Article  Google Scholar 

  18. Dudley JT, Butte AJ (2009) A quick guide for developing effective bioinformatics programming skills. PLoS Comput Biol 5(12):e1000589

    Article  Google Scholar 

  19. Hay RT (2005) SUMO: a history of modification. Mol Cell 18(1):1–12

    Article  CAS  Google Scholar 

  20. Aasland R et al (2002) Normalization of nomenclature for peptide motifs as ligands of modular protein domains. FEBS Lett 513(1):141–144

    Article  CAS  Google Scholar 

  21. MacDonald R (2011) Charting with PowerShell. http://blogs.technet.com/b/richard_macdonald/archive/2009/04/28/3231887.aspx

    Google Scholar 

  22. Lam H et al (2007) Development and validation of a spectral library searching method for peptide identification from MS/MS. Proteomics 7(5):655–667

    Article  CAS  Google Scholar 

  23. Camacho C, Madden T (2011) SOAP-based BLAST Web Service. http://www.ncbi.nlm.nih.gov/books/NBK55699/

    Google Scholar 

  24. Oinn T et al (2004) Taverna: a tool for the composition and enactment of bioinformatics workflows. Bioinformatics 20(17):3045–3054

    Article  CAS  Google Scholar 

  25. Altschul SF et al (1990) Basic local alignment search tool. J Mol Biol 215(3):403–410

    Article  CAS  Google Scholar 

  26. NCBI BLAST (SOAP) (2011). http://www.ebi.ac.uk/Tools/webservices/services/sss/ncbi_blast_soap

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yassene Mohammed .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Science+Business Media, LLC, part of Springer Nature

About this protocol

Check for updates. Verify currency and authenticity via CrossMark

Cite this protocol

Mohammed, Y., Palmblad, M. (2020). Using the Object-Oriented PowerShell for Simple Proteomics Data Analysis. In: Matthiesen, R. (eds) Mass Spectrometry Data Analysis in Proteomics. Methods in Molecular Biology, vol 2051. Humana, New York, NY. https://doi.org/10.1007/978-1-4939-9744-2_17

Download citation

  • DOI: https://doi.org/10.1007/978-1-4939-9744-2_17

  • Published:

  • Publisher Name: Humana, New York, NY

  • Print ISBN: 978-1-4939-9743-5

  • Online ISBN: 978-1-4939-9744-2

  • eBook Packages: Springer Protocols

Publish with us

Policies and ethics