Abstract
Scripting languages such as Python and Bash are appreciated for solving simple, everyday tasks in bioinformatics. A more recent, object-oriented command shell and scripting language, PowerShell, has many attractive features: an object-oriented interactive command line, fluent navigation and manipulation of XML files, ability to explore and consume Web services from the command line, consistent syntax and grammar, rich regular expressions, and advanced output formatting. The key difference between classical command shells and scripting languages, such as bash, and object-oriented ones, such as PowerShell, is that in the latter the result of a command is a structured object with inherited properties and methods rather than a simple stream of characters. Conveniently, PowerShell is included in all new releases of Microsoft Windows and is available for Linux and macOS, making any data processing script portable. In this chapter we demonstrate how PowerShell in particular allows easy interaction with mass spectrometry data in XML formats, connection to Web services for tools such as BLAST, and presentation of results as formatted text or graphics. These features make PowerShell much more than “yet another scripting language.”
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Stajich JE (2007) An introduction to BioPerl. Methods Mol Biol 406:535–548
Cock PJ et al (2009) Biopython: freely available Python tools for computational molecular biology and bioinformatics. Bioinformatics 25(11):1422–1423
Reimers M, Carey VJ (2006) Bioconductor: an open source framework for bioinformatics and computational biology. Methods Enzymol 411:119–134
Schmitt T et al (2011) Letter to the editor: SeqXML and OrthoXML: standards for sequence and orthology information. Brief Bioinform 12(5):485–488
NeXML - phylogenetic data as xml (2012). http://www.nexml.org
Han MV, Zmasek CM (2009) phyloXML: XML for evolutionary biology and comparative genomics. BMC Bioinformatics 10:356
Spellman PT et al (2002) Design and implementation of microarray gene expression markup language (MAGE-ML). Genome Biol 3(9):RESEARCH0046
Picardi E et al (2011) ExpEdit: a webserver to explore human RNA editing in RNA-Seq experiments. Bioinformatics 27(9):1311–1312
SRA Format (2012). http://www.ebi.ac.uk/ena/about/sra_format
Pedrioli PG et al (2004) A common open representation of mass spectrometry data and its application to proteomics research. Nat Biotechnol 22(11):1459–1466
Deutsch E (2008) mzML: a single, unifying data format for mass spectrometer output. Proteomics 8(14):2776–2777
Martens L et al (2011) mzML—a community standard for mass spectrometry data. Mol Cell Proteomics 10(1):R110.000133
Keller A et al (2005) A uniform proteomics MS/MS analysis platform utilizing open XML file formats. Mol Syst Biol 1:2005.0017
Eisenacher M (2011) mzIdentML: an open community-built standard format for the results of proteomics spectrum identification algorithms. Methods Mol Biol 696:161–177
Microsoft Powershell Documentation (2018). https://docs.microsoft.com/en-us/powershell/
GitHub Powershell Project (2018). https://github.com/PowerShell/PowerShell
Fourment M, Gillings MR (2008) A comparison of common programming languages used in bioinformatics. BMC Bioinformatics 9:82
Dudley JT, Butte AJ (2009) A quick guide for developing effective bioinformatics programming skills. PLoS Comput Biol 5(12):e1000589
Hay RT (2005) SUMO: a history of modification. Mol Cell 18(1):1–12
Aasland R et al (2002) Normalization of nomenclature for peptide motifs as ligands of modular protein domains. FEBS Lett 513(1):141–144
MacDonald R (2011) Charting with PowerShell. http://blogs.technet.com/b/richard_macdonald/archive/2009/04/28/3231887.aspx
Lam H et al (2007) Development and validation of a spectral library searching method for peptide identification from MS/MS. Proteomics 7(5):655–667
Camacho C, Madden T (2011) SOAP-based BLAST Web Service. http://www.ncbi.nlm.nih.gov/books/NBK55699/
Oinn T et al (2004) Taverna: a tool for the composition and enactment of bioinformatics workflows. Bioinformatics 20(17):3045–3054
Altschul SF et al (1990) Basic local alignment search tool. J Mol Biol 215(3):403–410
NCBI BLAST (SOAP) (2011). http://www.ebi.ac.uk/Tools/webservices/services/sss/ncbi_blast_soap
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Science+Business Media, LLC, part of Springer Nature
About this protocol
Cite this protocol
Mohammed, Y., Palmblad, M. (2020). Using the Object-Oriented PowerShell for Simple Proteomics Data Analysis. In: Matthiesen, R. (eds) Mass Spectrometry Data Analysis in Proteomics. Methods in Molecular Biology, vol 2051. Humana, New York, NY. https://doi.org/10.1007/978-1-4939-9744-2_17
Download citation
DOI: https://doi.org/10.1007/978-1-4939-9744-2_17
Published:
Publisher Name: Humana, New York, NY
Print ISBN: 978-1-4939-9743-5
Online ISBN: 978-1-4939-9744-2
eBook Packages: Springer Protocols