Training Bidirectional Recurrent Neural Network Architectures with the Scaled Conjugate Gradient Algorithm

Agathocleous, Michalis; Christodoulou, Chris; Promponas, Vasilis; Kountouris, Petros; Vassiliades, Vassilis

doi:10.1007/978-3-319-44778-0_15

Training Bidirectional Recurrent Neural Network Architectures with the Scaled Conjugate Gradient Algorithm

Michalis Agathocleous¹⁶,
Chris Christodoulou¹⁶,
Vasilis Promponas¹⁷,
Petros Kountouris¹⁸ &
…
Vassilis Vassiliades^16,19

Conference paper
First Online: 13 August 2016

2679 Accesses
1 Citations

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 9886))

Abstract

Predictions on sequential data, when both the upstream and downstream information is important, is a difficult and challenging task. The Bidirectional Recurrent Neural Network (BRNN) architecture has been designed to deal with this class of problems. In this paper, we present the development and implementation of the Scaled Conjugate Gradient (SCG) learning algorithm for BRNN architectures. The model has been tested on the Protein Secondary Structure Prediction (PSSP) and Transmembrane Protein Topology Prediction problems (TMPTP). Our method currently achieves preliminary results close to 73 % correct predictions for the PSSP problem and close to 79 % for the TMPTP problem, which are expected to increase with larger datasets, external rules, ensemble methods and filtering techniques. Importantly, the SCG algorithm is training the BRNN architecture approximately 3 times faster than the Backpropagation Through Time (BPTT) algorithm.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Schuster, M., Paliwal, K.K.: IEEE Trans. Signal Proces. 45, 2673–2681 (1997)
Article Google Scholar
Dietterich, T.G.: Machine learning for sequential data: a review. In: Caelli, T.M., Amin, A., Duin, R.P.W., Kamel, M.S., de Ridder, D. (eds.) SSPR&SPR 2002. LNCS, vol. 2396, pp. 15–30. Springer, Heidelberg (2002)
Chapter Google Scholar
Elman, J.L.: Cogn. Sci. 14, 179–211 (1990)
Article Google Scholar
Werbos, P.J.: Proc. IEEE 78(10), 1550–1560 (1990)
Article Google Scholar
Frasconi, P., Gori, M., Sperduti, A.: IEEE Trans. Neural Netw. 9, 768–786 (1998)
Article Google Scholar
Møller, M.F.: Neural Netw. 6, 525–533 (1993)
Article Google Scholar
Hochreiter, S., Schmidhuber, J.: Neural Comput. 9, 1735–1780 (1997)
Article Google Scholar
Baldi, P., Brunak, S., Frasconi, P., Soda, G., Pollastri, G.: Bioinformatics 15, 937–946 (1999)
Article Google Scholar
Kountouris, P., Agathocleous, M., Promponas, V., Christodoulou, G., Hadjicostas, S., Vassiliades, V., Christodoulou, C.: IEEE ACM Trans. Comput. Biol. Bioinform. 9, 731–739 (2012)
Article Google Scholar
Agathocleous, M., Christodoulou, G., Promponas, V., Christodoulou, C., Vassiliades, V., Antoniou, A.: Protein secondary structure prediction with Bidirectional recurrent neural nets: can weight updating for each residue enhance performance? In: Papadopoulos, H., Andreou, A.S., Bramer, M. (eds.) AIAI 2010. IFIP AICT, vol. 339, pp. 128–137. Springer, Heidelberg (2010)
Chapter Google Scholar
Nugent, T., Jones, D.T.: BMC Bioinf. 10, 159 (2009)
Article Google Scholar
Altschul, S.F., Madden, T.L., Schäffer, A.A., Zhang, A., Zhang, Z., Miller, W., Lipman, D.J.: Nucleic Acids Res. 25, 3389–3402 (1997)
Article Google Scholar
Cuff, J.A., Barton, G.J.: Proteins 34, 508–519 (1999)
Article Google Scholar
Richards, F., Kundrot, C.: Proteins 3, 71–84 (1988)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, University of Cyprus, P.O. Box 20537, 1678, Nicosia, Cyprus
Michalis Agathocleous, Chris Christodoulou & Vassilis Vassiliades
Dept. of Biological Sciences, University of Cyprus, P.O. Box 20537, 1678, Nicosia, Cyprus
Vasilis Promponas
The Cyprus Institute of Neurology and Genetics, Nicosia, Cyprus
Petros Kountouris
Inria, Nancy - Grand Est, France
Vassilis Vassiliades

Authors

Michalis Agathocleous
View author publications
You can also search for this author in PubMed Google Scholar
Chris Christodoulou
View author publications
You can also search for this author in PubMed Google Scholar
Vasilis Promponas
View author publications
You can also search for this author in PubMed Google Scholar
Petros Kountouris
View author publications
You can also search for this author in PubMed Google Scholar
Vassilis Vassiliades
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Chris Christodoulou .

Editor information

Editors and Affiliations

University of Lausanne, Lausanne, Switzerland
Alessandro E.P. Villa
University of Lausanne, Lausanne, Switzerland
Paolo Masulli
Universitat Politécnica de Catalunya, Terrrassa, Spain
Antonio Javier Pons Rivero

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Agathocleous, M., Christodoulou, C., Promponas, V., Kountouris, P., Vassiliades, V. (2016). Training Bidirectional Recurrent Neural Network Architectures with the Scaled Conjugate Gradient Algorithm. In: Villa, A., Masulli, P., Pons Rivero, A. (eds) Artificial Neural Networks and Machine Learning – ICANN 2016. ICANN 2016. Lecture Notes in Computer Science(), vol 9886. Springer, Cham. https://doi.org/10.1007/978-3-319-44778-0_15

Download citation

DOI: https://doi.org/10.1007/978-3-319-44778-0_15
Published: 13 August 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-44777-3
Online ISBN: 978-3-319-44778-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics