Implementing LOD Surfer as a Search System for the Annotation of Multiple Protein Sequence Alignment

Yamaguchi, Atsuko; Toh, Hiroyuki

doi:10.1007/978-3-030-04284-4_29

Atsuko Yamaguchi¹⁹ &
Hiroyuki Toh²⁰

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11341))

Included in the following conference series:

Joint International Semantic Technology Conference

821 Accesses
1 Citations

Abstract

Many life science databases have been provided as Linked Open Data (LOD). To promote the utilization of these databases, we had developed a method that can be referred to as LOD Surfer, that employed federated query search along a path of class–class relationships. In this study, we developed a specified version of the LOD Surfer for the annotation of multiple protein sequence alignment. The system comprised a web application programming interface (API) and a client system for the API. The web API provides a list of classes, and a list of paths between the classes that are specified by a user. The client presents the list of classes and the list of paths obtained from the API and assists a user in selecting classes and paths to acquire the required annotation of proteins. Additionally, the client system generates SPARQL queries to execute a federated query search for a selected path. During the development of the system, we can observe that (1) the client system should display some instances with human readable information because class selection is not an easy task for biological researchers, and (2) it is preferable that the client system stores paths that are selected by a user for reuse by other users because path selection may be time consuming at times and because the selected paths may be valuable for other researchers.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Heath, T., Bizer, C.: Linked Data: Evolving the Web into a Global Data Space. Synthesis Lectures on the Semantic Web: Theory and Technology, 1st edn., vol. 1, no. 1, pp. 1–136. Morgan & Claypool (2011)
Google Scholar
Heim, P., Hellmann, S., Lehmann, J., Lohmann, S., Stegemann, T.: RelFinder: revealing relationships in RDF knowledge bases. In: Chua, T.-S., Kompatsiaris, Y., Mérialdo, B., Haas, W., Thallinger, G., Bailer, W. (eds.) SAMT 2009. LNCS, vol. 5887, pp. 182–187. Springer, Heidelberg (2009). https://doi.org/10.1007/978-3-642-10543-2_21
Chapter Google Scholar
Yamaguchi, A., Kozaki, K., Yamamoto, Y., Masuya, H., Kobayashi, N.: Semantic graph analysis for federated LOD surfing in life sciences. In: Wang, Z., Turhan, A.-Y., Wang, K., Zhang, X. (eds.) JIST 2017. LNCS, vol. 10675, pp. 268–276. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-70682-5_18
Chapter Google Scholar
Larkin, M.A., et al.: Clustal W and Clustal X version 2.0. Bioinformatics 23(21), 2947–2948 (2007)
Article Google Scholar
Nakamura, T., Yamada, K.D., Tomii, K., Katoh, K.: Parallelization of MAFFT for large-scale multiple sequence alignments. Bioinformatics 34(14), 2490–2492 (2018)
Article Google Scholar
Yamaguchi, A., Kozaki, K., Lenz, K., Yamamoto, Y., Masuya, H., Kobayashi, N.: Semantic data acquisition by traversing class–class relationships over linked open data. In: Li, Y.-F., et al. (eds.) JIST 2016. LNCS, vol. 10055, pp. 136–151. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-50112-3_11
Chapter Google Scholar
Yamamoto, Y., Yamaguchi, A., Splendiani, A.: YummyData: providing high-quality open life science data. Database, 2018 (2018). https://doi.org/10.1093/database/bay022

Download references

Acknowledgments

The authors would thank members of LOD Surfer project including Kouji Kozaki, Osaka University, Norio Kobayashi and Hiroshi Masuya, RIKEN, and Yasunori Yamamoto, DBCLS, for valuable discussion. This work was supported by JSPS KAKENHI grant numbers 17K00434 and by the National Bioscience Database Center (NBDC) of the Japan Science and Technology Agency (JST).

Author information

Authors and Affiliations

Database Center for Life Science (DBCLS), Research Organization of Information and Systems (ROIS), 178-4-4 Wakashiba, Kashiwa, Chiba, 277-0871, Japan
Atsuko Yamaguchi
Department of Biomedical Chemistry, School of Science and Technology, Kwansei Gakuin University, 2-1 Gakuen, Sanda, Hyogo, 669-1337, Japan
Hiroyuki Toh

Authors

Atsuko Yamaguchi
View author publications
You can also search for this author in PubMed Google Scholar
Hiroyuki Toh
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Atsuko Yamaguchi .

Editor information

Editors and Affiliations

National Institute of Informatics, Tokyo, Japan
Ryutaro Ichise
Accenture Labs, Dublin, Ireland
Freddy Lecue
Japan Science and Technology Agency, Tokyo, Japan
Takahiro Kawamura
Peking University, Beijing, China
Dongyan Zhao
Imperial College London, London, UK
Stephen Muggleton
Osaka University, Osaka, Japan
Kouji Kozaki

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yamaguchi, A., Toh, H. (2018). Implementing LOD Surfer as a Search System for the Annotation of Multiple Protein Sequence Alignment. In: Ichise, R., Lecue, F., Kawamura, T., Zhao, D., Muggleton, S., Kozaki, K. (eds) Semantic Technology. JIST 2018. Lecture Notes in Computer Science(), vol 11341. Springer, Cham. https://doi.org/10.1007/978-3-030-04284-4_29

Download citation

DOI: https://doi.org/10.1007/978-3-030-04284-4_29
Published: 14 November 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-04283-7
Online ISBN: 978-3-030-04284-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics