Towards Multimodal Capture, Annotation and Semantic Retrieval from Performing Arts

Kannan, Rajkumar; Andres, Frederic; Ferri, Fernando; Grifoni, Patrizia

doi:10.1007/978-3-642-22726-4_10

Towards Multimodal Capture, Annotation and Semantic Retrieval from Performing Arts

Rajkumar Kannan⁶,
Frederic Andres⁷,
Fernando Ferri⁸ &
…
Patrizia Grifoni⁸

Conference paper

1827 Accesses

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 193))

Abstract

A well-annotated dance media is an essential part of a nation’s identity, transcending cultural and language barriers. Many dance video archives suffer from tremendous problems concerning authoring and access, because of the multimodal nature of human communication and complex spatio-temporal relationships that exist between dancers. A multimodal dance document consists of video of dancers in space and time, their dance steps through gestures and emotions and accompanying song and music.This work presents the architecture of an annotation system capturing information directly through the use of sensors, comparing and interpreting them using a context and a user’s model in order to annotate, index and access multimodal documents.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Ann Hutchinson, G.: Dance Notation: Process of recording movemen. Dance Books, London (1984)
Google Scholar
Chitra, D., Manthe, A., Nack, F., Rutledge, L., Sikora, T., Zettl, H.: Media Semantics: Who needs it and why? In: Proceedings of ACM Multimedia, pp. 580–583 (2002)
Google Scholar
Herbison, D., Evans: Dance, Video, Notation and Computers. Leonardo 21(1), 45–50 (1988)
Article Google Scholar
George, P.: Computers and Dance: A bibliography. Leonardo 23(1), 87–90 (1990)
Article Google Scholar
Calfert, T.W., Chapman, J.: Notation of movement with computer assistance. In: Proceedings of ACM Annual Conference, pp. 731–736 (1978)
Google Scholar
Hatol, J., Kumar, V.: Semantic representation and interaction of dance objects. In: Proceedings of LORNET Conference, Poster (2005)
Google Scholar
Hachimura, K.: Digital archiving of dancing. Review of the National Center for Digitization 8, 51–66 (2006)
Google Scholar
Hattori, M., Takamori, T.: The description of human movement in computer based on movement score. In: Proceedings of 41st SICE, pp. 2370–2371 (2002)
Google Scholar
Calaban: (2002), http://www.bham.ac.uk/calaban/frame.htm
Bimas, U., Simon, W., Peter, R.: NUNTIUS: A computer system for the interactive composition and analysis of music and dance. Leonardo 25(1), 59–68 (1992)
Article Google Scholar
Led & Linter: An X-Windows Editor / Interpreter for Labanotation (2006), http://wwwstaff.it.uts.edu.au/don/pubs/led.html
MacBenesh: Behesh notation editor for Apple Macintosh (2004), http://members.rogers.com/dancewrite/macbenesh/macbenesh.htm
Ilene, F.: Documentation Technology for the 21st Century. In: Proceedings of World Dance Academic Conference, pp. 137–142 (2000)
Google Scholar
Kalajdziski, S., Davcev, D.: Augmented reality system interface for dance analysis and presentation based on MPEG-7. In: Proceedings of IASTED Conference on Visualization, Imaging, and Image Processing, pp. 725–730 (2004)
Google Scholar
Forouzan, G., Pegge, V., Park, Y.C.: A multimedia information repository for cross cultural dance studies. Multimedia Tools and Applications 24, 89–103 (2004)
Article Google Scholar
Athanasios, C., Gkoritsas, Marios, C.A.: COSMOS-7: A video content modeling framework for MPEG-7. In: Proceedings of IEEE Multi Media Modeling, pp. 123–130 (2005)
Google Scholar
IBM VideoAnnEx (2002), http://www.alphaworks.ibm.com/tech/videoannex
Tra-Thusng, T., Roisin, C.: Multimedia modeling using MPEG-7 for authoring multimedia integration. In: Proceedings of ACM Multimedia Information Retrieval, pp. 171–178 (2003)
Google Scholar
Ryn, J., Sohn, J., Kin, M.: MPEG-7 metadata authoring tool. In: Proceedings of ACM Multimedia, pp. 267–270 (2002)
Google Scholar
Haoran, Y.I., Rajan, D., Liang-Tien, C.: Automatic generation of MPEG-7 complaint XML document for motion trajectory description in sports video. Multimedia Tools and Applications 26(2), 191–206 (2005)
Article Google Scholar
Rajkumar, K., Andres, F., Guetl, C.: DanVideo: A Mpeg7 Authoring and Retrieval System for Dance Videos. Multimedia Tools and Applications 46(2), 545–572 (2009)
Google Scholar
Devillers, L., Vidrascu, L., Lamel, L.: Challenges in real-life emotion annotation and machine learning based detection. Neural Networks 18, 407–422 (2005)
Article Google Scholar
Popescu-Belis, A.: Managing Multimodal Data, Metadata and Annotations: Challenges and Solutions. In: Thiran, J.-P., Marques, F., Bourlard, H. (eds.) Multimodal Signal Processing for Human-Computer Interaction, pp. 183–203. Elsevier/ Academic Press (2009)
Google Scholar
Callejas, Z., Lòpez-Còzar, R.: Influence of contextual information in emotion annotation for spoken dialogue systems. Speech Communication (2008), doi: 10.1016/j.specom, 01.001
Google Scholar
Yu, C., Zhou, J., Riekki, J.: Expression and Analysis of Emotions: Survey and Experiment. In: Symposia and Workshops on Ubiquitous, Autonomic and Trusted Computing, UIC-ATC, pp. 428–433 (2009)
Google Scholar
Harada, I., Tadenuma, M., Nakai, T., Suzuki, R., Hikawa, N., Makino, M., Inoue, M.: An Interactive and Concerted Dance System?? Emotion Extraction and Support for Emotional Concert. In: Fifth International Conference on Information Visualisation (IV 2001), vol. iv, p. 0303 (2001)
Google Scholar
Glowinski, D., Camurri, A., Volpe, G., Dael, N., Scherer, K.: Technique for automatic emotion recognition by body gesture analysis. In: 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, CVPRW, pp. 1–6 (2008)
Google Scholar
Grassi, M.: Developing HEO human emotions ontology. In: Fierrez, J., Ortega-Garcia, J., Esposito, A., Drygajlo, A., Faundez-Zanuy, M. (eds.) BioID MultiComm2009. LNCS, vol. 5707, pp. 244–251. Springer, Heidelberg (2009)
Chapter Google Scholar
Sorci, M., Antonini, G., Cruz, J., Robin, T., Bierlaire, M., and Thiran, J.: Modelling human perception of static facial expressions. Image Vision Comput. 28(5), 790–806 (2010), doi:http://dx.doi.org/ 10.1016/j.imavis. 2009.10.003
Google Scholar
Oviatt, S., Choen, P.: Perceptual user interfaces: multimodal interfaces that process what comes naturally. Comm. of ACM 43, 45–53 (2000)
Google Scholar
D’Ulizia, A., Ferri, F., Grifoni, P.: Generating Multimodal Grammars for Multimodal Dialogue Processing. IEEE Transactions on Systems, Man, and Cybernetics, Part A 40(6), 1130–1145 (2010)
Article Google Scholar
Mankoff, J., Abowd, G.D., Hudson, S.E.: OOPS: a toolkit supporting mediation techniques for resolving ambiguity in recognition-based interfaces. Computers & Graphics 24(6), 819–834 (2000)
Article Google Scholar
Caschera, M.C., Ferri, F., Grifoni, P.: Ambiguity detection in multimodal systems. In: Proc. AVI 2008, pp. 331–334 (2008)
Google Scholar

Download references

Author information

Authors and Affiliations

Bishop Heber College(Autonomous), Tiruchirappalli, India
Rajkumar Kannan
National Institute of Informatics, Tokyo, Japan
Frederic Andres
IRPPS-CNR, Rome, Italy
Fernando Ferri & Patrizia Grifoni

Authors

Rajkumar Kannan
View author publications
You can also search for this author in PubMed Google Scholar
Frederic Andres
View author publications
You can also search for this author in PubMed Google Scholar
Fernando Ferri
View author publications
You can also search for this author in PubMed Google Scholar
Patrizia Grifoni
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Machine Intelligence Research Labs (MIR Labs), Auburn, 98071-2259, Washington, USA
Ajith Abraham
Departamento de Comunicaciones, Universidad Politcnica de Valencia, 46071, Valencia, Spain
Jaime Lloret Mauri
Avaya Labs Research, Basking Ridge, NJ, USA
John F. Buford
University of Massachusetts, 100 Morrissey Blvd., 02125-3393, Boston, MA, USA
Junichi Suzuki
Rajagiri School of Engineering and Technology, Rajagiri Valley Kakkanad, 682 039, Kochi, India
Sabu M. Thampi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kannan, R., Andres, F., Ferri, F., Grifoni, P. (2011). Towards Multimodal Capture, Annotation and Semantic Retrieval from Performing Arts. In: Abraham, A., Mauri, J.L., Buford, J.F., Suzuki, J., Thampi, S.M. (eds) Advances in Computing and Communications. ACC 2011. Communications in Computer and Information Science, vol 193. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22726-4_10

Download citation

DOI: https://doi.org/10.1007/978-3-642-22726-4_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-22725-7
Online ISBN: 978-3-642-22726-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics