Providing Content Aware Enterprise Communication Services

  • Xiaotao Wu
  • Krishna Dhara
  • Venkatesh Krishnaswamy
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5310)


Intelligent communication often requires context information to trigger proper communication services. In many cases, names, locations, as well as activities and status of communication participants are used to enable context-aware communication. In this paper, we propose a new context-aware communication paradigm, namely content-aware communication, which infers context information based on the content of ongoing conversations. New communication services can then be introduced by utilizing the content and the inferred information.

Content-aware communication employs Automatic Speech Recognition (ASR) to acquire conversation content, Information Extraction (IE) to help identify useful context information, and Information Retrieval (IR) to find related information. Existing ASR and IR technologies can already enable content-aware communication for a single user on his or her personal computer. However, very few content-aware voice communication services exist, and there is still no secure and scalable way to integrate different technologies and resources for enterprise-wide deployment of such services. In this paper, we first categorize enterprise content-aware communication services and illustrate some new content-aware services. We then define an architecture with distributed media processing and centralized call control to manage enterprise content-aware communications. This architecture also helps manage feature interactions when integrating content-aware services with other enterprise communication features. In addition, we allow enterprise users to experience content-aware communication on different devices and in different modalities.
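The ASR, IE, and IR stages named above can be read as a three-step pipeline. The following is only a minimal sketch of that flow, not the authors' implementation: every name in it (`transcribe`, `extract_terms`, `retrieve`, `content_aware_lookup`, `Document`) is hypothetical, the "audio" is already text, and each stage is a toy stand-in for a real engine.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Document:
    """Hypothetical enterprise document with hand-assigned keywords."""
    title: str
    keywords: frozenset


def transcribe(audio_chunk: str) -> str:
    """Stand-in for ASR: the 'audio' here is already text, so just normalize it."""
    return audio_chunk.strip().lower()


def extract_terms(transcript: str, vocabulary: set) -> set:
    """Stand-in for IE: keep only words that appear in a known vocabulary."""
    words = {w.strip(".,?!") for w in transcript.split()}
    return words & vocabulary


def retrieve(terms: set, corpus: list) -> list:
    """Stand-in for IR: rank documents by the number of shared keywords."""
    scored = [(len(terms & doc.keywords), doc.title) for doc in corpus]
    ranked = sorted(scored, key=lambda pair: (-pair[0], pair[1]))
    return [title for score, title in ranked if score > 0]


def content_aware_lookup(audio_chunk: str, vocabulary: set, corpus: list) -> list:
    """Chain the three stages: ASR -> IE -> IR."""
    transcript = transcribe(audio_chunk)
    terms = extract_terms(transcript, vocabulary)
    return retrieve(terms, corpus)


if __name__ == "__main__":
    corpus = [
        Document("Q3 budget report", frozenset({"budget", "q3"})),
        Document("Hiring plan", frozenset({"hiring"})),
    ]
    vocabulary = {"budget", "q3", "hiring"}
    print(content_aware_lookup("Can you send me the Q3 budget?", vocabulary, corpus))
```

In an enterprise deployment of the kind the abstract describes, each stage would presumably run as a separate service behind the call-control layer; the sketch keeps them as plain functions only to show the order of the data flow.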


Keywords: Session Initiation Protocol, Application Server, Automatic Speech Recognition, Interactive Voice Response, Audio Stream
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.





Copyright information

© Springer-Verlag Berlin Heidelberg 2008

Authors and Affiliations

  • Xiaotao Wu (1)
  • Krishna Dhara (1)
  • Venkatesh Krishnaswamy (1)
  1. Avaya Labs Research, Lincroft, USA
