Keywords

1 Introduction

The World Wide Web is perfectly adapted for use by people in developed countries. It is visual, text-based, and mainly written in English or other world languagesFootnote 1. The Web depends on the availability of computers, datacenters, glass fiber backbones, fixed and wireless networks, 3G mobile telephony and transport of large volumes of data at high speed. In remote rural areas in many developing countries, conditions are different. Poor infrastructure, lack of equipment, low levels of literacy, and use of under-resourced local languages, seriously hamper the access to the Web for many people.

There is a general consensus that the global Information Society must benefit all people in the world. The United Nations Millennium Declaration contains a commitment for developing a people-centered, inclusive and development-oriented Information Society so that people everywhere can create, access, utilize and share information and knowledge to attain the internationally agreed development goals and objectives, including the Millennium Development Goals. Footnote 2

Yet, in many rural regions in Africa community radio is the only source of information. People have radios at home and listen to programs broadcast in local languages every day. Many people have access to simple voice-based (2G) mobile phone, but text messaging is hardly used [2].

The availability of both mobile phone and radio is opening opportunities for new services. E.g. radio listeners phone to the radio station and leave voice messages that they want to have broadcasted, or react to popular radio programs leaving news, opinion, regional information etc. Community radio here operates as an important local information hub, where people bring information for further dissemination.

Radio stations in rural areas in Africa operate under harsh conditions. Only the largest and state financed radio stations have a computer and an internet connection. Due to lack of funds many radio stations still use old-fashioned, analogue equipment, such as tape recorders. Yet, it is in the line of expectation that more and more radio stations will have computers and an internet connection in the coming years.

In the current situation the information broadcast by the community radio is volatile: it is not stored and kept for later access or re-use. Radios do not have means to manage, reuse and index this voice-based content.

In this paper we introduce a radio platform as a new interface to the Web. It enables management of radio content in an efficient way, making it accessible and searchable, so that it can serve a broad audience, e.g. Africans in the diaspora, who want to have news from their home villagesFootnote 3.

Additional, the voice-based radio content on this radio platform might be linked to other data sources on the Web, enabling community radios in Africa to become an interface to the Web of Data. An example of a system that manages market information based on Linked Data principles and produces voice-output as broadcasts for African community radios, is described by De Boer et al. [3]. In the future new applications providing locally relevant information from the Web of Data, such as pluviometric data, agricultural data, market prices etc. might become available through the radio platform.

The radio platform described in this paper not only facilitates production, consumption and management of voice-and web-based radio content, but it also enables access to the Web for people who do not have a computer or the Internet.

Contributions of this paper are:

  • A radio platform with both a web and a voice-based mobile interface that allows content creation, retrieval and indexing of spoken radio content.

  • African community radio, introduced as a new interface to the Web of Data

This paper is structured as follows. In Sect. 2 we describe related work. In Sect. 3 the architecture of the radio platform is described, the use cases collected from three different radios in Mali, as well as the principles used to manage the content. In Sect. 4 we describe challenges related to the organization of the voice-based radio content. In Sect. 5 we discuss future work that must be done on the Web of Radios, including the sustainability aspects.

2 Related Work

Related work on the development of a similar platform was done in the Freedom FoneFootnote 4. Freedom Fone is a project initiated by The Kubatana Trust of Zimbabwe, a civil and information activist platform from Zimbabwe. Freedom Fone is open source software for creating audio content using phone. Freedom Fone provides a voice platform similar to the basic setup proposed in this paper, but without the Linked Data enabled data management.

Research on speech recognition started in the 1930s and resulted in commercial deployments of voice-based services in the 1970s. Major achievements on language recognition, mainly for English, took place in the 1980s and 1990s and culminated in the development of VoiceXML by the W3C Voice Browser group, in 1999, facilitating and standardizing the development of voice applications [4].

Sheetal Agarwal et al. from IBM Research India, developed a system to enable authorship of voice content for 2G phone in a web space, they named the WWTW or World-Wide Telecom Web. The system is not connected to the Web, therefore not allowing access by third party search engines. The system represents a closed web space, within the phone network. Especially the lack of open search possibility constrains its growth [5].

From Burundi a system has been reported [6] to use tagging software and multimedia mobile data collection. The software is named EthnoCorderFootnote 5. The NGO that co-developed this app was Help Channel Burundi. However, because of the current unavailability of multimedia devices in the given rural context, this technical solution may be still out of reach of community radio stations targeted in this study.

A related project on the Semantic XO and Linked Data for developing countries is described by Guéret et al. [7]. The Semantic XO is a system that connects rugged, low-power, low-cost robust small laptops (aka the XO promoted by the One Laptop Per Child organization) for the empowerment of poor communities, based on Linked Data principles in order to publish previously unpublished data.

De Boer et al. [3] describe a distributed voice- and web-based market information system, named Radio Marché, aimed at stimulating agricultural trade in rural areas of Africa. This system connects to regionally distributed market information systems, using Linked Data approaches.

3 The Radio Platform

The design of the radio platform described in this section is based on extensive use case and requirements analysis, performed in Mali, with the collaboration of radio journalists from community radio stations. The research was done as part of the Foroba BlonFootnote 6 projectFootnote 7, funded by the International Press Institute, and the VOICES projectFootnote 8, partially funded by the EU, within the 7th Framework Programme. The Foroba Blon project is aimed at supporting and promoting citizen journalism in developing countries. The VOICES project is aimed at developing innovative mobile voice services to support users in underprivileged communities in African countries (Fig. 1).

Fig. 1.
figure 1

Conceptual design of the radio platform as a voice-interface to the Web for people who are out of reach of computers and the Internet, but do have phone or radio.

3.1 Operation of Community Radios in Mali

In Mali many community radio stations exist. Some are state funded and connected to the national broadcasting service ORTM (Office Radio Télévision du Mali). Others are privately funded or completely self-supportive. According to their business, funding scheme, size and location some radio stations do have computers and internet, some have computers without internet connection and some do not have any computer facilities at all. All these radio stations are situated within the coverage area of mobile telephony.

The Malian community radios have large bases of listeners and the radius of coverage ranges between 100 and 200 km. These radio stations create their own programs and broadcast local and regional news, music, informative programs, round table programs and paid announcements. Two radio station stations are involved in the projects described in this paper. These are: Radio ORTM Ségou, a state owned radio, that has computers and a 2 Mbps fixed line (DSL) internet connection. Radio ORTM Ségou broadcasts programs in French and Bambara, the most widely spoken language in Mali.

Fig. 2.
figure 2

Presenter Radio Moutian.

The second radio station is Radio Moutian, in Tominian, see Fig. 2. This radio is independent and its funding is based on paid airtime for announcements and private gifts from third parties. Radio Moutian has a computer but no internet connectivity. Programs are mainly broadcasted in Bomu, a local language fro the Tominian region. The third radio is Radio Seno in Bankass. This radio is independent from the Malian state and has only analogue equipment. There are no computers, there is no internet connection here, but the radio has many listeners in the region around Bankass. The main language spoken here is Dogon. The activities of the three above mentioned radio stations are related to three types of end-users or customers:

  • NGOs that buy airtime to broadcast public announcements about informative and educational topics, such as agriculture and public health information. This type of service is usually based on fixed monthly subscriptions to airtime for recurring broadcasts.

  • Non-commercial listeners from the region, who buy a few minutes of airtime and pay a broadcast fee per minute airtime. The information is usually brought to the radio, or communicated via phone and subsequently written down on paper by the radio staff. Some listeners call in on a given time slot (one hour per week) and leave a short voice message (few seconds only) as a reaction to a program that was broadcast on a certain popular topic. These messages are named letters to the editors (LTE).

  • Journalists or trusted village reporters that phone to the radio and leave local news or interviews on a regular base. In the current situation, all incoming phone calls are attended by a radio staff member and annotated in tabular form on paper.

Fig. 3.
figure 3

Current situation: incoming messages are registered in a caller log.

3.2 The Radio Platform Architecture

The proposed radio platform, which we named Foroba Blon (FB), consists of a data store containing recorded voice messages and related meta-information. FB will replace the existing caller log, see Fig. 3.

The interface to the FB radio platform for entering new content is either through mobile phone or via the web. Users of the mobile interface are the listeners from the region, who enter letters to the editor (LTE). These people only have mobile phones but no access to the Internet. Their calls are answered by the system with a pre-recorded welcome message in a local Malian voice inviting them to leave their message. For the sake of user-friendliness, the user interface and the dialogue for this category of users is kept as short and simple as possible, since the expected callers will be unfamiliar with interactive voice response systems and may not respond to a complex computer-generated dialogue asking to press buttons, etc.

Fig. 4.
figure 4

Architceture of Foroba Blon, showing the core system and its connections to the Semantic Web and the Voice Web

Another category of users of FB are the trusted reporters calling from the field, and also using the mobile interface. They phone in and leave their spoken report for broadcasting. These users are previously registered, having their phone number, name, address and preferred language in FB. These users will be trained to navigate the voice-menu, and use the IVR system, asking to press a button on the phone to confirm or answer a question about their current location, subject of the message, etc. The FB system always answers the registered caller in his/her preferential language (Fig. 4).

The voice messages are stored as audio files in the FB data store, together with meta-information being the date and time of the call, the length of phone call in seconds, the phone number of the caller. Messages from trusted users are linked to the owner, his/her address, and his/her preferred language. For all users of the system, confidentiality and anonymity will be ensured, according to the broadcast policies used by the radio stations in Mali.

The FB radio platform also has a normal web interface, where internet-connected end-users/customers can access and upload a voice message. Depending of their customer relationship to the radio, they can login to the radio-platform as (i) registered users such as NGOs, and trusted reporters, or (ii) as unregistered users. There is an option to sign up and create a user account by registering the name, phone number, village and preferred language. Unregistered users can access former broadcasts since this is public information.

For the radio user, FB provides a web-based interface, enabling them to manage the data in the data store. It provides a file list where they can access, listen, broadcast, delete files, and add/update/delete meta-information, see Fig. 5.

The radio station that has no computer nor internet, only has a limited interface to the radio platform, since this is the constraint of a voice interface. The radio user receives a welcome message asking if she wants to hear the last ten messages, or if she wants to manage the welcome messages to the end-users. The FB radio platform is hosted either locally, on a stand-alone computer, or in the cloud. The FB consists of a voice platform running an open source web server and a local voice browser that handles the voice interaction. The FB radio platform uses a GSM gateway device, e.g. OfficeRoute,Footnote 9 a device that handles incoming and outbound calls and streams the voice messages to and from the phone.

Fig. 5.
figure 5

Screenshot of the web-based interface of Foroba Blon for managing audio content generated by citizens

The FB radio platform could in theory be physically hosted anywhere in the world, on any web server, connected to the Internet. However, in this actual case in rural regions of Mali, this is not possible. Firstly, the radio platform has to be accessible using an inexpensive local Malian phone number, so it must be connected to a Malian phone network. Secondly, the web service accessed over the Internet must also be accessible locally. Since the internet connectivity in Mali is usually of low bandwidth and of high latency, voice web services hosted in datacenters in the US or Europe, are too slow for proper deployment in Mali. For these two reasons, the system has to be preferably hosted locally in Mali. In the absence of good and reliable datacenters or hosting providers in Mali, the radios can decide to deploy the FB radio platform on a local computer at their own premises. Obviously before this can be done, the radio staff members have to be trained how to do operational maintenance of the FB platform, and especially how they can cope with frequent power outages, and bring the system back to a consistent state.

3.3 Semantic Datamodel

Figure 6 shows the semantic model used in Foroba Blon system. We use ‘fb’ as abbreviation of the Foroba Blon namespace. We reuse existing schemas Dublin Core (dct) and FOAF (foaf) as well as the GeoNames dataset (geonames).

The main class is the fb:recording class. An instance of this class is linked to the URI of the actual audio file using the fb:audiofile property. The recording class has two literal properties: dc:created for the recording time and the fb:caller_id which is associated to the phone number of the caller. Both are extracted automatically by the system at recording time.

A recording is related to a specific radio station, which has a phone number predicate and uses FOAF properties to list its name and location. For the location, we use GeoNames entities where possible. If specific villages are not present in GeoNames, we provide our own resources, mapped to the GeoNames hierarchy.

A recording is also related to an instance of fb:Annotation. This instance holds all the information added by an radio station editor. The current version of the interface allows the editor to enter free text annotations in a number of fields. Through this, the system will gather persons, subjects, locations, products etc. associated with the radio message. We will enrich this free text data and map it to structured vocabularies, to be curated by the radio editors. Specific type of concepts will be linked to existing linked data sources (places to GeoNamesFootnote 10, product types to AgrovocFootnote 11 etc.). Figure 6 shows the design of the semantic model where this is the case.

The annotation lists the caller as a foaf:Person as well as the location from which the call originates as a GeoNames resource. The Dublin Core dct:type property is used to denote the type of the call (e.g., Announcement, Request, News Item, ...), which we model as SKOS concepts. The property dct:subject is used to further classify the subject of a call. This might be a literal value or (as is the case in Fig. 6, a SKOS concept. The radio editor is noted as the fb:annotator of the annotation which is also linked to a radio station. The annotation also has a separate automatically stored creation datetime. Lastly, the fb:comment predicate is used to link free text comments to the recording.

Fig. 6.
figure 6

Semantic Datamodel as used in the Foroba Blon system. The image shows a part of the semantic graph. Ellipses denote resources with their classes (italicized).

4 Organizing the Radio Content

The next challenge is how to manage the spoken content of un-resourced languages such as Bomu, Bambara and Dogon. Since up to present no interactive voice response (IVR) systems exist for these languages, the voice-based content cannot be indexed by conventional search engines. Therefore collecting as much meta-information as possible is essential. Very simple ways of indexing the messages are based on owner (known through phone number) automatic language recognition, time slot, (e.g. all messages collected on January 13 between 10 and 11 a.m. are related to the radio program on harvesting shea nuts). The radio journalist can manually enter meta-information such as keywords, village region, language, name or any other attribute to an audio file using her radio-web interface. In the future existing tagging systems such as EthnoCorder may be considered, to facilitate meta-data collection.

4.1 Linked Data Sharing and Re-use

Initially, the data of the radio platform will be only used locally, but we explicitly designed the semantic datastore to ensure sharing and reuse of the data collected in the different Foroba Blon instances. We specifically envision reuse (1) across different instances of Foroba Blon (for different radio stations); (2) across information services, specifically the RadioMarché platform and (3) through aggregations of data, usable by third parties.

  1. 1.

    We are now in the process of installing Foroba Blon in two radio stations in the Tominian region in Mali. For specific recording types and for specific themes, sharing submissions can be of great value. Specifically, we envision that the sharing and spreading of local news reports can provide an efficient form of citizen journalism. The spreading of spoken news items in this way is akin to that of Twitter messages. In the region there are a great number of languages and dialects (57 in Mali alone). Linked Data is specifically well-suited to deal with multiple languages as its core concepts are resources rather than textual terms. A single resource, identified by a URI (i.e. http://example.org/shea_nuts) can have multiple labels (e.g. Shea Nuts@en and Amande de Karité@fr).

  2. 2.

    The Foroba Blon data will be linked to the data of Radio Marché. This platform is based on the same technology set and linked data principles as Foroba Blon to share and spread market information. As Foroba Blon will target mainly rural agricultural areas, we expect that much of the information and the stakeholders will overlap with that of RadioMarché. Specifically, the data will be linked through places (GeoNames), people (FOAF) and themes (SKOS concepts). These links can then be exploited by both platforms to enhance or augment the information provided. RadioMarché and Foroba Blon are only two examples. We are working towards an ecosystem of commercial and journalistic services running on the same radio platform where data is shared and reused across services.

  3. 3.

    Voice recordings can not only be shared across radiostations but also with other datasources and applications on the web. One possibility we are pursuing in the project is to open up certain messages to web users world wide through a web log. On this blog, people with access to Web infrastructure can listen to the journalist reports or announcements. Specifically, we foreseee that expatriates originating from the radio station’s area will be interested in reports from their region of origin.

By exposing the produced data as linked data, we do not only open the possibility of expected, but also unexpected reuse. At all times, privacy issues related to the voice messages will be taken into account.

If the Foroba Blon radio platform proves to be a success, other instances of Foroba Blon may be installed at local radio stations in Mali, across borders, in neighbouring countries, Burkina Faso, Ghana, Senegal, Guinée where conditions with regards to illiteracy, local languages, mobile telephony and community radio are similar to those in Mali. This will create a Web of African community radios that are linked to each other and that will eventually become part of the Web of Data.

4.2 Organize an Open Source Community of Developers to Create Applications for the Radio Platform

In the VOICES and Foroba Blon projects one instance of the radio platform is developed by a small team of developers, in collaboration with end-usersFootnote 12 sponsored by the International Press Institute as a pilot project. However, to enable further development of the radio platform, and to expand the scale of the web of radios, it is important to look at new ways of production and consumption of data and services. African community radios operate in a low-income region where the sustainability of a system relies on the underlying business model. Community radios do not have enough earnings to invest in new systems, and their listeners-base is large, but poor. Application development will therefore to be organized in a cost-effective way. We propose to organize an open source community of developers and to rely on commons-based peer production for the development of applications that will open the Web of Data to radio using voice-modality.

5 Discussion and Future Work

From this paper it becomes clear that the Web of African Radios can only emerge as an interface to the Web of Data, when sufficient applications are built, that link voice-based content. For the navigation of voice menus and other voice-based dialogues small subsets of the local languages such as Bambara and Bomu have to be recorded and resourced using time-consuming techniques and efforts. The user interfaces have to be extensively tested and validated with end-users in the local situation, since these are culturally sensitive topics. For the resourcing of more local languages crowd-sourcing techniques may be applied. The issue of meta-information is another important topic. In the model presented for the FB radio platform in this paper, only a small amount of meta-data is collected. When the repositories of spoken content start to become larger, new innovative ways of describing spoken content have to be developed.

The annotation of the audio recordings will initially be done by radio employees. However, within the Web for Regreening in Africa initiativeFootnote 13, we are currently developing Text To Speech (TTS) and Automated Speech Recognition (ASR) libraries for local dialects of French as well as local languages Bambara and Bomu. The TTS will allow us to a generate voice prompts in a dynamic way for the IVR system. The ASR system will be employed to automatically recognize parts of the audio content submitted by callers. This will further reduce the burden on the annotator. For larger languages such as English ASR is already used for automated call handling.

To contribute to a critical mass of content and applications that are necessary in this rural domain, a socio-technical network has to be put in place, that must be supported by a community of contributors: web developers, listeners that provide meta-information, local ICT-entrepreneurs, people who are willing to produce and consume data. According to Kazman and Hong-Mei Chen [9] organizing a community of developers around an open source service requires a consolidated kernel infrastructure, allowing peripheral services to be created by a de-centralized community of developers. Specific social and technical mechanisms are needed to ensure long-term participation and to encourage community engagement. In this case this is justified by the aim to open the Web of Data to people who are out of reach of computers and the internet.