
1 Introduction

Augmented Reality (AR) and its applications have progressively gained the attention of both academia and industry, especially during the past two decades. AR works by placing virtual information or objects over the physical environment captured with a video camera, leading to a mixed reality that combines virtual and physical environments in a meaningful context [1]. The environment surrounding the user thus becomes interactive and can be manipulated digitally using AR technology [2].

AR-based tracking can be broadly categorized into two groups: marker-based and marker-less tracking [3]. Marker-less tracking uses feature-based or model-based approaches to calculate the camera’s pose and orientation, while marker-based techniques employ fiducial markers positioned in the real environment and tracked with a camera. These are usually passive markers with no electronics, printed with a variety of patterns on plain paper. ARToolkit [4], ARToolkit Plus [5], and ARTag [6] are a few popular marker-based AR systems.

Indoor navigation has always been challenging for visually impaired people carrying out their routine tasks. According to the World Health Organization’s fact file, about 285 million people have impaired vision [7]; of these, around 13% are completely blind and 87% have low vision. The white cane and guide dogs have been effective in many scenarios for helping blind people with mobility. The white cane works well for obstacles within its range, i.e. about a meter away. Guide dogs can assist in already known places, but are unacceptable in some societies [8].

Indoor positioning systems currently use various technologies for user localization. Wireless methods comprise GPS-based [9,10,11], infrared-based [8], NFC-based [12], Bluetooth-based [13, 14], and RFID-based [15, 16] techniques. A major drawback of these systems is that physical infrastructure must be installed in the target environment, e.g. Wi-Fi routers, RFID sensors, and Bluetooth beacons [17]. Even so, such solutions are prone to localization errors and inaccurate results [18]. In contrast, previous studies have shown that computer vision techniques can be effective for navigation systems and indoor positioning [19].

Despite the various approaches proposed by researchers, no existing application helps visually impaired people navigate easily inside large indoor buildings. The primary objectives of this research are:

  • An automated system for generating and augmenting paths in an indoor environment using marker-based computer vision methods with the help of a smartphone camera.

  • Development of an Android application that enables a visually impaired person to navigate easily in a large indoor environment using merely a smartphone.

2 Related Work

For outdoor navigation, GPS has been the de facto solution for positioning and user tracking. For indoor environments, however, no single technology has yet emerged to solve the problem. To address the challenge, various approaches have been proposed in the literature, and commercial solutions have been introduced in the market that utilize various sensors and hardware of the smartphone to localize the user’s current position. Such solutions include (a) dead reckoning systems, which employ the accelerometer, gyroscope, and magnetometer of the smartphone [20]; (b) received signal strength indication systems such as Wi-Fi, Bluetooth, and RFID; and (c) computer vision-based systems, which use the high computational capabilities and high-performance cameras of smartphones in either marker-based or marker-less approaches to calculate the user’s location and orientation in indoor environments.

The authors in [21] have proposed a marker-based navigation system using ARToolkit markers, which uses image sequences to locate the user’s position and overlays the user’s view with location information. The video stream, obtained from a camera mounted on the user’s head and connected to a tablet, is transmitted wirelessly to a remote computer, which performs ARToolkit marker detection, location recognition, and image sequence matching. This location information is then transmitted back to the user’s tablet. The system does not store any pre-defined map of the indoor environment, so the shortest path to the destination cannot be calculated. It relies heavily on a Wi-Fi network infrastructure deployed in the building for connecting to the remote server. The image recognition process is also very slow, since each input image is matched against a buffer of 64 images to calculate the user’s location.

In [22], the authors deployed ARToolkit markers at various positions in an indoor environment, detected using a camera attached to a laptop. The laptop displays a pre-defined 2D map of the indoor environment. A route planner algorithm calculates the current location of the user using a pre-defined matrix that represents the links between any two locations on the map. It assists the user both with an audio clip associated with the current location and by displaying navigational information over the video stream using AR techniques. The route planner algorithm cannot calculate the shortest path. Moreover, the user needs to carry a laptop with a connected camera.

Subakti and Jiang in [23] have used a combination of hardware and software to build a guidance and navigation system that lets new students experience the indoor building of a university. They used an HMD for the augmented display, an Android application for guidance and navigation, and microcontrollers deployed in the building for sensing light, temperature, and sound. BLE beacons are deployed at various locations in the building to broadcast location packets that the Android application senses. The system works in two modes: marker-based, using location-aware QR codes, and with invisible markers, using BLE packets for navigational purposes. The map of the building is created as a graph of BLE beacons and QR codes in which the shortest path can be calculated with Dijkstra’s algorithm [24]. The system works well, but its deployment requires a complex infrastructure of BLE beacons and microcontroller sensors.

Yin et al. [25] proposed a peer-to-peer (P2P) indoor navigation system that works without predefined maps or connectivity to a location service. Previous travelers record the path along which they navigate in a building and share it with new travelers. The existing path is merged with Wi-Fi measurements and other key points such as turns and stairs to create consolidated path information. A smartphone application, ppNav, assists a new user in following the path traces of the reference path generated by previous users. The authors of [26] used ultrasonic sensors and visual markers to assist blind users in navigating indoor environments. Obstacles are sensed with ultrasonic modules attached to a pair of glasses; an RGB camera is also attached to the glasses to detect markers in the environment. The map of the building is stored manually in the software, which makes it difficult to modify or edit paths.

Zeb et al. [8] developed a desktop application using the ARToolkit library to detect markers with a webcam attached to a laptop. Markers are deployed inside a building and their connectivity is entered manually as hardcoded entries in the application’s database, along with auditory information about each marker. A blind user can then navigate through the building by detecting the markers with the webcam and receiving audio information through headphones. The solution addresses the problem well but requires the user to carry a laptop. Moreover, hardcoding the path manually into the application makes it harder to extend or update the current path setup.

In [27], the authors developed an indoor navigation system comprising a laptop attached to the user’s back, an HMD for displaying augmented information, and a camera and inertial tracker attached to the user’s head. A wrist-mounted touchscreen device displays a UI for application monitoring and tracking. ARToolkit markers are deployed in the building, tracked by the head-mounted camera, and fed into the laptop for comparison against the pre-stored map of the building. The results are displayed on the HMD along with navigational aids using AR techniques. The system works well but is bulky, and under low-light conditions it does not identify the markers accurately. Map generation and storage also require manual coordinate editing.

Al-Khalifa and Al-Razgan in [28] developed a system named Ebsar, which uses a Google Glass connected to a smartphone to assist a visually impaired person in indoor navigation and positioning. The building is prepared with the help of a sighted person, called a map builder, who moves around the indoors of the building and explores different paths. The map builder marks every room, office, etc. with QR codes generated by Ebsar installed on a smartphone. The distance and direction between the QR codes are determined with the help of the smartphone’s accelerometer and compass sensors. All the information gathered is used to create a floor plan graph, with each node representing a checkpoint in the building such as a room, office, or stairs, and edges representing the number of steps and direction between checkpoints. The map is then uploaded to a central web server, which is available to any user with Ebsar installed on a smartphone. On first entering the building, the Google Glass worn by a visually impaired user detects the QR code, and the application automatically downloads the corresponding map file of the building to the user’s phone. The user can then use voice commands for both input and output of information about the current location. The system has been evaluated for performance and accuracy with several sighted and blind users, yielding acceptable results. However, it relies heavily on the smartphone’s accelerometer, which can introduce a margin of error in counting steps, and the user must constantly wear a Google Glass connected via Wi-Fi to the phone.

Another research effort [29] developed an indoor navigation system using a smartphone, custom 2D colored markers, and the accelerometer for step detection. Colored markers printed on plain paper are displayed at the entrance and other key intersection points inside the building. However, the exact position of each marker in the building has to be recorded offline, i.e. the system lacks automatic buildup of indoor paths. The distance between the markers is measured with the phone’s accelerometer. The system proves scalable and simple, yet has several drawbacks: poor detection of colored markers in low-light conditions, inability to work in multi-floor buildings, and inaccuracy in measuring steps using the accelerometer.

The authors in [30] propose an indoor navigation system using a smartphone, a newer version of Bluetooth known as Bluetooth Low Energy (BLE), and visual 2D markers. The building is split into multiple logical regions, each equipped with a BLE beacon. The visual markers, ArUco [31], are pasted on the floors of the building and detected by pointing the phone camera toward the floor. The location information decoded from a marker is used by the smartphone application, along with the beacon’s data, to localize the user in the environment. However, the markers are not inter-related; each provides information about the current position only. The system provides efficient and accurate positioning, but requires a beacon infrastructure deployed throughout the building, and no path calculation algorithm is proposed.

3 Proposed System

3.1 System Design

An indoor navigation system should be designed to ease the path generation and path augmentation processes, as well as to provide robust and accurate user localization in the environment. Such a system should also be flexible enough to support path editing and map extension.

With these goals in mind, we have designed a system that automatically detects fiducial markers and creates a floor plan, augments the markers with localization information, and at the same time provides an intuitive way to assist a visually impaired person in indoor navigation.

The system’s major function is to assist a visually impaired person with indoor navigation inside hospitals, universities, shopping malls, museums, and other large buildings. It facilitates navigation with auditory information augmented onto the real-world video stream. We have used ARToolkit markers printed on plain paper. These markers can be detected using an average-quality camera under the normal lighting conditions of an indoor environment [32]. The person has to carry a smartphone and headphones connected to the phone. The phone should have both a rear camera with a flashlight and a front-facing camera, and is installed with our indoor navigation application developed using the ARToolkit SDK (Fig. 1).

Fig. 1. (a) Path generation process. (b) Path augmentation process

3.2 Path Generation

The following steps are carried out to prepare markers that identify the paths inside a building.

  • The path generation process starts with registering ARToolkit markers in a library.

  • The required number of markers is prepared for all of the possible paths in a single-floor or multi-floor building.

  • The markers are printed on plain paper and pasted on the ceilings of the identified paths inside the building, i.e. in front of each point of interest such as a room, office, or lab.

  • When this is done, the user scans the building with the help of the Android application on a smartphone, detecting each marker with the phone camera.

  • As a marker is detected, a node for it is created in a graph data structure, storing information such as the marker’s unique identifier and its direction.

  • Upon detection of the next marker, the application connects it to the previously detected marker using an adjacency matrix, according to the following algorithm (a code sketch follows this list):

    • Suppose we have detected the first marker m1 and created its node in the graph.

    • Upon detection of the next marker m2, the application checks the angle θ between the y-axis of the camera and the y-axis of the marker.

    • If θ = 0°, m2 is straight ahead of m1.

    • If θ = 90°, m2 is to the right of m1.

    • If θ = 180°, m2 lies behind m1.

    • If θ = 270°, m2 is to the left of m1.

    • We take a ±45° range around each direction, because the camera’s y-axis need not be precisely aligned with the marker’s y-axis. For instance, to connect m2 straight ahead of m1, we accept any angle from −45° (i.e. 315°) to 45°.

    • Similarly, for connecting in the right-side direction we take the range 45° to 135°.

  • This way all of the hallways and corridors of the building are covered and a graph of the entire vicinity is built up in the application’s database.
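
To make the direction logic concrete, the following is a minimal sketch of the quantization and graph-linking step in Java. It assumes the marker identifier and the camera-to-marker angle θ are supplied by the marker detection callback; the names (MarkerGraph, quantize, connect) are illustrative, not taken from our implementation.

  // Minimal sketch of the direction quantization and graph-linking step.
  // The angle theta between the camera's y-axis and the marker's y-axis
  // is assumed to come from the ARToolkit detection callback.
  import java.util.HashMap;
  import java.util.Map;

  enum Direction { STRAIGHT, RIGHT, BACK, LEFT }

  class MarkerGraph {
      // adjacency: marker id -> (direction -> neighbouring marker id)
      private final Map<Integer, Map<Direction, Integer>> adjacency = new HashMap<>();

      // Quantize theta into one of four directions, each covering a +/-45 degree range.
      static Direction quantize(double thetaDegrees) {
          double t = ((thetaDegrees % 360) + 360) % 360;     // normalize to [0, 360)
          if (t >= 315 || t < 45) return Direction.STRAIGHT; // around 0 degrees
          if (t < 135)            return Direction.RIGHT;    // around 90 degrees
          if (t < 225)            return Direction.BACK;     // around 180 degrees
          return Direction.LEFT;                             // around 270 degrees
      }

      // Link a newly detected marker to the previously detected one.
      void connect(int prevId, int newId, double thetaDegrees) {
          adjacency.computeIfAbsent(prevId, k -> new HashMap<>())
                   .put(quantize(thetaDegrees), newId);
      }
  }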

3.3 Path Augmentation

After the path generation process is completed, another process, path augmentation, is carried out through the following steps:

  • A sighted operator with a mobile phone that has the application installed traverses the building again, holding the phone camera so as to capture a video stream of the ceiling and intersection points.

  • When a marker is detected, the application asks the operator for auditory information to be augmented with the marker. The operator records audio information for the marker and its corresponding location inside the building, such as Room No. 4, Office, or Lab.

  • Here the application also gives an option to add textual information for the detected marker, which the application can translate into other languages when desired by the user.

  • This way all of the hallways and corridors of the building are traversed and the application database is populated with auditory and textual information for all of the markers.
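
As a minimal illustration of this step, the sketch below attaches the recorded audio clip and the optional text label to the node of the detected marker; all names (AugmentedNode, PathAugmenter, augment) are hypothetical and only convey the data flow, not our actual implementation.

  // Hypothetical sketch: attaching auditory and textual information
  // to the nodes created during path generation.
  import java.util.HashMap;
  import java.util.Map;

  class AugmentedNode {
      final int markerId;
      String audioClipPath;  // e.g. a recording saying "Room No. 4"
      String label;          // textual info, translatable on demand

      AugmentedNode(int markerId) { this.markerId = markerId; }
  }

  class PathAugmenter {
      private final Map<Integer, AugmentedNode> nodes = new HashMap<>();

      // Called when the operator records information for a detected marker.
      void augment(int markerId, String audioClipPath, String label) {
          AugmentedNode node = nodes.computeIfAbsent(markerId, AugmentedNode::new);
          node.audioClipPath = audioClipPath;
          node.label = label;
      }
  }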

4 Technical Assessment and Discussion

For testing the system and the proposed algorithms for path generation and path augmentation, we designed several experiments. The experiments were carried out on the first floor of the Academic Block, University of Malakand. The actual floor plan of the selected building is shown in Fig. 2.

Fig. 2. Floor plan of the selected building and markers deployment

4.1 Path Selection

We selected four different paths for testing the path generation and path augmentation algorithms. Path-1 passes linearly along the corridor of the CS department from the Research Lab to the HOD office, while the other paths were selected according to the marker distribution in the department hallway, as shown in Fig. 3.

Fig. 3. Distribution of markers in the building and path directions

4.2 Experiment 1 – Path Generation

The primary objective of this experiment is to find the average time taken to detect markers using the smartphone camera, identify them, and connect them with each other to define a pathway inside the building. We also check how accurately the interconnections between the detected markers match the actual deployment of the markers inside the building.

The time taken to scan each path and subsequently generate its graph in the application’s database is shown in Table 1.

Table 1. Time taken in path generation

On comparing the floor graph created by the application for each path with the actual path inside the building, no errors were found in the marker interconnections.

4.3 Experiment 2 – Path Augmentation

In this experiment, we calculated the time taken to augment the selected paths with auditory and textual information. The average time taken by the application for this task, about half a minute, appears efficient. The results are shown in Table 2.

Table 2. Time taken in path augmentation

4.4 Experiment 3 – Path Extension

In this experiment, we extended an already stored path graph with additional markers. This situation arises when new markers are added to a building whose paths have already been generated. Considering Path-1 among the selected paths, we wish to extend the path and attach the markers with ids 29, 28, and 27. We start by selecting marker id 2 and moving along the new path, scanning with the phone camera until the last marker (id 27); the final path becomes (Fig. 4):

Fig. 4. Path-1 after extension to include new markers
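
Assuming the MarkerGraph sketch from Sect. 3.2, the extension amounts to re-running the same linking step starting from the existing marker. The marker ids below follow the experiment, while the angle values and class name are purely illustrative.

  // Hypothetical usage: extending an already generated path graph.
  class PathExtensionDemo {
      public static void main(String[] args) {
          MarkerGraph graph = new MarkerGraph(); // assume already populated for Path-1

          // Start at the existing marker (id 2) and link each newly
          // deployed marker to its predecessor as it is detected.
          graph.connect(2, 29, 90.0);  // marker 29 detected to the right of marker 2
          graph.connect(29, 28, 0.0);  // marker 28 straight ahead of marker 29
          graph.connect(28, 27, 0.0);  // marker 27 straight ahead of marker 28
      }
  }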

5 Conclusion

After reviewing several proposals and implementations presented in various studies for assisting visually impaired people in indoor navigation and localization, we have proposed a novel approach to the same problem. The solution has been implemented as an Android application and tested in an indoor environment for efficiency and effectiveness. It has a vital advantage over other solutions in that it requires only a smartphone with the application installed, using the camera to detect and identify plain markers and thus localize the user inside an indoor environment. It presents an automated path generation algorithm that simplifies the creation of pathways inside a building by merely detecting and connecting pre-deployed markers. Similarly, the path augmentation algorithm adds auditory and textual information to the path graph. Both algorithms have been tested in a real scenario, and the experiments have shown acceptable results. We have also tested the path extension algorithm, with which the existing path graph can be efficiently extended to include newly deployed markers.