First Person Vision for Activity Prediction Using Probabilistic Modeling

Abstract

Identifying activities of daily living is an important area of research with applications in smart homes and healthcare for elderly people. The task is challenging because of human self-occlusion, complex natural environments, and human behavior when performing complicated tasks. Psychological studies show that human gaze is closely linked with the thought process and that we tend to “look” at objects before acting on them. We therefore use the object information present in gaze images as context and as the basis for activity prediction. Our system is based on HMMs (Hidden Markov Models) and is trained using ANNs (Artificial Neural Networks). We begin by extracting motion information from TPV (Third Person Vision) streams and object information from FPV (First Person Vision) cameras. The advantage of FPV is that the object information forms the context of the scene; when this context is included as input to the HMM for activity recognition, precision increases. For testing, we used two standard datasets: one from TUM (Technische Universitaet Muenchen) and GTEA Gaze+ (Georgia Tech Egocentric Activities). In the first round, we trained our ANNs with activity information only; in the second round, we added the object information as well. The precision of the predicted activities rose from 55.21% to 77.61%, and the accuracy from 85.25% to 93.5%. This confirmed our initial hypothesis that including the actor's focus of attention, in the form of the objects seen in FPV, helps to predict activities better.
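The idea of feeding object context into an HMM alongside motion can be illustrated with a minimal sketch. It is not the authors' implementation: the activity names, observation symbols, and every probability below are invented for illustration, the tables are set by hand rather than learned with an ANN, and the motion and object cues are assumed to be conditionally independent given the activity.

```python
# Toy discrete HMM over activities. All names and numbers are hypothetical
# and hand-set; the paper's system learns its parameters with ANNs instead.
ACTIVITIES = ["pour", "stir"]

START = {"pour": 0.5, "stir": 0.5}
TRANS = {
    "pour": {"pour": 0.7, "stir": 0.3},
    "stir": {"pour": 0.3, "stir": 0.7},
}
# P(motion cue | activity): stands in for the TPV motion stream.
MOTION_EMIT = {
    "pour": {"arm_tilt": 0.6, "arm_circle": 0.4},
    "stir": {"arm_tilt": 0.4, "arm_circle": 0.6},
}
# P(gazed object | activity): stands in for the FPV object context.
OBJECT_EMIT = {
    "pour": {"jug": 0.8, "spoon": 0.2},
    "stir": {"jug": 0.1, "spoon": 0.9},
}


def emission(activity, motion, obj=None):
    """Emission probability; the object term is applied only when obj is given."""
    p = MOTION_EMIT[activity][motion]
    if obj is not None:
        # Assumption: motion and object are independent given the activity.
        p *= OBJECT_EMIT[activity][obj]
    return p


def propagate(belief):
    """Push a belief over activities one step through the transition model."""
    return {
        a: sum(belief[prev] * TRANS[prev][a] for prev in ACTIVITIES)
        for a in ACTIVITIES
    }


def forward_filter(observations):
    """Return P(current activity | all observations so far) via the forward algorithm."""
    belief = dict(START)
    for step, (motion, obj) in enumerate(observations):
        if step > 0:
            belief = propagate(belief)
        belief = {a: belief[a] * emission(a, motion, obj) for a in ACTIVITIES}
        total = sum(belief.values())
        belief = {a: p / total for a, p in belief.items()}
    return belief


motion_only = [("arm_circle", None), ("arm_tilt", None)]
with_objects = [("arm_circle", "spoon"), ("arm_tilt", "spoon")]

belief_motion = forward_filter(motion_only)
belief_context = forward_filter(with_objects)
next_activity = propagate(belief_context)  # one-step-ahead prediction
```

With these toy tables the motion cues alone are ambiguous, while the gazed object (a spoon) shifts the belief clearly toward "stir". This mirrors the abstract's claim qualitatively only and says nothing about the reported precision figures.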

Authors and Affiliations

Shaheena Noor, Vali Uddin

  • EP ID EP394637
  • DOI 10.22581/muet1982.1804.09

How To Cite

Shaheena Noor, Vali Uddin (2018). First Person Vision for Activity Prediction Using Probabilistic Modeling. Mehran University Research Journal of Engineering and Technology, 37(4), 545-558. https://europub.co.uk./articles/-A-394637