Object Detection and Tracking using Faster R-CNN
Chakradhara Panda

Chakradhara Panda, Advanced Analytics and Cognitive Computing, IBM India Private Limited, Bengaluru, India.
Manuscript received on 13 August 2019. | Revised Manuscript received on 19 August 2019. | Manuscript published on 30 September 2019. | PP: 4894-4900 | Volume-8 Issue-3 September 2019 | Retrieval Number: C5580098319/2019©BEIESP | DOI: 10.35940/ijrte.C5580.098319
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Abstract: This paper uses a deep learning model, Faster R-CNN, to detect and track objects in images. Two backbone networks, ResNet-101 and VGG-16, are tested on a self-created dataset and the PASCAL VOC dataset. The intersection over union (IoU) technique is used for object tracking. The impacts of batch size, number of iterations and learning rate are analysed. The paper finds that ResNet-101 significantly outperforms VGG-16, by 13% on test data. This finding reinforces the view that a deeper network is better at feature extraction and generalization. IoU is able to track multiple objects and can identify the loss of a track. The processing speed is found to be 5 frames per second (fps). The study has implications for many computer vision applications. For example, deep learning based object detection and tracking can either augment the capability of LiDARs and sensors or become an alternative to them in self-driving vehicles.
Keywords: Faster R-CNN, ResNet-101, VGG-16, Object Detection and Tracking.
Scope of the Article: CNN
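
The abstract states that IoU is used to track multiple objects and to identify the loss of a track. The sketch below illustrates one common way this can be done; it is not taken from the paper. It assumes boxes in (x_min, y_min, x_max, y_max) form, a greedy highest-IoU-first matching between the previous frame's tracks and the current frame's detections, and a matching threshold of 0.5. The names iou and match_tracks, the matching strategy and the threshold value are all illustrative assumptions.

```python
from typing import Dict, List, Tuple

# Assumed box format: (x_min, y_min, x_max, y_max) in pixels.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    # Width and height are clamped at zero so disjoint boxes give no overlap.
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def match_tracks(
    tracks: Dict[int, Box],
    detections: List[Box],
    threshold: float = 0.5,  # assumed value, not taken from the paper
) -> Tuple[Dict[int, Box], List[int], List[int]]:
    """Greedily assign detections to existing tracks by descending IoU.

    Returns the updated tracks, the ids of tracks with no match (lost),
    and the indices of detections that matched no track (candidates for
    new tracks).
    """
    candidates = sorted(
        (
            (iou(box, det), track_id, det_idx)
            for track_id, box in tracks.items()
            for det_idx, det in enumerate(detections)
        ),
        reverse=True,
    )
    updated: Dict[int, Box] = {}
    used = set()
    for score, track_id, det_idx in candidates:
        if score < threshold:
            break  # every remaining pair overlaps even less
        if track_id in updated or det_idx in used:
            continue  # each track and each detection is matched at most once
        updated[track_id] = detections[det_idx]
        used.add(det_idx)
    lost = [track_id for track_id in tracks if track_id not in updated]
    unmatched = [i for i in range(len(detections)) if i not in used]
    return updated, lost, unmatched
```

In a per-frame loop, the detections would come from the Faster R-CNN output for the current frame, the updated dictionary would become the tracks for the next frame, and any id in the lost list would signal a loss of track.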