Ishan Gupta

Projects

I have been working on projects related to computer vision, deep learning, video understanding and compression from the past few years. Here are some of the maintained projects which I worked on.

Image to Latex conversion using Visual Attention Model

Developed a Deep Neural Network using Soft Attention to predict the latex markup from latex-rendered images of mathematical formulae. Achieved 75% accuracy in exactly reproducing the input image from the generated latex syntax. Introduced a novel technique for regularizing the attention model by using a running average of the attention weights. Created a web interface in bootstrap for better user experience and cross platform performance validation.

Report
Code

Driver-Activity-Recognition

The project involved solving one of the major challenges involved in accounting driver vigilance for safe driving systems. I was involved in data collection and deciding basic activities performed by drivers while driving. We analyzed different approaches like frame by frame classification and RNN based attention model for activity classification. This projects helped in proving major cues to generate self alarms for when should the autonomous car switch from autonomous to manual mode.

Report
Code

Face Analysis using Deep Learning

We explored a multi-task learning frame-work for face analysis. It is the first tensorflow implementation of one of the state of the arts in the domain of face analysis and facial landmark localization. The training was performed on AFLW datset and algorithms for iterative region proposal and landmark based NMS were writtten from scratch.

Report
Code

Fall Risk Measurement using Computer Vision

Developed Computer Vision Algorithms to automate analysis of videos of SPPB(Short Physical Performance Battery) tests for measuring risk of fall in senior adults. Came up with a novel algorithm to ascertain gait speed and gait speed variability of a subject from videos using Mixture of Gaussians. Collaborated with UCSD Health Care for data collection and other experiments.

Fall Risk Measurement using Computer Vision

3D Bounding Box Estimation Using Visual Geometry and Deep Learning Methods

Currently working on developing a 3D bounding box estimation module as a part of complete computer vision based visual tracker system.The module will be able to predict the projection of the bottom face center of surrounding vehicles with respect to ego vehicle.The final dimension and visual yaw predictions can parametrize the 3D bounding of the surrounding vehicles.The proposed architecture will remove the dependence on LIDAR of current multi object tracking systems.

Ishan Gupta

University of California San Diego

Birla Institute of Techology and Sciences, Pilani

Academic Achievements

Industrial Experience

A9.com (An Amazon company)

Broadcom Research

Broadcom Research

Nutonomy

Projects

Image to Latex conversion using Visual Attention Model

Driver-Activity-Recognition

Face Analysis using Deep Learning

Fall Risk Measurement using Computer Vision

Fall Risk Measurement using Computer Vision

3D Bounding Box Estimation Using Visual Geometry and Deep Learning Methods

Misc

Intel IOT Hackathon using Edison