Vihang Agarwal

I am a Research Assistant at University of Michigan, where I work on computer vision and deep learning for medical imaging advised by Prof. James Balter. I recently completed my masters in EECS, Computer Vision from University of Michigan where I was fortunate enough to work with Prof. David Fouhey on problems in Robot Navigation/3D visual scene understanding and compete in The Alexa Prize Socialbot Grand Challenge . I also have an undergraduate degree in Mechanical Engineering from IIT Kanpur, India along with a minor in Artificial Intelligence.

Email  /  CV  /  Google Scholar  /  LinkedIn

profile photo
Research

My research interests are in computer vision, deep learning, reinforcement learning, and medical imaging.

fastmri Accelerating MRI Acquisition using Cascaded Attention U-Net with Prior Information
AAPM/COMP, 2020   (Oral Presentation)
video

Reconstructing detail rich MRI using prior information from multi-imaging protocols and spatial non-local MR image patch regularities.

alexa Audrey: A Personalized Open-Domain Conversational Bot
The Alexa Prize Proceedings, 2020

An open-domain conversational chat-bot that aims to engage users on various conversational levels with socially-aware models such as Emotion Detection.

social_robot A Narrative Speech, Gaze and Gesturing Robot Accessing to Human Emotion and Memory
International Conference on Innovative Computing, Information and Control, 2017

A study of changes in attention and episodic memory of human participants due to Human-Robot social interactions.

Projects
pawpal PawPal
EECS 504: Foundations of Computer Vision
Instructor: Dr. Jason Corso
poster / code / report

Using state-of-the-art computer vision algorithms, we developed a dog localization and activity recognition system that can determine what your dog is doing from a home surveillance camera. An innovative solution allowing autonomous active monitoring and safekeeping of pets without requiring any interference from the owner.

trafficflow Improving Traffic Flow with Deep RL
EECS 545: Machine Learning
Instructor: Clayton Scott
code / report

We compared Action-specific Deep Q networks (ADQN) and Action specific Recurrent Deep Q networks (ADRQN) to drive vehicles in the Deep Traffic Simulator. The recurrent network can handle partial observability induced by only having access to a partial field of view per timestep and converges faster in this setting.

3D Visual Scene Understanding
Independent Study
Instructor: David Fouhey

I implemented a ResNet-DenseNet encoder-decoder network for estimating depth maps, normals and occlusion edges from single images on the NYUv2 dataset . The hope was to exploit learnt feature representations and intra task dependencies for efficient transfer learning.

greedy Gluttonous: A greedy Algorithm for Steiner Forest
EECS 598: Approximation Algorithms
Instructor: Viswanath Nagarajan
presentation / report / original arxiv paper

We summarized the core ideas behind Greedy Algorithms for solving Steiner Forest problems in a short technical report and presented the intuition behind these algorithms in an easily understandable and comprehensive presentation.

humanai Lazy Human AI Teams
EECS 598: Human-AI Interaction and Crowdsourcing
Instructor: Walter S. Lasecki
code / report

We explored the use of Human Effort as an additional task dimension in Human-AI teams and showed that optimising Human Effort in an active learning paradigm improves overall team performance. I simulated a variety of active learning experiments on the Swicthboard corpus and designed a simple web interface to conduct experiments.

policysketch Generalizing Navigation Behaviors with Policy Sketches
Independent Study
Instructor: David Fouhey
proposed abstract

We explored long range 3D Navigation in the virtual House 3D environment. I defined behavioral primitives important for navigation at large scales and implemented Imitation Learning to learn these primitives as policy sketches through supervised demonstrations. The goal was to learn a general navigation policy as a sequence of such policy sketches.

painting VAE for Painting Timeline
CS 771A: Introduction to Machine Learning
Instructor: Purushottam Kar
report

The current architecture of VAE suffers from latent space saturation (with inefficient packing) and mode collapse. We aimed to deal with mode collapse by introducing more than one encoder in the architecture, with the latent spaces mapped to a single decoder. We attempted to incorporate different techniques to get good reconstructions.

Teaching

SI 301: Models of Social Information Processing - Instructional Aide


Thanks Jon for this amazing template.