Beautiful Beach Photograph by Taiwan Nans0410

Academic Works

Don't be a Hypocrite: Hierarchical Reinforcement Learning via Composition

Code Repository
A flexible deep hierarchical reinforcement learning algorithm that is able to support an arbitrary number of layers of subpolicies, providing a unifying framework for many common algorithms. The resulting policy is both recursively optimal and inherently explainable. Work in Progress

CoordiQ : Coordinated Q-learning for Electric Vehicle Charging Recommendation

Code Repository
Reinforcement Learning framework applied to real-world charging station data for recommendations to avoid long wait times. We exploit a common substructure among various possible actions (recommendations) to make reinforcement learning algorithms tractable in this space, resulting in more than a 30% improvement over baselines. Work In Progress.

Like a Light Switch: Conditional Weights for Deep Meta-Learning

Code Repository
Many tasks in the same domain make use of some of, but not all of, the same features. We examine a differentiable method for sharing weights between related tasks by multiplying the shared weights with the output of a sigmoid function, which biases to values near 0 or 1, enabling computationally tractable meta-learning. Current Course Project.

Time Series Shapelet Classification through Learned Distances

Code Repository
Traditional shapelet methods attempt to classify a time-series by measuring distance from a distinctive subsequence. However, these methods are limited in expressiveness by their use of Euclidean distance, which fails to account for different scaling of variables. In this work, we extend shapelet methods to a general class of Mahalanobis distances.

Scene Prediction in Reinforcement Learning using Generative Adversarial Networks

PDF
Most model-based reinforcement learning methods attempt to approximate the dynamics of their respective environments by stepping forward in time frame by frame. For temporally extended sequences, this can be computationally expensive and redundant. This class project tested using a GAN to generate plausible states from variable time ranges in the future.

Foodsby : Mobile App

Google Play Store
Worked with operations and marketing to develop a sleek app for the startup Foodsby. The app is multi-platform, and cleanly handles thousands of orders per-day, dynamically showing customers the best restaurants in their area and adapting to their preferences. The app remains mostly unchanged today, with over 200,000 downloads on the Google & Apple app stores!