User profiles for Kunal Menda
Kunal MendaBlackRock AI Labs Verified email at alumni.stanford.edu Cited by 314 |
Ensembledagger: A bayesian approach to safe imitation learning
K Menda, K Driggs-Campbell… - 2019 IEEE/RSJ …, 2019 - ieeexplore.ieee.org
Although imitation learning is often used in robotics, the approach frequently suffers from data
mismatch and compounding errors. DAgger is an iterative algorithm that addresses these …
mismatch and compounding errors. DAgger is an iterative algorithm that addresses these …
Deep reinforcement learning for event-driven multi-agent decision processes
The incorporation of macro-actions (temporally extended actions) into multi-agent decision
problems has the potential to address the curse of dimensionality associated with such …
problems has the potential to address the curse of dimensionality associated with such …
Structured mechanical models for robot learning and control
Abstract Model-based methods are the dominant paradigm for controlling robotic systems,
though their efficacy depends heavily on the accuracy of the model used. Deep neural …
though their efficacy depends heavily on the accuracy of the model used. Deep neural …
[HTML][HTML] Explaining COVID-19 outbreaks with reactive SEIRD models
K Menda, L Laird, MJ Kochenderfer, RS Caceres - Scientific Reports, 2021 - nature.com
COVID-19 epidemics have varied dramatically in nature across the United States, where some
counties have clear peaks in infections, and others have had a multitude of unpredictable …
counties have clear peaks in infections, and others have had a multitude of unpredictable …
A general framework for structured learning of mechanical systems
Learning accurate dynamics models is necessary for optimal, compliant control of robotic
systems. Current approaches to white-box modeling using analytic parameterizations, or black-…
systems. Current approaches to white-box modeling using analytic parameterizations, or black-…
Dropoutdagger: A bayesian approach to safe imitation learning
K Menda, K Driggs-Campbell… - arXiv preprint arXiv …, 2017 - arxiv.org
While imitation learning is becoming common practice in robotics, this approach often
suffers from data mismatch and compounding errors. DAgger is an iterative algorithm that …
suffers from data mismatch and compounding errors. DAgger is an iterative algorithm that …
Scalable identification of partially observed systems with certainty-equivalent EM
Abstract System identification is a key step for model-based control, estimator design, and
output prediction. This work considers the offline identification of partially observed nonlinear …
output prediction. This work considers the offline identification of partially observed nonlinear …
[PDF][PDF] Structured mechanical models for efficient reinforcement learning
Learning accurate dynamics models is necessary for optimal, compliant control of robotic
systems. Model-based reinforcement learning attempts to learn accurate models by interacting …
systems. Model-based reinforcement learning attempts to learn accurate models by interacting …
[PDF][PDF] Frame interpolation using generative adversarial networks
Video frame interpolation is an elusive but coveted technology with the potential to have a
far reaching impact in the video streaming service industry. In this paper, we present a novel …
far reaching impact in the video streaming service industry. In this paper, we present a novel …
Conditional Approximate Normalizing Flows for Joint Multi-Step Probabilistic Forecasting with Application to Electricity Demand
Some real-world decision-making problems require making probabilistic forecasts over
multiple steps at once. However, methods for probabilistic forecasting may fail to capture …
multiple steps at once. However, methods for probabilistic forecasting may fail to capture …