User profiles for Kunal Menda

Kunal Menda

BlackRock AI Labs
Verified email at alumni.stanford.edu
Cited by 314

EnsembleDAgger: A Bayesian approach to safe imitation learning

K Menda, K Driggs-Campbell… - 2019 IEEE/RSJ …, 2019 - ieeexplore.ieee.org
Although imitation learning is often used in robotics, the approach frequently suffers from data
mismatch and compounding errors. DAgger is an iterative algorithm that addresses these …

Deep reinforcement learning for event-driven multi-agent decision processes

K Menda, YC Chen, J Grana, JW Bono… - IEEE Transactions …, 2018 - ieeexplore.ieee.org
The incorporation of macro-actions (temporally extended actions) into multi-agent decision
problems has the potential to address the curse of dimensionality associated with such …

Structured mechanical models for robot learning and control

JK Gupta, K Menda, Z Manchester… - … for Dynamics and …, 2020 - proceedings.mlr.press
Model-based methods are the dominant paradigm for controlling robotic systems,
though their efficacy depends heavily on the accuracy of the model used. Deep neural …

[HTML] Explaining COVID-19 outbreaks with reactive SEIRD models

K Menda, L Laird, MJ Kochenderfer, RS Caceres - Scientific Reports, 2021 - nature.com
COVID-19 epidemics have varied dramatically in nature across the United States, where some
counties have clear peaks in infections, and others have had a multitude of unpredictable …

A general framework for structured learning of mechanical systems

JK Gupta, K Menda, Z Manchester… - arXiv preprint arXiv …, 2019 - arxiv.org
Learning accurate dynamics models is necessary for optimal, compliant control of robotic
systems. Current approaches to white-box modeling using analytic parameterizations, or black-…

DropoutDAgger: A Bayesian approach to safe imitation learning

K Menda, K Driggs-Campbell… - arXiv preprint arXiv …, 2017 - arxiv.org
While imitation learning is becoming common practice in robotics, this approach often
suffers from data mismatch and compounding errors. DAgger is an iterative algorithm that …

Scalable identification of partially observed systems with certainty-equivalent EM

K Menda, J De Becdelievre, J Gupta… - International …, 2020 - proceedings.mlr.press
System identification is a key step for model-based control, estimator design, and
output prediction. This work considers the offline identification of partially observed nonlinear …

[PDF] Structured mechanical models for efficient reinforcement learning

K Menda, JK Gupta, Z Manchester… - … on Structure and …, 2019 - eringrant.github.io
Learning accurate dynamics models is necessary for optimal, compliant control of robotic
systems. Model-based reinforcement learning attempts to learn accurate models by interacting …

[PDF] Frame interpolation using generative adversarial networks

M Koren, K Menda, A Sharma - 2017 - cs231n.stanford.edu
Video frame interpolation is an elusive but coveted technology with the potential to have a
far-reaching impact in the video streaming service industry. In this paper, we present a novel …

Conditional Approximate Normalizing Flows for Joint Multi-Step Probabilistic Forecasting with Application to Electricity Demand

A Jamgochian, D Wu, K Menda, S Jung… - arXiv preprint arXiv …, 2022 - arxiv.org
Some real-world decision-making problems require making probabilistic forecasts over
multiple steps at once. However, methods for probabilistic forecasting may fail to capture …