User profiles for Karan Desai
Karan DesaiUniversity of Michigan Verified email at umich.edu Cited by 1049 |
Virtex: Learning visual representations from textual annotations
The de-facto approach to many vision tasks is to start from pretrained visual representations,
typically learned via supervised training on ImageNet. Recent methods have explored …
typically learned via supervised training on ImageNet. Recent methods have explored …
Hyperbolic image-text representations
Visual and linguistic concepts naturally organize themselves in a hierarchy, where a textual
concept" dog" entails all images that contain dogs. Despite being intuitive, current large-…
concept" dog" entails all images that contain dogs. Despite being intuitive, current large-…
Nocaps: Novel object captioning at scale
Image captioning models have achieved impressive results on datasets containing limited
visual concepts and large amounts of paired image-caption training data. However, if these …
visual concepts and large amounts of paired image-caption training data. However, if these …
Redcaps: Web-curated image-text data created by the people, for the people
Large datasets of paired images and text have become increasingly popular for learning
generic representations for vision and vision-and-language tasks. Such datasets have been …
generic representations for vision and vision-and-language tasks. Such datasets have been …
Casting your model: Learning to localize improves self-supervised representations
Recent advances in self-supervised learning (SSL) have largely closed the gap with supervised
ImageNet pretraining. Despite their success these methods have been primarily applied …
ImageNet pretraining. Despite their success these methods have been primarily applied …
Probabilistic neural symbolic models for interpretable visual question answering
We propose a new class of probabilistic neural-symbolic models, that have symbolic functional
programs as a latent, stochastic variable. Instantiated in the context of visual question …
programs as a latent, stochastic variable. Instantiated in the context of visual question …
Learning visual representations via language-guided sampling
M El Banani, K Desai… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Although an object may appear in numerous contexts, we often describe it in a limited
number of ways. Language allows us to abstract away visual variation to represent and …
number of ways. Language allows us to abstract away visual variation to represent and …
[HTML][HTML] Baseline cardiometabolic profiles and SARS-CoV-2 infection in the UK Biobank
RJ Scalsky, YJ Chen, K Desai, JR O'Connell… - PLoS …, 2021 - journals.plos.org
Background SARS-CoV-2 is a rapidly spreading coronavirus responsible for the Covid-19
pandemic, which is characterized by severe respiratory infection. Many factors have been …
pandemic, which is characterized by severe respiratory infection. Many factors have been …
The effect of BMI on outcomes following complex abdominal wall reconstructions
KA Desai, SA Razavi, AM Hart… - Annals of plastic …, 2016 - journals.lww.com
Background The management of complex abdominal wall defects continues to be a challenging
process secondary to the high potential for wound healing issues and ventral hernia …
process secondary to the high potential for wound healing issues and ventral hernia …
Continual reinforcement learning in 3d non-stationary environments
High-dimensional always-changing environments constitute a hard challenge for current
reinforcement learning techniques. Artificial agents, nowadays, are often trained off-line in very …
reinforcement learning techniques. Artificial agents, nowadays, are often trained off-line in very …