User profiles for Yujia Bao

Yujia Bao

Massachusetts Institute of Technology
Verified email at csail.mit.edu
Cited by 521

Deriving machine attention from human rationales

Y Bao, S Chang, M Yu, R Barzilay - arXiv preprint arXiv:1808.09367, 2018 - arxiv.org
Attention-based models are successful when trained on large amounts of data. In this paper,
we demonstrate that even in the low-resource scenario, attention can be learned effectively. …

Few-shot text classification with distributional signatures

Y Bao, M Wu, S Chang, R Barzilay - arXiv preprint arXiv:1908.06039, 2019 - arxiv.org
In this paper, we explore meta-learning for few-shot text classification. Meta-learning has
shown strong performance in computer vision, where low-level patterns are transferable across …

Predict then interpolate: A simple algorithm to learn stable classifiers

Y Bao, S Chang, R Barzilay - International Conference on …, 2021 - proceedings.mlr.press
We propose Predict then Interpolate (PI), a simple algorithm for learning correlations that
are stable across environments. The algorithm follows from the intuition that when using a …

Using machine learning and natural language processing to review and classify the medical literature on cancer susceptibility genes

Y Bao, Z Deng, Y Wang, H Kim… - JCO Clinical Cancer …, 2019 - ascopubs.org
PURPOSE The medical literature relevant to germline genetics is growing exponentially.
Clinicians need tools that help to monitor and prioritize the literature to understand the clinical …

Natural language processing to facilitate breast cancer research and management

KS Hughes, J Zhou, Y Bao, P Singh, J Wang… - The Breast …, 2020 - Wiley Online Library
The medical literature has been growing exponentially, and its size has become a barrier
for physicians to locate and extract clinically useful information. As a promising solution, …

Learning to split for automatic bias detection

Y Bao, R Barzilay - arXiv preprint arXiv:2204.13749, 2022 - arxiv.org
Classifiers are biased when trained on biased datasets. As a remedy, we propose Learning
to Split (ls), an algorithm for automatic bias detection. Given a dataset with input-label pairs, …

Validation of a semiautomated natural language processing–based procedure for meta-analysis of cancer susceptibility gene penetrance

Z Deng, K Yin, Y Bao, VD Armengol, C Wang… - JCO clinical cancer …, 2019 - ascopubs.org
PURPOSE Quantifying the risk of cancer associated with pathogenic mutations in germline
cancer susceptibility genes—that is, penetrance—enables the personalization of preventive …

Learning stable classifiers by transferring unstable features

Y Bao, S Chang, R Barzilay - International Conference on …, 2022 - proceedings.mlr.press
While unbiased machine learning models are essential for many applications, bias is a
human-defined concept that can vary across tasks. Given only input-label pairs, algorithms may …

Contextual vision transformers for robust representation learning

Y Bao, T Karaletsos - arXiv preprint arXiv:2305.19402, 2023 - arxiv.org
We present Contextual Vision Transformers (ContextViT), a method for producing robust
feature representations for images exhibiting grouped structure such as covariates. ContextViT …

Non-medullary thyroid cancer susceptibility genes: evidence and disease spectrum

J Zhou, P Singh, K Yin, J Wang, Y Bao, M Wu… - Annals of surgical …, 2021 - Springer
Background The prevalence of non-medullary thyroid cancer (NMTC) is increasing worldwide.
Although most NMTCs grow slowly, conventional therapies are less effective in advanced …