User profiles for Jason Fries

Jason Alan Fries

Stanford University
Verified email at stanford.edu
Cited by 5177

Multitask prompted training enables zero-shot task generalization

…, J Rozen, A Sharma, A Santilli, T Fevry, JA Fries… - arXiv preprint arXiv …, 2021 - arxiv.org
Large language models have recently been shown to attain reasonable zero-shot generalization
on a diverse set of tasks (Brown et al., 2020). It has been hypothesized that this is a …

Snorkel: Rapid training data creation with weak supervision

A Ratner, SH Bach, H Ehrenberg, J Fries… - Proceedings of the …, 2017 - ncbi.nlm.nih.gov
Labeling training data is increasingly the largest bottleneck in deploying machine learning
systems. We present Snorkel, a first-of-its-kind system that enables users to train state-of-the-…

Snorkel: rapid training data creation with weak supervision

A Ratner, SH Bach, H Ehrenberg, J Fries, S Wu, C Ré - The VLDB Journal, 2020 - Springer
Labeling training data is increasingly the largest bottleneck in deploying machine learning
systems. We present Snorkel, a first-of-its-kind system that enables users to train state-of-the-…

Weakly supervised classification of aortic valve malformations using unlabeled cardiac MRI sequences

JA Fries, P Varma, VS Chen, K Xiao, H Tejeda… - Nature …, 2019 - nature.com
Biomedical repositories such as the UK Biobank provide increasing access to prospectively
collected cardiac imaging; however, these data are unlabeled, which creates barriers to their …

Ontology-driven weak supervision for clinical entity classification in electronic health records

JA Fries, E Steinberg, S Khattar, SL Fleming… - Nature …, 2021 - nature.com
In the electronic health record, using clinical notes to identify entities such as disorders and
their temporality (e.g., the order of an event relative to a time index) can inform many important …

BLOOM: A 176B-parameter open-access multilingual language model

…, G Chhablani, H Wang, H Pandey, H Strobelt, JA Fries… - 2023 - inria.hal.science
Large language models (LLMs) have been shown to be able to perform new tasks based on
a few demonstrations or natural language instructions. While these capabilities have led to …

PromptSource: An integrated development environment and repository for natural language prompts

…, C Xu, G Chhablani, H Wang, JA Fries… - arXiv preprint arXiv …, 2022 - arxiv.org
PromptSource is a system for creating, sharing, and using natural language prompts. Prompts
are functions that map an example from a dataset to a natural language input and target …

The shaky foundations of large language models and foundation models for electronic health records

…, E Steinberg, S Fleming, MA Pfeffer, J Fries… - npj Digital …, 2023 - nature.com
The success of foundation models such as ChatGPT and AlphaFold has spurred significant
interest in building similar models for electronic medical records (EMRs) to improve patient …

BigBIO: A framework for data-centric biomedical natural language processing

J Fries, L Weber, N Seelam, G Altay… - Advances in …, 2022 - proceedings.neurips.cc
Training and evaluating language models increasingly requires the construction of meta-datasets: diverse
collections of curated data with clear provenance. Natural language …

Language models in the loop: Incorporating prompting into weak supervision

R Smith, JA Fries, B Hancock, SH Bach - ACM/IMS Journal of Data …, 2024 - dl.acm.org
We propose a new strategy for applying large pre-trained language models to novel tasks
when labeled training data is limited. Rather than apply the model in a typical zero-shot or few-…