User profiles for Jason Fries
Jason Alan FriesStanford University Verified email at stanford.edu Cited by 5177 |
Multitask prompted training enables zero-shot task generalization
Large language models have recently been shown to attain reasonable zero-shot generalization
on a diverse set of tasks (Brown et al., 2020). It has been hypothesized that this is a …
on a diverse set of tasks (Brown et al., 2020). It has been hypothesized that this is a …
[HTML][HTML] Snorkel: Rapid training data creation with weak supervision
Labeling training data is increasingly the largest bottleneck in deploying machine learning
systems. We present Snorkel, a first-of-its-kind system that enables users to train state-of-the-…
systems. We present Snorkel, a first-of-its-kind system that enables users to train state-of-the-…
[HTML][HTML] Snorkel: rapid training data creation with weak supervision
Labeling training data is increasingly the largest bottleneck in deploying machine learning
systems. We present Snorkel, a first-of-its-kind system that enables users to train state-of-the-…
systems. We present Snorkel, a first-of-its-kind system that enables users to train state-of-the-…
[HTML][HTML] Weakly supervised classification of aortic valve malformations using unlabeled cardiac MRI sequences
Biomedical repositories such as the UK Biobank provide increasing access to prospectively
collected cardiac imaging, however these data are unlabeled, which creates barriers to their …
collected cardiac imaging, however these data are unlabeled, which creates barriers to their …
[HTML][HTML] Ontology-driven weak supervision for clinical entity classification in electronic health records
JA Fries, E Steinberg, S Khattar, SL Fleming… - Nature …, 2021 - nature.com
In the electronic health record, using clinical notes to identify entities such as disorders and
their temporality (eg the order of an event relative to a time index) can inform many important …
their temporality (eg the order of an event relative to a time index) can inform many important …
Bloom: A 176b-parameter open-access multilingual language model
Large language models (LLMs) have been shown to be able to perform new tasks based on
a few demonstrations or natural language instructions. While these capabilities have led to …
a few demonstrations or natural language instructions. While these capabilities have led to …
Promptsource: An integrated development environment and repository for natural language prompts
PromptSource is a system for creating, sharing, and using natural language prompts. Prompts
are functions that map an example from a dataset to a natural language input and target …
are functions that map an example from a dataset to a natural language input and target …
[HTML][HTML] The shaky foundations of large language models and foundation models for electronic health records
The success of foundation models such as ChatGPT and AlphaFold has spurred significant
interest in building similar models for electronic medical records (EMRs) to improve patient …
interest in building similar models for electronic medical records (EMRs) to improve patient …
Bigbio: A framework for data-centric biomedical natural language processing
Training and evaluating language models increasingly requires the construction of meta-datasets--diverse
collections of curated data with clear provenance. Natural language …
collections of curated data with clear provenance. Natural language …
Language models in the loop: Incorporating prompting into weak supervision
We propose a new strategy for applying large pre-trained language models to novel tasks
when labeled training data is limited. Rather than apply the model in a typical zero-shot or few-…
when labeled training data is limited. Rather than apply the model in a typical zero-shot or few-…