Google Scholar

User profiles for Jacob Steinhardt

Jacob Steinhardt

Stanford University

Verified email at cs.stanford.edu

Cited by 16707

[PDF] thecvf.com

Natural adversarial examples

…, K Zhao, S Basart, J Steinhardt… - Proceedings of the …, 2021 - openaccess.thecvf.com

We introduce two challenging datasets that reliably cause machine learning model performance
to substantially degrade. The datasets are collected with a simple adversarial filtration …

Save Cite Cited by 1227 Related articles All 9 versions View as HTML

[PDF] arxiv.org

Certified defenses against adversarial examples

A Raghunathan, J Steinhardt, P Liang - arXiv preprint arXiv:1801.09344, 2018 - arxiv.org

While neural networks have achieved high accuracy on standard image classification
benchmarks, their accuracy drops to nearly zero in the presence of small adversarial perturbations …

Save Cite Cited by 1058 Related articles All 5 versions View as HTML

[PDF] arxiv.org

Unsolved problems in ml safety

…, N Carlini, J Schulman, J Steinhardt - arXiv preprint arXiv …, 2021 - arxiv.org

Machine learning (ML) systems are rapidly increasing in size, are acquiring new capabilities,
and are increasingly deployed in high-stakes settings. As with other powerful technologies, …

Save Cite Cited by 253 Related articles All 6 versions View as HTML

[PDF] arxiv.org

The malicious use of artificial intelligence: Forecasting, prevention, and mitigation

…, H Anderson, H Roff, GC Allen, J Steinhardt… - arXiv preprint arXiv …, 2018 - arxiv.org

This report surveys the landscape of potential security threats from malicious uses of AI, and
proposes ways to better forecast, prevent, and mitigate these threats. After analyzing the …

Save Cite Cited by 976 Related articles All 4 versions Library Search View as HTML

[PDF] arxiv.org

Concrete problems in AI safety

D Amodei, C Olah, J Steinhardt, P Christiano… - arXiv preprint arXiv …, 2016 - arxiv.org

Rapid progress in machine learning and artificial intelligence (AI) has brought increasing
attention to the potential impacts of AI technologies on society. In this paper we discuss one …

Save Cite Cited by 2610 Related articles All 9 versions View as HTML

[PDF] neurips.cc

Grounding representation similarity through statistical testing

F Ding, JS Denain, J Steinhardt - Advances in Neural …, 2021 - proceedings.neurips.cc

To understand neural network behavior, recent works quantitatively compare different
networks' learned representations using canonical correlation analysis (CCA), centered kernel …

Save Cite Cited by 41 Related articles All 3 versions View as HTML

[PDF] wustl.edu

Robust moment estimation and improved clustering via sum of squares

PK Kothari, J Steinhardt, D Steurer - … of the 50th Annual ACM SIGACT …, 2018 - dl.acm.org

We develop efficient algorithms for estimating low-degree moments of unknown distributions
in the presence of adversarial outliers and design a new family of convex relaxations for k-…

Save Cite Cited by 151 Related articles All 5 versions

[PDF] arxiv.org

Measuring massive multitask language understanding

…, A Zou, M Mazeika, D Song, J Steinhardt - arXiv preprint arXiv …, 2020 - arxiv.org

We propose a new test to measure a text model's multitask accuracy. The test covers 57 tasks
including elementary mathematics, US history, computer science, law, and more. To attain …

Save Cite Cited by 1294 Related articles All 4 versions View as HTML

[PDF] neurips.cc

Certified defenses for data poisoning attacks

J Steinhardt, PWW Koh… - Advances in neural …, 2017 - proceedings.neurips.cc

Abstract Machine learning systems trained on user-provided data are susceptible to data
poisoning attacks, whereby malicious users inject false training data with the aim of corrupting …

Save Cite Cited by 824 Related articles All 10 versions View as HTML

[PDF] thecvf.com

The many faces of robustness: A critical analysis of out-of-distribution generalization

…, M Guo, D Song, J Steinhardt… - Proceedings of the …, 2021 - openaccess.thecvf.com

We introduce four new real-world distribution shift datasets consisting of changes in image
style, image blurriness, geographic location, camera operation, and more. With our new …

Save Cite Cited by 1308 Related articles All 8 versions View as HTML

Create alert

Cite

Advanced search

Saved to My library

User profiles for Jacob Steinhardt

Jacob Steinhardt

Natural adversarial examples

Certified defenses against adversarial examples

Unsolved problems in ml safety

The malicious use of artificial intelligence: Forecasting, prevention, and mitigation

Concrete problems in AI safety

Grounding representation similarity through statistical testing

Robust moment estimation and improved clustering via sum of squares

Measuring massive multitask language understanding

Certified defenses for data poisoning attacks

The many faces of robustness: A critical analysis of out-of-distribution generalization

Related searches