User profiles for David Silver

David Silver

DeepMind, UCL
Verified email at google.com
Cited by 194200

Human-level control through deep reinforcement learning

V Mnih, K Kavukcuoglu, D Silver, AA Rusu, J Veness… - Nature, 2015 - nature.com
The theory of reinforcement learning provides a normative account [1], deeply rooted in
psychological [2] and neuroscientific [3] perspectives on animal behaviour, of how agents may …

Mastering the game of Go with deep neural networks and tree search

D Silver, A Huang, CJ Maddison, A Guez, L Sifre… - Nature, 2016 - nature.com
The game of Go has long been viewed as the most challenging of classic games for artificial
intelligence owing to its enormous search space and the difficulty of evaluating board …

Mastering the game of Go without human knowledge

D Silver, J Schrittwieser, K Simonyan, I Antonoglou… - Nature, 2017 - nature.com
A long-standing goal of artificial intelligence is an algorithm that learns, tabula rasa,
superhuman proficiency in challenging domains. Recently, AlphaGo became the first program to …

Reward is enough

D Silver, S Singh, D Precup, RS Sutton - Artificial Intelligence, 2021 - Elsevier
In this article we hypothesise that intelligence, and its associated abilities, can be understood
as subserving the maximisation of reward. Accordingly, reward is enough to drive …

Continuous control with deep reinforcement learning

…, A Pritzel, N Heess, T Erez, Y Tassa, D Silver… - arXiv preprint arXiv …, 2015 - arxiv.org
We adapt the ideas underlying the success of Deep Q-Learning to the continuous action
domain. We present an actor-critic, model-free algorithm based on the deterministic policy …

Grandmaster level in StarCraft II using multi-agent reinforcement learning

…, K Kavukcuoglu, D Hassabis, C Apps, D Silver - Nature, 2019 - nature.com
Many real-world applications require artificial agents to compete and coordinate with other
agents in complex environments. As a stepping stone to this goal, the domain of StarCraft has …

Playing Atari with deep reinforcement learning

V Mnih, K Kavukcuoglu, D Silver, A Graves… - arXiv preprint arXiv …, 2013 - arxiv.org
We present the first deep learning model to successfully learn control policies directly from
high-dimensional sensory input using reinforcement learning. The model is a convolutional …

Improved protein structure prediction using potentials from deep learning

…, K Simonyan, S Crossan, P Kohli, DT Jones, D Silver… - Nature, 2020 - nature.com
Protein structure prediction can be used to determine the three-dimensional shape of a
protein from its amino acid sequence [1]. This problem is of fundamental importance as the …

Highly accurate protein structure prediction with AlphaFold

…, M Pacholska, T Berghammer, S Bodenstein, D Silver… - Nature, 2021 - nature.com
Proteins are essential to life, and understanding their structure can facilitate a mechanistic
understanding of their function. Through an enormous experimental effort [1, 2, 3, 4], the …

Deterministic policy gradient algorithms

D Silver, G Lever, N Heess, T Degris… - International …, 2014 - proceedings.mlr.press
In this paper we consider deterministic policy gradient algorithms for reinforcement learning
with continuous actions. The deterministic policy gradient has a particularly appealing form: …