Google Scholar

User profiles for Sepp Hochreiter

Sepp Hochreiter

Institute for Machine Learning, Johannes Kepler University Linz

Verified email at ml.jku.at

Cited by 149688

[PDF] xpgreat.com

Long short-term memory

S Hochreiter, J Schmidhuber - Neural computation, 1997 - ieeexplore.ieee.org

Learning to store information over extended time intervals by recurrent backpropagation takes
a very long time, mostly because of insufficient, decaying error backflow. We briefly review …

Save Cite Cited by 103038 Related articles All 46 versions Library Search

[PDF] jku.at

The vanishing gradient problem during learning recurrent neural nets and problem solutions

S Hochreiter - International Journal of Uncertainty, Fuzziness and …, 1998 - World Scientific

Recurrent nets are in principle capable to store past inputs to produce the currently desired
output. Because of this property recurrent nets are used in time series prediction and process …

Save Cite Cited by 3513 Related articles All 15 versions

[PDF] neurips.cc

Gans trained by a two time-scale update rule converge to a local nash equilibrium

…, B Nessler, S Hochreiter - Advances in neural …, 2017 - proceedings.neurips.cc

Generative Adversarial Networks (GANs) excel at creating realistic images with complex
models for which maximum likelihood is infeasible. However, the convergence of GAN training …

Save Cite Cited by 11488 Related articles All 12 versions View as HTML

[PDF] arxiv.org

Fast and accurate deep network learning by exponential linear units (elus)

DA Clevert, T Unterthiner, S Hochreiter - arXiv preprint arXiv:1511.07289, 2015 - arxiv.org

We introduce the "exponential linear unit" (ELU) which speeds up learning in deep neural
networks and leads to higher classification accuracies. Like rectified linear units (ReLUs), …

Save Cite Cited by 6730 Related articles All 13 versions View as HTML

Related searches

[PDF] neurips.cc

Self-normalizing neural networks

…, T Unterthiner, A Mayr, S Hochreiter - Advances in neural …, 2017 - proceedings.neurips.cc

Deep Learning has revolutionized vision via convolutional neural networks (CNNs) and natural
language processing via recurrent neural networks (RNNs). However, success stories of …

Save Cite Cited by 3063 Related articles All 14 versions View as HTML

[PDF] researchgate.net

[PDF][PDF] Gradient flow in recurrent nets: the difficulty of learning long-term dependencies

S Hochreiter, Y Bengio, P Frasconi, J Schmidhuber - 2001 - researchgate.net

Recurrent networks (crossreference Chapter 12) can, in principle, use their feedback
connections to store representations of recent input events in the form of activations. The most …

Save Cite Cited by 2615 Related articles All 12 versions View as HTML

[HTML] frontiersin.org

[HTML][HTML] DeepTox: toxicity prediction using deep learning

…, G Klambauer, T Unterthiner, S Hochreiter - Frontiers in …, 2016 - frontiersin.org

The Tox21 Data Challenge has been the largest effort of the scientific community to
compare computational methods for toxicity prediction. This challenge comprised 12,000 …

Save Cite Cited by 895 Related articles All 17 versions Cached

[PDF] neurips.cc

LSTM can solve hard long time lag problems

S Hochreiter, J Schmidhuber - Advances in neural …, 1996 - proceedings.neurips.cc

Standard recurrent nets cannot deal with long minimal time lags between relevant signals.
Several recent NIPS papers propose alter (cid: 173) native methods. We first show: problems …

Save Cite Cited by 1303 Related articles All 15 versions View as HTML

[PDF] jku.at

Flat minima

S Hochreiter, J Schmidhuber - Neural computation, 1997 - direct.mit.edu

We present a new algorithm for finding low-complexity neural networks with high
generalization capability. The algorithm searches for a “flat” minimum of the error function. A flat …

Save Cite Cited by 895 Related articles All 21 versions

Learning to learn using gradient descent

S Hochreiter, AS Younger, PR Conwell - Artificial Neural Networks …, 2001 - Springer

This paper introduces the application of gradient descent methods to meta-learning. The
concept of “meta-learning”, ie of a system that improves or discovers a learning algorithm, has …

Save Cite Cited by 781 Related articles All 11 versions

Create alert

Cite

Advanced search

Saved to My library

User profiles for Sepp Hochreiter

Sepp Hochreiter

Long short-term memory

The vanishing gradient problem during learning recurrent neural nets and problem solutions

Gans trained by a two time-scale update rule converge to a local nash equilibrium

Fast and accurate deep network learning by exponential linear units (elus)

Related searches

Self-normalizing neural networks

[PDF][PDF] Gradient flow in recurrent nets: the difficulty of learning long-term dependencies

[HTML][HTML] DeepTox: toxicity prediction using deep learning

LSTM can solve hard long time lag problems

Flat minima

Learning to learn using gradient descent