Deep Learning

Ian Goodfellow, Yoshua Bengio, Aaron Courville

MIT Press, Nov 18, 2016 - Computers - 800 pages

An introduction to a broad range of topics in deep learning, covering mathematical and conceptual background, deep learning techniques used in industry, and research perspectives.

“Written by three experts in the field, Deep Learning is the only comprehensive book on the subject.”
—Elon Musk, cochair of OpenAI; cofounder and CEO of Tesla and SpaceX

Deep learning is a form of machine learning that enables computers to learn from experience and understand the world in terms of a hierarchy of concepts. Because the computer gathers knowledge from experience, there is no need for a human computer operator to formally specify all the knowledge that the computer needs. The hierarchy of concepts allows the computer to learn complicated concepts by building them out of simpler ones; a graph of these hierarchies would be many layers deep. This book introduces a broad range of topics in deep learning.

The text offers mathematical and conceptual background, covering relevant concepts in linear algebra, probability theory and information theory, numerical computation, and machine learning. It describes deep learning techniques used by practitioners in industry, including deep feedforward networks, regularization, optimization algorithms, convolutional networks, sequence modeling, and practical methodology; and it surveys such applications as natural language processing, speech recognition, computer vision, online recommendation systems, bioinformatics, and video games. Finally, the book offers research perspectives, covering such theoretical topics as linear factor models, autoencoders, representation learning, structured probabilistic models, Monte Carlo methods, the partition function, approximate inference, and deep generative models.

Deep Learning can be used by undergraduate or graduate students planning careers in either industry or research, and by software engineers who want to begin using deep learning in their products or platforms. A website offers supplementary material for both readers and instructors.

Introduction

Applied Math and Machine Learning Basics

Modern Practices

Deep Learning Research

Bibliography

Index

About the authors (2016)

Ian Goodfellow is a Research Scientist at Google.

Yoshua Bengio is Professor of Computer Science at the Université de Montréal.

Aaron Courville is Assistant Professor of Computer Science at the Université de Montréal.

Bibliographic information

Title	Deep Learning
Series	Adaptive Computation and Machine Learning series
Authors	Ian Goodfellow, Yoshua Bengio, Aaron Courville
Edition	illustrated
Publisher	MIT Press, 2016
ISBN	0262035618, 9780262035613
Length	800 pages
Subjects	Computers / Artificial Intelligence / General; Computers / Computer Science; Computers / Data Science / Machine Learning
