The "Black" Art & "Elusive" Science of Training Deep Neural Networks
School of Computing, University of Nebraska-Lincoln Spring 2021 (3-week session): CSCE 496/896Synopsis: Deep Learning, a cornerstone of modern Artificial Intelligence, tackles complex challenges in computer vision, natural language processing, and speech recognition. Deep Neural Networks (DNNs)—vast networks of artificial neurons arranged in hierarchical layers—uncover intricate patterns by creating increasingly meaningful data representations. Unlocking their potential requires a deep understanding of effective architectures, learning algorithms, and optimization strategies. Often dubbed a "black art" due to its complexity, training DNNs can feel like summoning a genie. This course aims to demystify this process, exploring state-of-the-art DNN architectures and delving into the "black" art and "elusive" science of training them.
Philosophy: Inspired by my mentor Richard Feynman's philosophy on learning—“What I cannot create, I do not understand”—this course emphasizes hands-on implementation. True mastery of DNN training comes from building models from scratch, without high-level libraries like Scikit-Learn, Keras, or PyTorch. Begin with a Multi-Layer Perceptron (MLP) using NumPy (pseudocode), then tackle a Convolutional Neural Network (CNN) using NumPy (pseudocode).
- Instructor
- Dr. M. R. Hasan
- Office Hours
- See the course Canvas page
- Lecture Time
- Monday through Friday: 10:00 AM - 11:30 AM via Zoom
Explore my GitHub tutorials on Machine Learning and Deep Learning for additional resources. Key repositories include:
- Convolutional Neural Networks
- Recurrent Neural Networks
- Using TensorFlow-Keras for Building Effective MLP Classifiers
- CNN: Image Classification - Tricks of the Trade
- Data Augmentation for Deep Learning
- Efficient Methodology for Deep Learning
Schedule
Text Resources
- Lecture slides and Jupyter notebooks provide a thorough and extensive account of the topics.
- Core Deep Learning Texts:
- Dive into Deep Learning by Aston Zhang, Zachary C. Lipton, Mu Li, and Alexander J. Smola
- Deep Learning by Ian Goodfellow, Yoshua Bengio, and Aaron Courville
- Deep Learning with Python by Francois Chollet
- Hands-On Machine Learning with Scikit-Learn, Keras and TensorFlow (2nd Edition, 2019) by Aurélien Géron (O'Reilly)
- Foundational Machine Learning Texts:
- Machine Learning: A Probabilistic Perspective by Kevin P. Murphy
- Pattern Recognition and Machine Learning by Christopher M. Bishop
- Introduction to Machine Learning (3rd ed.) by Ethem Alpaydin
- Python Refresher/Introduction:
- Data Science from Scratch by Joel Grus (O’Reilly)
- Python for Data Analysis (2nd Edition) by Wes McKinney (O'Reilly)
- Python Machine Learning by Sebastian Raschka (Packt Publishing)
- Statistics, Linear Algebra & Calculus:
- Advanced Engineering Mathematics (10th Ed.) by Erwin Kreyszig
- All of Statistics: A Concise Course in Statistical Inference by Larry Wasserman
- Interesting & Enlightening Texts:
- The Master Algorithm by Pedro Domingos
- Artificial Intelligence: A Guide for Thinking Humans by Melanie Mitchell
- The Deep Learning Revolution by Terrence J. Sejnowski
- Rebooting AI: Building Artificial Intelligence We Can Trust by Gary Marcus and Ernest Davis
- Thinking, Fast and Slow by Daniel Kahneman
- Interpretable Machine Learning by Christoph Molnar
Deep Learning Courses Elsewhere
- Introduction to Deep Learning - Carnegie Mellon University
- Convolutional Neural Networks for Visual Recognition - Stanford University
- Deep Learning - Stanford University
- Deep Learning Specialization - Andrew Ng (Coursera)
Collaboration Tool
Google Colab Tutorials
- Get Started with Google Colaboratory (Coding TensorFlow)
- Getting Started with TensorFlow in Google Colaboratory (Coding TensorFlow)
Python
Open Data Repositories
- UC Irvine ML Repository
- Kaggle Datasets
- Amazon’s AWS Datasets
- Statlib Datasets Archive
- The CIFAR-10 Dataset (Canadian Institute For Advanced Research) for Computer Vision Problems
- MILA Public Dataset
ML Podcasts
Journals
- Machine Learning
- Journal of Machine Learning Research
- IEEE Transactions on Neural Networks and Learning Systems
Conference Proceedings
- Conference and Workshop on Neural Information Processing Systems (NeurIPS)
- International Conference on Learning Representations (ICLR)
- International Conference on Machine Learning (ICML)
- Conference on Computer Vision and Pattern Recognition (CVPR)
- International Conference on Computer Vision (ICCV)
- European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD)
- The Advancement of Artificial Intelligence (AAAI)
- International Joint Conference on Artificial Intelligence (IJCAI)
- Annual Meeting of the Association for Computational Linguistics (ACL)
- Conference on Empirical Methods in Natural Language Processing (EMNLP)
- DeepMind Research
- Google Research
- Facebook Research
- arXiv Machine Learning Publications