| 1 | 
            Jan 28 | 
            Course Introduction and Overview | 
             | 
             | 
             | 
             | 
          
          
            | 2 | 
            Jan 30 | 
            Introduction to HPC / Systems | 
             | 
             | 
             | 
             | 
          
          
             | 
            Feb 4 | 
            (contd.) | 
             | 
             | 
             | 
             | 
          
          
            | 3 | 
            Feb 6 | 
            Introduction to GPU Programming | 
             | 
             | 
             | 
             | 
          
          
            | 4 | 
            Feb 11 | 
	    Introduction to Triton Programming | 
             | 
             | 
             | 
             | 
          
          
            | 5 | 
            Feb 13 | 
            Introduction to Deep Learning | 
             | 
             | 
             | 
             | 
          
          
            | 6 | 
            Feb 18 | 
	    Transformers and Performance Modeling | 
            Attention 2017 | 
             | 
             | 
             | 
          
          
             | 
            Feb 20 | 
            No class | 
          
          
            | 7 | 
            Feb 25 | 
            Challenges in High Performance DL | 
            COTS HPC 2013 | 
             | 
            Extra-Deep 2023 | 
             | 
          
          
            | 8 | 
            Feb 27 | 
            Parallel Training | 
            PyTorch DDP 2020 | 
            UB, KB | 
            PyTorch FSDP 2023 | 
            AB, ZCh | 
          
          
             | 
            Mar 4 | 
            (contd.) | 
            Megatron-LM 2019 | 
            PD, CDz | 
            AxoNN 2024 | 
            Guest - Siddharth Singh | 
          
          
             | 
            Mar 6 | 
	    Pipeline and Hybrid Parallel Training | 
            GPipe 2018 | 
            DE, AH | 
            Hybrid Parallelism 2021 | 
            Guest - Deepak Narayanan [video] | 
          
          
            | 10 | 
            Mar 11 | 
            Optimizing GPU Kernels | 
            Sputnik 2020 | 
            HH, LH | 
            Flash Attention 2022 | 
            Guest - Tri Dao [video] | 
          
          
             | 
            Mar 13 | 
	    Deep Learning Compilers | 
            TVM 2018 | 
            DJ, DK | 
            TorchDynamo 2024 | 
            Guest - Jason Ansel [video] | 
          
          
             | 
            Mar 18 | 
            Spring Break | 
          
          
             | 
            Mar 20 | 
            Spring Break | 
          
          
             | 
            Mar 25 | 
            Optimizers | 
            HyLo 2022 | 
            BM, AN | 
            Distributed Shampoo 2023 | 
            Guest - Shi & Iwasaki [video] | 
          
          
             | 
            Mar 27 | 
	    Sparsity in Training | 
            MoE 2017 | 
            SP, SS | 
            MegaBlocks 2022 | 
            Guest - Trevor Gale [video] | 
          
          
             | 
            Apr 1 | 
	    Memory offload | 
            vDNN 2016 | 
            IR, MS | 
            ZeRO-Infinity 2021 | 
             | 
          
          
             | 
            Apr 3 | 
            Introduction to Inference | 
            Transformers 2022 | 
            XT, PU | 
            vLLM 2023 | 
             | 
          
          
             | 
            Apr 8 | 
	    Approximating Attention | 
            Top-k 2021 | 
            CU, WW | 
            H2O 2023 | 
             | 
          
          
             | 
            April 10 | 
            Midterm Exam (during class) | 
          
          
             | 
            Apr 15 | 
            Long context optimizations | 
            LongBench 2023 | 
            CW, YW | 
            RingAttention 2023 | 
             | 
             | 
          
          
             | 
            Apr 17 | 
	    Quantization | 
            LLM.int8() 2022 | 
            SY, NY | 
            AWQ 2024 | 
             | 
            GPTQ 2022 | 
          
          
             | 
            Apr 22 | 
            Optimizing Data Movement | 
            PCCL 2025 | 
            YZ, ZZ | 
            DataStates-LLM 2024 | 
            Guest - Avinash Maurya [video] | 
          
          
             | 
            Apr 24 | 
	    Hardware-software Co-design | 
            Cerebras 2023 | 
            RQ, YB | 
            DLRM 2021 | 
             | 
          
          
             | 
            Apr 29 | 
            Specific DL Models | 
            CAGNET 2020 | 
            ZCa, PC | 
            DLRM 2020 | 
            Guest - Bilge Acun [video] | 
          
          
             | 
            May 1 | 
            No class | 
          
          
             | 
            May 6 | 
            Project Presentations | 
             | 
             | 
             | 
             | 
          
          
             | 
            May 8 | 
            Project Presentations | 
             | 
             | 
             | 
             | 
          
          
             | 
            May 13 | 
            Project Presentations | 
             | 
             | 
             | 
             | 
          
          
             | 
            May 15 | 
            Final Project Due |