CS426/CS525 Parallel Computing
Parallel programming platforms: distributed memory, shared address space, accelerators. Principles of parallel algorithm design: decomposition techniques, tasks and interactions, mapping for load balancing, interaction overheads, parallel algorithm models (data-parallel, task-graph, work-pool, master-slave, pipeline). Basic communication operations. Analytical modeling of parallel programs: sources of parallel programming overhead, performance metrics for parallel systems, scalability of parallel systems (speedup, efficiency, cost, overhead function, isoefficiency, cost optimality, degree of concurrency, granularity), parallel programming paradigms: programming using MPI, programming shared address space platforms (threads, OpenMP, Intel Thread Building Blocks), programming GPUs (CUDA, OpenCL). Parallel computing kernels: matrix transposition, matrix-vector multiplication, matrix-matrix multiplication, matrix partitioning schemes for load-balancing and communication minimization.
Prerequisite: CS 342
Introduction to Parallel
Author: Ananth Grama, Anshul Gupta, George Karypis, Vipin Kumar
Dr. Özcan Öztürk Office Hours: 10:00 - 12:00, Thursday or by appointment.
Credit Hours: 3
Class Schedule: Check STARS for detailed schedule
Office Hours: TBD.
Midterm Exam 25%, in class
Final Exam 25%, in class
Projects (3-5) 35% No late assignments will be accepted.
Class participation & pop quizzes 15%
Minimum Requirements to Qualify for the Final Exam:
%50 minimum weighted grade average from midterm 1, project, quiz
Lecture Contents (Tentative!)