1 | Introduction & Matrix Multiplication (PDF - 6.9MB) |
2 | Bentley Rules for Optimizing Work (PDF - 3.5 MB) |
3 | Bit Hacks (PDF - 2.3MB) |
4 | Assembly Language and Computer Architecture (PDF - 6.4MB) |
5 | C to Assembly (PDF - 6.5MB) |
6 | Multicore Programming (PDF - 4.4MB) |
7 | Races and Parallelism (PDF - 4.4MB) |
8 | Analysis of Multithreaded Algorithms (PDF - 6.3MB) |
9 | What Compilers Can and Cannot Do (PDF - 7.4MB) |
10 | Measurement and Timing (PDF - 1.6MB) |
11 | Storage Allocation (PDF - 3.3MB) |
12 | Parallel Storage Allocation (PDF - 2.5MB) |
13 | The Cilk Runtime System (PDF - 4.7MB) |
14 | Caching and Cache-Efficient Algorithms (PDF - 2.8MB) |
15 | Cache-Oblivious Algorithms (PDF - 3.2MB) |
16 | Nondeterministic Parallel Programming (PDF - 3.3MB) |
17 | Synchronization Without Locks (PDF - 3.3MB) |
18 | Domain Specific Languages and Autotuning (PDF - 4.6MB) |
19 | Leiserchess Codewalk (PDF - 6.4MB) |
20 | Speculative Parallelism & Leiserchess (PDF - 6.2MB) |
21 | Tuning a TSP Algorithm (PDF - 1.6MB) |
22 | Graph Optimization (PDF - 2.9MB) |
23 | High Performance in Dynamic Languages (PDF) |