Programming Parallel Computers

Lecture 4

The lecture videos are available on both Panopto and YouTube, in up to 4K resolution, with English and Finnish subtitles. The slides are also available in PDF format.

Lectures

YouTube playlist with all parts of the lecture.

Part 4A: GPU programming (13 min)

Part 4B: GPU programming with CUDA (18 min)

Part 4C: Memory access patterns in CUDA programs (15 min)

Topics covered

Terminology

English                          Finnish
block                            lohko
block index                      lohkon indeksi
kernel                           ydinfunktio
memory request                   muistihaku
out-of-order execution           epäjärjestyksessä suorittaminen
sequential code                  perättäiskoodi
streaming multiprocessor (SM)    SM-suoritin
thread index                     säikeen indeksi
warp                             warp

Additional material