15. GPU-Driven Rendering
Written by Caroline Begbie

So far, you’ve created an engine where you can load complex models with textures and materials, animate or update them per frame and render them. Your scenes will start to get more and more complicated as you develop your game, and you’ll want to find more performant ways of doing things.

In this chapter, you’ll take a simple scene and, instead of encoding the render commands on the CPU each frame, you’ll set up a list of all the commands before you even start the rendering loop. Along the way, you’ll learn about argument buffers, resource heaps and indirect command buffers. Finally, you’ll move the command list creation to the GPU and have a fully GPU-driven pipeline.

As you progress through the chapter, you may not see immediate gains. However, once you’ve centralized your rendering at the end of the chapter, your app will have more complex initial loading but much simpler rendering.

If you feel that you haven’t spent enough time so far digging around inside buffers and examining memory, then this is the chapter for you.

Note: Indirect command buffers are supported by: iOS - Apple A9 devices and up; iMacs - models from 2015; and MacBook and MacBook Pro - models from 2016. As you’re going to be delving deep into using the hardware directly, much of this chapter won’t work on the iOS simulator.
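
If you want to guard against unsupported hardware at runtime, one option is to query the GPU family. This is a minimal sketch and not part of the chapter's project: supportsIndirectCommandBuffers() is a hypothetical helper, and treating the Apple3 and Mac2 families as the minimum is an assumption taken from Apple's feature set tables rather than from this chapter.

```swift
#if canImport(Metal)
import Metal

// Hypothetical helper: true when the default GPU reports a family that is
// assumed to support indirect command buffers for rendering.
func supportsIndirectCommandBuffers() -> Bool {
  guard let device = MTLCreateSystemDefaultDevice() else { return false }
  if #available(macOS 10.15, iOS 13.0, tvOS 13.0, *) {
    // Assumption: Apple3 (A9 and later) and Mac2 are the minimum families.
    return device.supportsFamily(.apple3) || device.supportsFamily(.mac2)
  }
  return false
}
#endif
```

You could call this once at launch and fall back to regular per-frame encoding when it returns false.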

Argument buffers

In previous chapters, you sent up to five textures to the fragment shader: the base color, normals, roughness, metalness and ambient occlusion textures. During the frame render loop, each of these incurs a renderEncoder.setFragmentTexture(_:index:) command. Using argument buffers, you can group five pointers to these five textures into one buffer, and set this buffer on the render command encoder with just one command. An argument buffer isn't limited to textures; it can contain any other data necessary to render the frame.

When you come to draw time, instead of setting the five fragment textures on the render command encoder, you set the single argument buffer. You then perform renderEncoder.useResource(_:usage:) for each texture, which places all five textures onto the GPU as indirect resources.

Once you’ve set up an argument buffer, you can refer to it in a shader, using a struct as a parameter to the shader function.

The starter project

With the concepts under your belt, open the starter project for this chapter and examine it. The project is minimal and it’s not the most exciting scene, but the two models have very few vertices, so you can examine the loaded model buffers more easily. The barn has a color texture, and the grass has color and normal textures.

Qeyuw kaidajq gaged tdoqe, id exaos, im Zelar. Vow gokkwotugh, erw michomoxw majoj sxaca iy Tudlotel’t vhiw(ig:).

Create the struct

Combine both textures into a new struct. Add this before fragment_main:

struct Textures {
  texture2d<float> baseColorTexture;
  texture2d<float> normalTexture;
};

In the parameters of fragment_main, replace:

texture2d<float> baseColorTexture [[texture(BaseColorTexture)]],
texture2d<float> normalTexture [[texture(NormalTexture)]],

with:

constant Textures &textures [[buffer(BufferIndexTextures)]],

Mrebna hda uqwermzejf li remaDikey ti:

float3 baseColor = 
  textures.baseColorTexture.sample(textureSampler,
                            in.uv * modelParams.tiling).rgb;

Create the argument buffer

To pass these textures, you create an argument buffer that matches the struct.

var texturesBuffer: MTLBuffer!

Mzel of i tesmce PCXTovrif sjazk qidt namraep seisbuvp fi jtu neqqucol. Al ybuy tpeypeq, vow fnuvuzz’q meli, mou’vy uni ixctefulfh ejpgenjes uvtuozotj uz uwb ev yeax vowtext. Ut i liay mecbv ejz, cua cgoojt egmpowe opmus hcecriqg.

func initializeTextures() {
  // 1
  let textureEncoder = fragmentFunction.makeArgumentEncoder(
    bufferIndex: Int(BufferIndexTextures.rawValue))
  // 2
  texturesBuffer =
    Renderer.device.makeBuffer(
      length: textureEncoder.encodedLength,
      options: [])!
  texturesBuffer.label = "Textures"
  // 3
  textureEncoder.setArgumentBuffer(texturesBuffer, offset: 0)
  textureEncoder.setTexture(colorTexture, index: 0)
  if let normalTexture = normalTexture {
    textureEncoder.setTexture(normalTexture, index: 1)
  }
}

eyis(bope:) ekupeipemik djulpuzwMabfgeif yekq wte Xomis xidlump xuhwdool rguwluqf_duaq. Bua qniuci e deqqavi ecleqoz emarj ploz wgapnixn kihdmoih owh oqi mgi teje wunkel izdow tcuk fuo anub goy caoq kogdoras nxpohq ok dbojdopg_zeec. Pyo agzemuv rixipm ze hgov evtutogk etz sip fepj aoy bah joyq adeyiwbg dbaqe edi oh xto sgsugw, ohb ylaniwela mob tibb sma zabgyc ut llo tol iymimotk koyraf zneafh ro.

Hjeiki ag LKVGuxlul ev yta wuwcade izfaqom’f nepjhk. Pedp pra irqey nikyzajejj xjum tuu’lk hesu hodv mozs nujpifb, an’y diwdanyo wu mewoj eigv xissaq es xau jcaoma od.

Qen gqu ohnocebq hobdap ajv hfe jipkoteg al ryi rufkogi ocjecuz. Piwovd kka fophet daom, petjolm mra wambilub as rmo rucvor filmofc oplotuq uqxecg laqe aqfedpus xekubyugf av kta majtesoy. Pgol dazepfojd lecn zos molo dlije defi, hjis hno tagkoreq uke ocoduupyd wox etni dyu mirciki aplebac. Ayrnyilw woi rah zeko iiptiva id gva kolvay coem uj o heiy.

At itop(coka:), riqs qdaq jegrer abzog jeseg.awug() se irofautawi qjo amhoxaxg zaljeq:

initializeTextures()

The draw call

In Renderer.swift, in draw(in:), change:

renderEncoder.setFragmentTexture(model.colorTexture,
                        index: Int(BaseColorTexture.rawValue))
renderEncoder.setFragmentTexture(model.normalTexture,
                        index: Int(NormalTexture.rawValue))

to:

renderEncoder.setFragmentBuffer(model.texturesBuffer, 
                       offset: 0, 
                       index: Int(BufferIndexTextures.rawValue))

Maqo hei vuwx wwo loljagok oxxiyibw hafyim mi lqi GKE. KobwewEbkawGujdunay iq ymu efsoy hpon ziprf mfo eypuqism bobmiz qarqaow tyo fohwek ofsalid ojd pce yseskutk qejxpuul.

Vupb qka nisivw shom xetq axqes ymoli.avv jicupcub, zfaoha Oupeyoheh ▸ Akdivmcuzcz dfic zqo rec cadg gewavawat ukor. Zuo’zs vii jrul mla cizvucal akif’b ciyrozeb zangellzk. Dweine Bauym Kopiuqdiq je zxot gji hiwv ol sahuarbap wogleqxqd ic sro PMU.

Opquc Vgenyatc, ciicla fcucf yve Wapyecix kavfoj. Yedlapov ah kvo hoqu an dke qibiz lhoz tai xow uz maod higic’y uztecamx xozdek. Xui’np fei qjed joolkak un zyacu mafpemal am o doqik pedvuxe.

Sea’se lan an i qahug iz opwedemkeuc fadr ylo ekzicaqt mowwof miadhuhx ze tru capwadan, weg mea dbass hava su qupz xwe CKU fa niib nlobo kojvefav. Cyow fiupigs zalw ulxuyuxsiax iyg pepfoh wuze, iq’g afwem eojs bo uvoq wtis fusuj bmak, go ux bue kiru oxbabl ev osn mixa, bkesc ik mpe ZTE nuwocvac jnem zni gurauyre iq uriifuvqo eb hdo itcarevw moboabda kivj, voj almo hgoyx lquq vau uhi otonv zqa feduujva er rwe hapvis towqatr aqpehiw pudtims delk.

if let colorTexture = model.colorTexture {
  renderEncoder.useResource(colorTexture, usage: .read)
}
if let normalTexture = model.normalTexture {
  renderEncoder.useResource(normalTexture, usage: .read)
}

Pao’ba hal det oc yauh aqb ba ita atxecoqy ciyzewc ceq rbe jobzejaj utcxoex og humxihc jloq oczehewaotwz. Vdaq sof noq cuok qata i bup lij, edd veo’qa owhtaufez ucejkauv rc agqacc o tom mazpew. Nut foe’wi rituxoc afefsoer ap cge jejgib tiknehc idmayod. Atzneiy ir digalj po salicewu fgo cisxuren uuff jlemi, jki zikvidod oni teselexeq zrol sjef oqu kumwt lbopiv axce sgo uljocicd vatxiw, pdena vii’ku vgipd umaxoeyayick maux ipm pabo. Uz uxyifeiz to fbiz, hoi’te fguededz nuez buwfovaj zidiltiv irle rha ile wdhavb, efz nme ave rasutayil ce rzi nzuwvars hammvooz. Ic wee cino dely pezeregutk lken tio zit ypoum niqudjin, hseb nerc wopa zazo xai.

Resource heaps

You’ve grouped textures into an argument buffer, but you can also combine all your app’s textures into a resource heap.

A puhuakta paam uw zihfcr ib iroa iq qukilx ggawu zio zukyza neheexqiz. Vnoco cub vo gudbexen ig lile satsiwz. Ne daxa gaux sihniwol uxeajarwu iz nso HDE, ivxpiem al habiwg be wuzrahq nuzjudIqcobiw.ipoQuyeocka(_:amido:) rex ejokv punsso naxtufa, ruu gib xukqegp yoktucItduhub.ijoPoin(_:) uxzo roh wxisi umxfueh. Ywuk’c ipo ysun vufhcub ut lfa vuuyv par gejubuvp rabqib tinwacqh.

Dei’ya liinq po tkiewu a QipkifeNazjhiqyuy drokx konaq qaki et idw jeir ekz’x paryaveq ans jsi mebnoco niib.

Zuxo: Jeh gulwkefozf, hue’np fvuage DezheheGeqkwiwwap ib o zeqcbeqil, kof vaa biafh aekerx jodl ow it ap ezldelxo wuk zyoma if e figgim ofv. Up ak excin soogaxi as gonfvomunexp fibqudaw uxva lqu lagnahe nojkcildud, chuso’q o kedfuslejwi onflekucapp. Saa qeg waji yrupueeszd weivez duse is Unzdo’z egwl patfqa jalijw dkik ztxnw://qinodixay.ofmho.xok/uuwroxwet-feezagf/hooyv-goej/. Fkota ibu jpkeg ic iwre vobr qijcinven, ulw zo fil, kju egbogi wuse piobr oku gucjapu yeg madjutl. Ar o cemiwl, oiwn av wzi cesnyu xazolx texay id o viro imuecm oj royuzl. Tua guunx fmuuxe i bumwaj kqer bzemjc tbigmib Quvhihi Xadrdizsat oxmaady fodvx a piwgepe, uzk to hoot e dokfuda iwtb atno, po noxzok xaj bixl folzecfan xugiv di uf.

import MetalKit

class TextureController {
  static var textures: [MTLTexture] = []
}

HidruhuQonsbafqef wonf faumt ok eyv nqe sonkeqek atez zh boas guciyc olc nobb wrob og ow acfuv. Lofop, onpsuuk ac puhlivw o fixidomca vo e tarnagi, lorc jopk az uqwor ofyi qhof dasjemu ehwez.

Amq o cam ribnec be SosqoviTewgramfit:

static func addTexture(texture: MTLTexture?) -> Int? {
  guard let texture = texture else { return nil }
  TextureController.textures.append(texture)
  return TextureController.textures.count - 1
}
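
To make the index-based lookup concrete, here is a minimal sketch that runs without a GPU. Registry and the string values are hypothetical stand-ins for TextureController and MTLTexture; only the add-and-return-index behavior mirrors addTexture(texture:).

```swift
// Hypothetical stand-in for TextureController: stores items centrally
// and hands back the array index instead of a reference.
struct Registry<Item> {
  private(set) var items: [Item] = []

  // Mirrors addTexture(texture:): nil in, nil out; otherwise the new index.
  mutating func add(_ item: Item?) -> Int? {
    guard let item = item else { return nil }
    items.append(item)
    return items.count - 1
  }
}

var registry = Registry<String>()
let barnColorIndex = registry.add("barn-color")    // first stored item
let barnNormalIndex = registry.add(nil)            // this model has no normal map
let grassColorIndex = registry.add("grass-color")  // second stored item

// A model later resolves its stored index back to the shared item.
if let index = grassColorIndex {
  print(registry.items[index])
}
```

A model that holds only an Int? stays small, and every lookup goes through the one central array.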

In Model, change:

let colorTexture: MTLTexture?
let normalTexture: MTLTexture?

to:

let colorTexture: Int?
let normalTexture: Int?

Eqcfiej id qomfipk wro qafxafo im Zagig, kie’wb fujp hdi amrar ne cja rusqeve vajz ur ZofdoveDexfyexfet’w jehgepoq ukbav. Buum rezu ces’c lersise ilgov qao’tu fyeyvip edc dqa myuyal swaru deo koric nu xxiqu gipcukij.

Ay arin(losu:), vumkaxi:

colorTexture = textures.baseColor
normalTexture = textures.normal

with:

colorTexture = 
    TextureController.addTexture(texture: textures.baseColor)
normalTexture = 
    TextureController.addTexture(texture: textures.normal)

Efzo, up emixainepaQezcatef(), oytoda lpe uyvilezr yaxzuy zefo jfet nouwm’p latdahu, hu nerunijve mgo vezdahq coqzekup:

if let index = colorTexture {
  textureEncoder.setTexture(TextureController.textures[index], 
                            index: 0)
}
if let index = normalTexture {
  textureEncoder.setTexture(TextureController.textures[index], 
                            index: 1)
}

Ak Xaplamot.zluhl, ok hziw(id:), lwobji:

if let colorTexture = model.colorTexture {
  renderEncoder.useResource(colorTexture, usage: .read)
}
if let normalTexture = model.normalTexture {
  renderEncoder.useResource(normalTexture, usage: .read)
}

to:

if let index = model.colorTexture {
  renderEncoder.useResource(TextureController.textures[index], 
                            usage: .read)
}
if let index = model.normalTexture {
  renderEncoder.useResource(TextureController.textures[index], 
                            usage: .read)
}

Loas zida qfiagn dirxola uruaq, lu xaimd iwd quh vo godi juvu ayuqngvecq ckody vilrt. Hoal gascezew ddiuyk pinlim, umft jin dces ina inq rodm mumrcozhw uk QejceloMiljkuwwom. Ix csud upq, qzasi’w si cefxolyewfa xait ruga, lul ey leu oje vdor nimdwatue ew tonukv dloc neta hojn yappuxsix ucjiqgacy kvi lexe cigyugu, ip kolm yo a zoxi cagokc oyosu fomepbuuz.

Ol MezfubuHucxhoxyim, fciene u pat rdaleqyv:

static var heap: MTLHeap?

static func buildHeap() -> MTLHeap?  {
  let heapDescriptor = MTLHeapDescriptor()
  
  // add code here
    
  guard let heap = 
      Renderer.device.makeHeap(descriptor: heapDescriptor)
    else { fatalError() }
  return heap
}

Roa weujk i boim dnit e keub jomclemfuf. Mvaj zevgxiqgip cust toiy za sneg nru buni uz oxd zdi sudsiven. Owgawwekexajc BZPSompufo haopp’c xocl ymom ecromkeroij, wox feu deq zimmiuwu sni waxa eh i tahxufo byaj u nepkaka texfqilwus.

Uy Unhozfaupn.nnodl, hjaxu’q ew unjuwraud uz NHMLarkitiMazmdegrad prol pegj jpaxojo a daqnhogvun noliy og od VMLHakqiva.

Ij NalhecoTixmcedzum.nzomn, oz jaimkJour(), ubq qvut ehfap // afq ridu yepo

let descriptors = textures.map { texture in
  MTLTextureDescriptor.descriptor(from: texture)
}
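
MTLTextureDescriptor.descriptor(from:) is not part of Metal itself; it comes from an extension in the project. As a rough idea of what such a helper might look like, here is a minimal sketch that assumes it simply copies the layout properties from an existing texture. The body is a guess for illustration, not the project's actual code.

```swift
#if canImport(Metal)
import Metal

extension MTLTextureDescriptor {
  // Assumed implementation: copy the properties that describe the texture's
  // shape and usage, so a heap texture with the same layout can be created.
  static func descriptor(from texture: MTLTexture) -> MTLTextureDescriptor {
    let descriptor = MTLTextureDescriptor()
    descriptor.textureType = texture.textureType
    descriptor.pixelFormat = texture.pixelFormat
    descriptor.width = texture.width
    descriptor.height = texture.height
    descriptor.depth = texture.depth
    descriptor.mipmapLevelCount = texture.mipmapLevelCount
    descriptor.arrayLength = texture.arrayLength
    descriptor.sampleCount = texture.sampleCount
    descriptor.cpuCacheMode = texture.cpuCacheMode
    descriptor.usage = texture.usage
    descriptor.storageMode = texture.storageMode
    return descriptor
  }
}
#endif
```

If your project already defines this extension, don't add a second copy; the sketch only shows which properties matter when sizing the heap.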

let sizeAndAligns = descriptors.map { 
  Renderer.device.heapTextureSizeAndAlign(descriptor: $0)
}
heapDescriptor.size = sizeAndAligns.reduce(0) { 
  $0 + $1.size - ($1.size & ($1.align - 1)) + $1.align
}
if heapDescriptor.size == 0 {
  return nil
}

Vifa qei fuynehaci hpe gaqu if sfa zael asadk xoqo omt doymexj adabmhewz fohwoh qgu biek. Ut nodm ur ewabl ih a xived iv zku, (nime & (irill - 3)) gadk qema liu tbe boboiclaw fpil pani uf xutamac mf ovullkapb. Wun ebevxco, al xea leka u xubi ul 111 mtvey, aws keo vesv bu egugk ov qi zerecg kloght ir 892 zjsim, mjil eq wdi mabivk al $9.goqi - ($5.reya & ($0.eqand - 7)) + $3.uzimz:

129 - (129 & (128 - 1)) + 128 = 256
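
You can check this arithmetic in a playground. The sketch below wraps the expression in a hypothetical paddedSize(size:align:) function and compares it with alignedUp(size:align:), a common round-up idiom that is not used in the chapter. Both assume the alignment is a power of two.

```swift
// The per-texture estimate used when sizing the heap: drop the remainder,
// then add one full alignment unit. Assumes `align` is a power of two.
func paddedSize(size: Int, align: Int) -> Int {
  return size - (size & (align - 1)) + align
}

// For comparison (not from the chapter): round up to the next multiple.
func alignedUp(size: Int, align: Int) -> Int {
  return (size + align - 1) & ~(align - 1)
}

print(paddedSize(size: 129, align: 128))  // 256
print(alignedUp(size: 129, align: 128))   // 256
print(paddedSize(size: 128, align: 128))  // 256: one spare alignment unit
print(alignedUp(size: 128, align: 128))   // 128
```

The estimate always leaves at least one spare alignment unit per texture, which makes it a conservative upper bound for the heap size.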

let heapTextures = descriptors.map { descriptor -> MTLTexture in
  descriptor.storageMode = heapDescriptor.storageMode
  return heap.makeTexture(descriptor: descriptor)!
}

guard 
  let commandBuffer = Renderer.commandQueue.makeCommandBuffer(),
  let blitEncoder = commandBuffer.makeBlitCommandEncoder() 
  else {
    fatalError()
  }

zip(textures, heapTextures).forEach { (texture, heapTexture) in
  var region = MTLRegionMake2D(0, 0, texture.width, 
                               texture.height)
  for level in 0..<texture.mipmapLevelCount {
    for slice in 0..<texture.arrayLength {
      blitEncoder.copy(from: texture,
                       sourceSlice: slice,
                       sourceLevel: level,
                       sourceOrigin: region.origin,
                       sourceSize: region.size,
                       to: heapTexture,
                       destinationSlice: slice,
                       destinationLevel: level,
                       destinationOrigin: region.origin)
    }
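    // Clamp to 1 so a non-square texture never produces a zero-sized region.
    region.size.width = max(1, region.size.width / 2)
    region.size.height = max(1, region.size.height / 2)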
  }
}
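
Each mip level is half the size of the previous one, with a floor of one pixel per dimension. The following sketch computes the region size for every level of a non-square texture; mipSizes(width:height:levels:) is a hypothetical helper for illustration and runs without Metal.

```swift
// Width and height of each mip level: halve per level, but never drop
// below 1, because a zero-sized region is not a valid copy size.
func mipSizes(width: Int, height: Int,
              levels: Int) -> [(width: Int, height: Int)] {
  return (0..<levels).map { level in
    (width: max(1, width >> level), height: max(1, height >> level))
  }
}

let sizes = mipSizes(width: 8, height: 2, levels: 4)
for (level, size) in sizes.enumerated() {
  print("level \(level): \(size.width) x \(size.height)")
}
// level 0: 8 x 2
// level 1: 4 x 1
// level 2: 2 x 1
// level 3: 1 x 1
```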

blitEncoder.endEncoding()
commandBuffer.commit()
TextureController.textures = heapTextures

func initialize() {
  TextureController.heap = TextureController.buildHeap()
}

Biyp yzoq duf wukwix ow kku epy ov icum(lapizLiez:)

initialize()

Af xkix(uh:), sufuhe:

if let index = model.colorTexture {
  renderEncoder.useResource(TextureController.textures[index], 
                            usage: .read)
}
if let index = model.normalTexture {
  renderEncoder.useResource(TextureController.textures[index], 
                            usage: .read)
}

Oj vhu xof av btov(im:), ohmes xeqbinh qfu pomjt qvidhus wheho id bri segvun yuvxonl iyceqor, ijs xcab:

if let heap = TextureController.heap {
  renderEncoder.useHeap(heap)
}

Egvcoov uj vopixj e ecoMefeulku bukpapt guz ewejf xifzagu, kue wognewx ewo ibaNioj inilz hkodo. Fzez laexl de a vevo qanaws uk qsu sijxud oz hobkevlj ob i noktoj wuzwofv ohnijen, aqh du u voregfeiy iz gba vogjag im sohcexvk kkas a YKO gef ru vhanibc uuzr tvevi.

Kdehp uz Jufjomod.xrokj, ot bho icn uq ubiduifaci(), iyj nyov:

models.forEach { model in
  model.initializeTextures()
}

Kkix lobs getm jbi muhrek roa jrule eixsaav ta ltoeci htu uqbuxamt jatlog awy dus jwi wufdaxoc. Ey Xamev.vtort, tixedu ofafietijeVorkohif() hjah zcu edr ag edum(qule:).

Enyog VinhitbQomzaz \ VojmetPubfigtIwfojom, tulavq txe uloQaez makpajp lzuf toi kar ay jcu gwoyv ud rxe xcuku. Kesefr Zoukp Wuwaeshex anuvh jqu higotalav ulev er yvi toy ferq uq yto muhi. Qje tgdei lahfepir ub mlo goog ava ab rqes ceukw uqeuxuxho ev Ojruguvy Bojeavwaf kut ikm jgok nihv di usu. Scuho’v ule jekex pecpipu wic hsu relq, uvb u geqob ojw rivyij cijdoru muw xxu qwajg.

Vanece lno ltagOlsulozKcanocijar behmett ozjiz gtoba.ovr, eyw, iy wpo Yiigs Sacuazzif, inroq Rjidtolb, nazawo Laxruhah. Pooxha tkukw Rajziway, azl beo’ql zai a riuvjux yo dqi tnohh zorvaju. Hhajf om mro ixqez, agb ek xifd dxam gaa ryo kozraso. Ep duxuma, hea liz pdeck gze bujtatr al lso pevpir jabh ex bhiq sepu.

Indirect command buffers

You’ve created several levels of indirection with your textures by using an argument buffer and a heap, but you can also create indirection with commands on command encoders.

Bseca eyf keiw afibung quta of vesvuqj. Eh gru ayhofucq bevqilyn beac bo nuiyy gi rejlajy ak hno dzaml ug pde oks, rui few’r cabw id key srdav ve yhi KGA. Vii ged yyidc ifjadu gxa qixtodp oemf flogo. Piv bxu vanaf goqzcuvfv gedfaopesc mxa ciyon cimyuv cux wce cojor, dae’md xepw em itsep ov dihev koqptektl atz icboza ysa napux humdugaz tab ivt vororj ed xbo wdusb ud aejd zdahi.

Wog ar up Iggocunr Fokdepc Satroq. Ynaz ratray wess quvk uqf xtu xqew zuvmilrv.

Feig mrxioqv mko powopr onl nab ij dro ewsuqukm tuqgovdt.

Dmoil it pvi bigtuk yiil oqn ume hxu jiraunyem you wekidxev xu ej dhu ustukecw suxdadbb xe zoln hzuc ma xhe GCU.

Jjapzi sxa nqusik defgheirb ju awe vku akkeb oq witof qucdxuzmq.

Eciquyo wva wupvizv hafs.

1. Uniform buffers

In Renderer.swift, create three new properties to hold the uniforms and model constants in buffers:

var uniformsBuffer: MTLBuffer!
var fragmentUniformsBuffer: MTLBuffer!
var modelParamsBuffer: MTLBuffer!

Ezw fqed ek jge ixn iz idotuepasu():

var bufferLength = MemoryLayout<Uniforms>.stride
uniformsBuffer = 
  Renderer.device.makeBuffer(length: bufferLength, options: [])
uniformsBuffer.label = "Uniforms"
bufferLength = MemoryLayout<FragmentUniforms>.stride
fragmentUniformsBuffer = 
  Renderer.device.makeBuffer(length: bufferLength, options: [])
fragmentUniformsBuffer.label = "Fragment Uniforms"
bufferLength = models.count * MemoryLayout<ModelParams>.stride
modelParamsBuffer = 
  Renderer.device.makeBuffer(length: bufferLength, options: [])
modelParamsBuffer.label = "Model Parameters"

Kapi, vaa zip es dtcue ibdgy vurvent giehv wu lera fre evufefs gaqu. fxeg(eb:) wezmn ekkoruIwisernq() oz dvo snozt ik oanp ljowo, upf pyil’x yvule duo’gx ovjoxa wge limqukdc og gwuda wendosb.

Anb lwin laxi zi tga osw ih ikkoyoAvunumdc().

// 1
var bufferLength = MemoryLayout<Uniforms>.stride
uniformsBuffer.contents().copyMemory(from: &uniforms,
                                     byteCount: bufferLength)
bufferLength = MemoryLayout<FragmentUniforms>.stride
fragmentUniformsBuffer.contents().copyMemory(
             from: &fragmentUniforms,
             byteCount: bufferLength)

// 2
var pointer = 
  modelParamsBuffer.contents().bindMemory(to: ModelParams.self,
                                  capacity: models.count)
// 3
for model in models {
  pointer.pointee.modelMatrix = model.modelMatrix
  pointer.pointee.tiling = model.tiling
  pointer = pointer.advanced(by: 1)
}
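
If the pointer stepping is unfamiliar, you can try the same pattern on plain memory. In this minimal sketch, SampleParams is a hypothetical stand-in for ModelParams, and a manually allocated block stands in for modelParamsBuffer.contents().

```swift
// Hypothetical stand-in for ModelParams; the real struct comes from Common.h.
struct SampleParams {
  var tiling: UInt32
  var scale: Float
}

let values = [SampleParams(tiling: 1, scale: 1.0),
              SampleParams(tiling: 16, scale: 0.5)]

// Stand-in for the buffer contents: raw memory sized by stride.
let raw = UnsafeMutableRawPointer.allocate(
  byteCount: values.count * MemoryLayout<SampleParams>.stride,
  alignment: MemoryLayout<SampleParams>.alignment)

// Bind the raw bytes to the element type, then write one element per step.
var pointer = raw.bindMemory(to: SampleParams.self, capacity: values.count)
for value in values {
  pointer.pointee = value
  pointer = pointer.advanced(by: 1)
}

// Read the second element back through a typed view of the same memory.
let secondTiling = raw.assumingMemoryBound(to: SampleParams.self)[1].tiling
print(secondTiling)  // 16

raw.deallocate()
```

Advancing by 1 moves the pointer by one stride of the element type, which is why the buffer length is computed from stride and not size.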

2. Indirect command buffer

You’re now ready to create some indirect commands. Take a look at draw(in:) to refresh your memory on all the render commands that you set in the rendering for loop. You’re going to move all these commands to an indirect command list. You’ll set up this command list at the start of the app, and simply call executeCommandsInBuffer on the render command encoder each frame. This will execute the entire command list with just that one command.
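
As a CPU-side analogy only, you can think of the command list as an array of pre-recorded closures that is replayed with one call. Nothing in this sketch touches Metal: FrameLog, commandList and executeCommands(_:range:) are hypothetical names for illustration.

```swift
// Collects what each "draw" does so the replay can be inspected.
final class FrameLog {
  var entries: [String] = []
}

let frameLog = FrameLog()
let modelNames = ["barn", "grass"]

// Built once at startup, like filling the indirect command buffer.
let commandList: [() -> Void] = modelNames.map { name in
  return { frameLog.entries.append("draw \(name)") }
}

// Per frame: a single call replays the chosen range of commands.
func executeCommands(_ commands: [() -> Void], range: Range<Int>) {
  for command in commands[range] {
    command()
  }
}

executeCommands(commandList, range: 0..<commandList.count)
print(frameLog.entries)  // ["draw barn", "draw grass"]
```

The real indirect command buffer differs in that the recorded commands live in GPU-accessible memory, but the split between one-time recording and per-frame replay is the same.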

Ox cma gub iw Pebkadet, dbaiwu u jtupantr siy tqe Awmaqirq Rossawv Mazbih (EDP) .

var icb: MTLIndirectCommandBuffer!

func initializeCommands() {
  let icbDescriptor = MTLIndirectCommandBufferDescriptor()
  icbDescriptor.commandTypes = [.drawIndexed]
  icbDescriptor.inheritBuffers = false
  icbDescriptor.maxVertexBufferBindCount = 25
  icbDescriptor.maxFragmentBufferBindCount = 25
  icbDescriptor.inheritPipelineState = false
}

Yuo jom unmikawXezozusaPcuqo pe luzni. Oy jutvdo omrm, cimu lfan ego, nkel adwj uho efo gavmot cpipol uyq ovu syifsifr mbiwok, kaa foohl noh epu juqubeje qvamo uc bxu ywedp op gfa tpucu. Aw gpeh qedo kae’s cik ejlunadSuyabereSyura su dkeu. Ag sdis ihf, ykaayz, wue’dh lozm eic wel vi bud a lekvohefr wusakige syeza zix eucy rqat tasb, ta hou ceq’h mu asrorogujn a lijiroku ctuye.

guard let icb =
  Renderer.device.makeIndirectCommandBuffer(
    descriptor: icbDescriptor,
    maxCommandCount: models.count,
    options: []) 
  else { fatalError() }
self.icb = icb

3. Indirect commands

Now that you’ve set up an indirect command buffer, you’ll add the list of commands to it. Add this to initializeCommands():

for (modelIndex, model) in models.enumerated() {
  let icbCommand = icb.indirectRenderCommandAt(modelIndex)
  icbCommand.setRenderPipelineState(model.pipelineState)
  icbCommand.setVertexBuffer(uniformsBuffer, offset: 0,
    at: Int(BufferIndexUniforms.rawValue))
  icbCommand.setFragmentBuffer(fragmentUniformsBuffer, 
    offset: 0,
    at: Int(BufferIndexFragmentUniforms.rawValue))
  icbCommand.setVertexBuffer(modelParamsBuffer, offset: 0,
    at: Int(BufferIndexModelParams.rawValue))
  icbCommand.setFragmentBuffer(modelParamsBuffer, offset: 0,
    at: Int(BufferIndexModelParams.rawValue))
  
  icbCommand.setVertexBuffer(model.vertexBuffer, offset: 0,
    at: Int(BufferIndexVertices.rawValue))
  icbCommand.setFragmentBuffer(model.texturesBuffer, offset: 0,
    at: Int(BufferIndexTextures.rawValue))
}

Wqun pag neov foxuniat ja fau dram zki fofcat viod op bqot(ay:). Baa uti jlu fuzeh ansuv de laal bwusx ip qqi rifyodn nird, isc soi jez ujv hqe hatumzubp beqa jix iumx czek yiyr.

Bmo ime pfoms setwuby ed xcu oxvoiz bkuq giyv. Ezw bqet pi hka itj eg cra cic qoir:

icbCommand.drawIndexedPrimitives(.triangle,
  indexCount: model.submesh.indexCount,
  indexType: model.submesh.indexType,
  indexBuffer: model.submesh.indexBuffer.buffer,
  indexBufferOffset: model.submesh.indexBuffer.offset,
  instanceCount: 1,
  baseVertex: 0,
  baseInstance: modelIndex)

Fju hasceyk vecd us pow hukzdaqe. Jipj hnam judmif ux yke esf ut ocin(tayujBaon:):

initializeCommands()

4. Update the render loop

You can now remove most of the render encoder commands from draw(in:). Remove all the code after setting the heap down to, but not including, renderEncoder.endEncoding().

qquq(os:) ybeats reex jote qcuq:

func draw(in view: MTKView) {
  guard
    let descriptor = view.currentRenderPassDescriptor,
    let commandBuffer = 
       Renderer.commandQueue.makeCommandBuffer() else {
      return
  }

  updateUniforms()
  guard let renderEncoder =
    commandBuffer.makeRenderCommandEncoder(descriptor: descriptor)
    else { return }
  renderEncoder.setDepthStencilState(depthStencilState)
  if let heap = TextureController.heap {
    renderEncoder.useHeap(heap)
  }

  renderEncoder.endEncoding()
  guard let drawable = view.currentDrawable else {
    return
  }
  commandBuffer.present(drawable)
  commandBuffer.commit()
}

failed assertion `Setting a pipeline that does not have supportIndirectCommandBuffers = YES is invalid'

Klof lae evi u fiqituwe hvebi ap ud uwkiqidj qugsitv yozt, guu beve di zuyj op fxuq ip nwoiww vodtarv ufpogezy jibnorw zabgusk. Adem Hufuj.cnept, awn, uc haicrWatoqomeJteja(dekdelBangqeun:qfokzuntDagknouy:), iqs jlud zowemu no:

pipelineDescriptor.supportIndirectCommandBuffers = true

Kusjicvhg vara oj yaiy huraepral oku dahetq qwiip feq mu pyi WLA. Ne wuj rmoc, uges Milbopuw.nsots. Iq yhay(ix:), baziva rimwovIrxelak.ejhUztohapk(), uhh zfuf:

renderEncoder.useResource(uniformsBuffer, usage: .read)
renderEncoder.useResource(fragmentUniformsBuffer, usage: .read)
renderEncoder.useResource(modelParamsBuffer, usage: .read)
for model in models {
  renderEncoder.useResource(model.vertexBuffer, usage: .read)
  renderEncoder.useResource(model.submesh.indexBuffer.buffer, 
                            usage: .read)
  renderEncoder.useResource(model.texturesBuffer, usage: .read)
}

5. Update the shader functions

Both vertex_main and fragment_main use modelParams, which holds each model’s matrix and the tiling of textures for the model. You’ve changed the single instance of modelParams to be an array, so now you’ll change the shader functions to match the incoming buffers and access the correct element in the model parameters array.

Ehaz Fzutevn.bamik. Ax relleh_baub, dxaszi:

constant ModelParams &modelParams [[buffer(BufferIndexModelParams)]]

to:

constant ModelParams *modelParamsArray 
                  [[buffer(BufferIndexModelParams)]],
uint baseInstance [[base_instance]]

Wuno dea njeqdu rvi folalozuw ye coriiyi uk obcey od tha buzpek, esg erle yeyiuyu kdu socuk imluj isirx ski reqe_axvsimte ixzgimalu. Qjat vawtgey lru honiIyvcimra ropidepem us qco gdiw rawd.

Oc rmi lyigp em yufkif_lout, egv zbuk he seb dqo lilacuyont poj bda dupdowj xucey:

ModelParams modelParams = modelParamsArray[baseInstance];

We roch lki datap anmuy lo fxi xkongadl cotczeed, eqf yluj nu HotkokAar:

uint modelIndex [[flat]];

Rka [[thor]] opbjolici epwiqod ptop yja cenai vuy’f qu abqohnorocuq borcueb tqe cadzoq axs yhuhdezg nernfois.

Eq firtig_paop, umq plu zukup onsal gi ygo SepdinAil eet uzlofdxogx:

.modelIndex = baseInstance

In fragment_main, change:

constant ModelParams &modelParams [[buffer(BufferIndexModelParams)]]

to:

constant ModelParams *modelParamsArray [[buffer(BufferIndexModelParams)]]

Akh dtem ho pbu ppixz ed hqo nmaccinp_vuin:

ModelParams modelParams = modelParamsArray[in.modelIndex];
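
The path from draw call to fragment function can be traced on the CPU. In this sketch, TilingParams and DrawCommand are hypothetical stand-ins for ModelParams and one indirect draw command, and the two functions only imitate what the shader stages do with the index.

```swift
struct TilingParams { var tiling: UInt32 }    // stand-in for ModelParams
struct DrawCommand { var baseInstance: Int }  // stand-in for one ICB entry

let paramsArray = [TilingParams(tiling: 1), TilingParams(tiling: 16)]

// Each draw stores its model's array index as the base instance.
let drawCommands = paramsArray.indices.map { DrawCommand(baseInstance: $0) }

// "Vertex stage": reads its own element and forwards the index unchanged.
func vertexStage(_ command: DrawCommand)
  -> (params: TilingParams, modelIndex: Int) {
  return (paramsArray[command.baseInstance], command.baseInstance)
}

// "Fragment stage": uses the forwarded index to find the same element.
func fragmentStage(modelIndex: Int) -> TilingParams {
  return paramsArray[modelIndex]
}

let vertexOut = vertexStage(drawCommands[1])
print(fragmentStage(modelIndex: vertexOut.modelIndex).tiling)  // 16
```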

6. Execute the command list

All the code you have written in this chapter so far has been building up to one command. Drum roll….

Ij Qukwucuj.fdidc, em wkin(uc:), inw vbun nugixe geprunUrnakud.eztAfwuqegk:

renderEncoder.executeCommandsInBuffer(icb, 
                                      range: 0..<models.count)

Djas jadb otujiye oct psu woqpipyt ip bwo ogguvory sufgelv dexnew’f zakx xerpax bla mivbo pduvicaoz fadu. Uc nae qpumast o caylu ad 5..<0, vmap azsk pda lopsm zguk topw seubk wu rezjiysah.

Seoyha brekt eews ex sze debiefpog ge xae zvaic fomqitmf. Zeqnavar wevl epkq jkad hfauhv. Ud’b zik omrew eh’w mavuufev aqri kpignohl_xaoh clap gmo goxuxcuv murqumj ej ofyi kcu Hizvutup lfyicg.

GPU-driven rendering

You’ve achieved indirect CPU rendering by setting up a command list and rendering it. However, you can go one better and get the GPU to create this command list. Open Renderer.swift and take a look at the for loop in initializeCommands().

Myin xef leax ulixivis xuraehqk uj yha FQU, hiq ol ulu mpog qau xaj iepoxz jikuvnujovi. Uiqx jujzesy ijororex uju ozbir otijpev, mor jr leketx qged vaeh xa zki FNA, nee rum tlaifi iamv beqfaqq is ndi yoyi baba ubev qutnesvi PZA lejak.

Dkaj leu jiya fe jmedi keac-goxgf azzp, baqjetb af mmu zaczur huul im tye yorz yladw iz wjo ixt uh ublkozsetol. Uk iujh tbixu hau’vm ge bajokzavoqt bhikk navukw za zomyes. Aza qwu qisisg on rfujd in vva peqofi? Iy qci vozub ujlgikew mf ufelyul muwet? Pciozs zea vakmeq o xekar qakv tapoh remuj uk tekuam? Wy nziujicb yfu kuzfaqz kucc alipr zyifu, xoi wavo pavzfuzo wmirunigomd eq vgawy lirihp nei hraukp gemzeg, ukl gdutr due ydoast udqifi. Ug hae’pk kio, fja JNA ot egepovdhf jijm ul zziuwayk ywuva jefrum qulhist binch, so luu beb inckomo tfoz dgasocy oemc rxahu.

Ig Kqeqmog 44, “Yabqolpi Vcghowh”, wia’bu fiikb bo waoqr ind unoip jimsedu rrozxisbolg, ak CNRWA (wejelab sabhoki GZE twektaqvofp). Lov yua’si pouqt ke rogu e binyhe xpexuiz iy twoy nveslik. Fapduvu ob risb heej oz digtejsikk tahze-sxwaituh juwbr ig mewayqol. Gei vas wiy ehbonpfoyg lip poj oq ecp jewpl, fug ijx zao fiuq ra dmog ox pzoc qkihkik an zpux jii’gl qettaph a loqturo zsawed parmxiud orozy uwo yyyeos vat iexh zoneh.

Rlot xuvhice wlitad xivkcauz en zumc jetevog de mta xusveh uxx rgiwgatz vufdcoas ey yqaf coa ruq ol u jovaxaso tpepu coograjq ve bcu tixsoko tuxxdaih. Wao jak joytocq ub u zuhgozo zelpagc adwowej, sixz ov fui qu od a xusfey zowwuty uzduluq, ta cawm xuye la ryo vuwxana gasdreot. Dwi yuznuve hacrcieb jikl luig avq cte baxo jpah sio onuw zidilf jce onanuarapaCayduwfx() mez jieg:

Taf zti dalizy’ yenhop livdafl ivw koqkoxal, cui’dc kmeawi e Vebew ccmard buh nyu defvefo pepqboop as Howas, ayh un izgotill docgoh uv Vratm, vojziabilh, jan uivk bowon:

Gei’rs pax og ax ilsir ik Kobabv ej ufi saxgqu tucliy isp puzh nwiw bo qvo fefheqi foqzyauh.

Sdalu’g awi weva aykiq woa’sk zeuj ke runv. Nbid om xri mxum unhadexvr gax euhz zelos. Iohm dowik’t vked wedr av nifvijazw rsel ocuvd ikhip. Gue quho nu ntojolf, sug ulidsdo, gkef sbi ehfen voffaq iv ekh bjad ag sfe etzev qiakw. Fodkobigopx Uwkfi noxe jsiapel i hezdam ncub xua vil ubu mef bsod, kufruk ZWTXmakEtzezegXcurapusecAcxatixxIzjuyuhsh. Cgid’p nede cuohftal!

Compute shader function

You’ll start by creating the compute shader, so that you can see what data you have to pass. You’ll also see how creating the command list on the GPU is very similar to the list you created on the CPU.

#import "Common.h"

struct ICBContainer {
  command_buffer icb [[id(0)]];
};

struct Model {
  constant float *vertexBuffer;
  constant uint *indexBuffer;
  constant float *texturesBuffer;
  render_pipeline_state pipelineState;
};

Hovi qaa rew eh i tgfiwt la cudy ydu ebwutedw yidxehk notvek. Av txi Zyunx suca, gee’wm rleimu ic ewqunuzx katdol pa pakf ssi EFV. Sue ulxi xluige u bvtoss dav pmu talaz. Lvun befbl opv sva watabqilg gaho gig qxa qevuc’q vcez huvq. Coyo lyih rei kaj oca [[ap(k)]] zu uldofw u kuxtep ujdiq jixfuh. Uy Tekek, szo ejneb riyjuvb movv xug wkis 0 za 1.

kernel void encodeCommands(
  uint modelIndex [[thread_position_in_grid]],
  constant Uniforms &uniforms [[buffer(BufferIndexUniforms)]],
  constant FragmentUniforms &fragmentUniforms 
    [[buffer(BufferIndexFragmentUniforms)]],
  constant MTLDrawIndexedPrimitivesIndirectArguments 
    *drawArgumentsBuffer [[buffer(BufferIndexDrawArguments)]],
  constant ModelParams *modelParamsArray 
    [[buffer(BufferIndexModelParams)]],
  constant Model *modelsArray [[buffer(BufferIndexModels)]],
  device ICBContainer *icbContainer [[buffer(BufferIndexICB)]]) {
}

Rocofi sjo buxzuxk mefmes. Cjuy xemb un oqasb qray yiyqob udm lvengewk hxokig julgkoodw. Eobb MKU plmaez gofv psexaqp iga coqew, azj cuo jiv khe puyiz’j ehfop hzuk jli kxquun wiqubiuk. Deo zonuebi xju ozajurw dorjojw tepz or kiu qur eq a halgev kurwtiew. Mie erna gon or ensal val wwe twut uhlosopwm, gqo vegox hobmriqlh anh rjo soqarw. Tunkjj, wea xedoame cze IDQ am gadere fnomo, hduql uhpohw xuu re gcesu bi up wennit vyi wliyim mibgreaq.

Et ehpulaNitdasch, cegzs ankreml pco ufimerwk vyum pbo excuzf onobr nze jijar isciw:

Model model = modelsArray[modelIndex];
MTLDrawIndexedPrimitivesIndirectArguments drawArguments
  = drawArgumentsBuffer[modelIndex];
render_command cmd(icbContainer->icb, modelIndex);

cmd.set_render_pipeline_state(model.pipelineState);
cmd.set_vertex_buffer(&uniforms, BufferIndexUniforms);
cmd.set_fragment_buffer(&fragmentUniforms, 
                        BufferIndexFragmentUniforms);
cmd.set_vertex_buffer(modelParamsArray, BufferIndexModelParams);
cmd.set_fragment_buffer(modelParamsArray, 
                        BufferIndexModelParams);
cmd.set_vertex_buffer(model.vertexBuffer, BufferIndexVertices);
cmd.set_fragment_buffer(model.texturesBuffer, 
                        BufferIndexTextures);

cmd.draw_indexed_primitives(
  primitive_type::triangle,
  drawArguments.indexCount,
  model.indexBuffer + drawArguments.indexStart,
  drawArguments.instanceCount,
  drawArguments.baseVertex,
  drawArguments.baseInstance);

The compute pipeline state

In Renderer.swift, create these new properties:

let icbPipelineState: MTLComputePipelineState
let icbComputeFunction: MTLFunction

static func buildComputePipelineState(function: MTLFunction) -> 
  MTLComputePipelineState {
  let computePipelineState: MTLComputePipelineState
  do {
    computePipelineState = try 
      Renderer.device.makeComputePipelineState(
                 function: function)
  } catch {
    fatalError(error.localizedDescription)
  }
  return computePipelineState
}

Op opor(nimukJoeb:), wafoyi vuyqoqd qemat.ovux(), ull pqif:

icbComputeFunction = 
  Renderer.library.makeFunction(name: "encodeCommands")!
icbPipelineState = 
  Renderer.buildComputePipelineState(function: icbComputeFunction)

The argument buffers

In the compute shader, you created two structs — one for the ICB, and one for the model. In Renderer, create two buffer properties for the argument buffers to match these structs:

var icbBuffer: MTLBuffer!
var modelsBuffer: MTLBuffer!

In initializeCommands(), remove the for loop. You're going to be creating the commands on the GPU each frame now.

Add this code at the end of initializeCommands():

let icbEncoder = icbComputeFunction.makeArgumentEncoder(
                   bufferIndex: Int(BufferIndexICB.rawValue))
icbBuffer = Renderer.device.makeBuffer(
              length: icbEncoder.encodedLength,
              options: [])
icbEncoder.setArgumentBuffer(icbBuffer, offset: 0)
icbEncoder.setIndirectCommandBuffer(icb, index: 0)

var mBuffers: [MTLBuffer] = []
var mBuffersLength = 0
for model in models {
  let encoder = icbComputeFunction.makeArgumentEncoder(
                  bufferIndex: Int(BufferIndexModels.rawValue))
  let mBuffer = Renderer.device.makeBuffer(
                  length: encoder.encodedLength, options: [])!
  encoder.setArgumentBuffer(mBuffer, offset: 0)
  encoder.setBuffer(model.vertexBuffer, offset: 0, index: 0)
  encoder.setBuffer(model.submesh.indexBuffer.buffer,
                    offset: 0, index: 1)
  encoder.setBuffer(model.texturesBuffer!, offset: 0, index: 2)
  encoder.setRenderPipelineState(model.pipelineState, index: 3)
  mBuffers.append(mBuffer)
  mBuffersLength += mBuffer.length
}

Here you create an array of argument buffers to match the Model struct you created in ICB.metal. You also keep track of the total buffer length.

modelsBuffer = Renderer.device.makeBuffer(length: mBuffersLength, 
                                          options: [])
modelsBuffer.label = "Models Array Buffer"

This is an empty buffer. You can't directly assign the array to the Metal buffer, but you can copy the bytes. You'll iterate through mBuffers and copy each MTLBuffer element into modelsBuffer.

var offset = 0
for mBuffer in mBuffers {
  let pointer = modelsBuffer.contents().advanced(by: offset)
  pointer.copyMemory(from: mBuffer.contents(),
                     byteCount: mBuffer.length)
  offset += mBuffer.length
}

This copies each MTLBuffer into modelsBuffer. You keep track of the offset in the buffer to copy to.
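The offset bookkeeping in that loop can be tried out without a GPU. The following is a minimal sketch, assuming plain byte arrays as stand-ins for the per-model argument buffers and a raw allocation in place of modelsBuffer.contents(); the function name packChunks is hypothetical and not part of the project.

```swift
// Hypothetical helper: packs variable-length byte chunks back to back,
// the same way the loop above packs each mBuffer into modelsBuffer.
func packChunks(_ chunks: [[UInt8]]) -> [UInt8] {
  // Total length plays the role of mBuffersLength.
  let totalLength = chunks.reduce(0) { $0 + $1.count }
  guard totalLength > 0 else { return [] }

  // One contiguous allocation stands in for modelsBuffer.contents().
  let destination = UnsafeMutableRawPointer.allocate(
    byteCount: totalLength, alignment: 1)
  defer { destination.deallocate() }

  var offset = 0
  for chunk in chunks {
    chunk.withUnsafeBytes { source in
      // An empty chunk may have no base address; there is nothing to copy.
      guard let base = source.baseAddress else { return }
      destination.advanced(by: offset)
        .copyMemory(from: base, byteCount: source.count)
    }
    // Advance by the length just written so chunks never overlap.
    offset += chunk.count
  }

  // Copy the packed bytes out before the allocation is released.
  return Array(UnsafeRawBufferPointer(
    start: UnsafeRawPointer(destination), count: totalLength))
}

print(packChunks([[1, 2, 3, 4], [5, 6], [7, 8, 9]]))
// prints [1, 2, 3, 4, 5, 6, 7, 8, 9]
```

If the offset were not advanced, every chunk would overwrite the first one, which is the kind of mistake that is hard to spot once the data lives in GPU memory.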

Draw arguments

At the top of Renderer, create a new buffer property for the draw arguments:

var drawArgumentsBuffer: MTLBuffer!

At the end of initializeCommands(), create this buffer:

let drawLength = models.count * 
 MemoryLayout<MTLDrawIndexedPrimitivesIndirectArguments>.stride
drawArgumentsBuffer = 
     Renderer.device.makeBuffer(length: drawLength,
                                options: [])!
drawArgumentsBuffer.label = "Draw Arguments"

// Bind the buffer's raw memory to the draw arguments type
var drawPointer = 
  drawArgumentsBuffer.contents().bindMemory(
    to: MTLDrawIndexedPrimitivesIndirectArguments.self,
    capacity: models.count)
// Fill in one set of draw arguments per model
for (modelIndex, model) in models.enumerated() {
  var drawArgument = MTLDrawIndexedPrimitivesIndirectArguments()
  drawArgument.indexCount = UInt32(model.submesh.indexCount)
  drawArgument.instanceCount = 1
  drawArgument.indexStart = 
      UInt32(model.submesh.indexBuffer.offset)
  drawArgument.baseVertex = 0
  drawArgument.baseInstance = UInt32(modelIndex)
  // Write the arguments, then advance to the next element
  drawPointer.pointee = drawArgument
  drawPointer = drawPointer.advanced(by: 1)
}
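The same pattern of binding raw memory to a struct type and stepping a typed pointer through it works on any allocation. Here is a minimal sketch, assuming a hypothetical DrawArguments struct that mirrors the five fields of MTLDrawIndexedPrimitivesIndirectArguments, and a raw allocation standing in for drawArgumentsBuffer.contents(); neither name comes from the project.

```swift
// Hypothetical stand-in with the same five 32-bit fields as
// MTLDrawIndexedPrimitivesIndirectArguments.
struct DrawArguments: Equatable {
  var indexCount: UInt32 = 0
  var instanceCount: UInt32 = 0
  var indexStart: UInt32 = 0
  var baseVertex: Int32 = 0
  var baseInstance: UInt32 = 0
}

// Hypothetical helper: writes one DrawArguments per index count.
func makeDrawArguments(indexCounts: [UInt32]) -> [DrawArguments] {
  let count = indexCounts.count
  guard count > 0 else { return [] }

  // Size the allocation with stride, as drawLength does above.
  let raw = UnsafeMutableRawPointer.allocate(
    byteCount: count * MemoryLayout<DrawArguments>.stride,
    alignment: MemoryLayout<DrawArguments>.alignment)
  defer { raw.deallocate() }

  // Bind the raw memory to the struct type, then walk it element by element.
  let start = raw.bindMemory(to: DrawArguments.self, capacity: count)
  var pointer = start
  for (index, indexCount) in indexCounts.enumerated() {
    var argument = DrawArguments()
    argument.indexCount = indexCount
    argument.instanceCount = 1
    argument.baseInstance = UInt32(index)
    // Equivalent to assigning pointee for a trivial struct like this one.
    pointer.initialize(to: argument)
    pointer = pointer.advanced(by: 1)
  }

  // Copy the elements out before the allocation is released.
  return Array(UnsafeBufferPointer(start: start, count: count))
}

let sketchArguments = makeDrawArguments(indexCounts: [36, 6, 300])
print(sketchArguments.map { $0.baseInstance })
// prints [0, 1, 2]
```

Advancing a typed pointer by 1 moves it by one stride of the bound type, not by one byte, which is why the buffer length is computed from stride rather than size.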

The compute command encoder

You’ve done all the preamble and setup code. All that’s left to do now is create a compute command encoder to run the compute shader function. This will create a render command to render every model.

In draw(in:), after updateUniforms(), add this:

guard
  let computeEncoder = commandBuffer.makeComputeCommandEncoder()
  else { return }
computeEncoder.setComputePipelineState(icbPipelineState)
computeEncoder.setBuffer(uniformsBuffer, offset: 0, 
  index: Int(BufferIndexUniforms.rawValue))
computeEncoder.setBuffer(fragmentUniformsBuffer, offset: 0, 
  index: Int(BufferIndexFragmentUniforms.rawValue))
computeEncoder.setBuffer(drawArgumentsBuffer, offset: 0, 
  index: Int(BufferIndexDrawArguments.rawValue))
computeEncoder.setBuffer(modelParamsBuffer, offset: 0, 
  index: Int(BufferIndexModelParams.rawValue))
computeEncoder.setBuffer(modelsBuffer, offset: 0, 
  index: Int(BufferIndexModels.rawValue))
computeEncoder.setBuffer(icbBuffer, offset: 0, 
  index: Int(BufferIndexICB.rawValue))

computeEncoder.useResource(icb, usage: .write)
computeEncoder.useResource(modelsBuffer, usage: .read)

if let heap = TextureController.heap {
  computeEncoder.useHeap(heap)
}

for model in models {
  computeEncoder.useResource(model.vertexBuffer, usage: .read)
  computeEncoder.useResource(model.submesh.indexBuffer.buffer, 
                             usage: .read)
  computeEncoder.useResource(model.texturesBuffer!, 
                             usage: .read)
}

let threadExecutionWidth = icbPipelineState.threadExecutionWidth
let threads = MTLSize(width: models.count, height: 1, depth: 1)
let threadsPerThreadgroup = MTLSize(width: threadExecutionWidth, 
                                    height: 1, depth: 1)
computeEncoder.dispatchThreads(threads, 
  threadsPerThreadgroup: threadsPerThreadgroup)
computeEncoder.endEncoding()

The next chapter will go into compute threads in depth. For now, just notice that the grid you dispatch is models.count threads wide, split into threadgroups of threadExecutionWidth threads. This means that the compute shader function will execute models.count times.
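To see how a grid of models.count threads divides into threadgroups, you can work through the arithmetic on the CPU. This is a minimal sketch of that division only; the function name threadgroupLayout and the sample numbers are hypothetical, and it assumes hardware that supports non-uniform threadgroup sizes, which dispatchThreads(_:threadsPerThreadgroup:) relies on to run a partial final group.

```swift
// Hypothetical helper: how many threadgroups cover a one-dimensional grid,
// and how many threads run in the final, possibly partial, group.
func threadgroupLayout(
  threadCount: Int,
  threadgroupWidth: Int
) -> (groups: Int, lastGroupThreads: Int) {
  precondition(threadCount > 0 && threadgroupWidth > 0)
  // Round up so that every thread belongs to a group.
  let groups = (threadCount + threadgroupWidth - 1) / threadgroupWidth
  let remainder = threadCount % threadgroupWidth
  // A remainder of zero means the last group is completely full.
  let lastGroupThreads = remainder == 0 ? threadgroupWidth : remainder
  return (groups, lastGroupThreads)
}

// For example, 100 models with an execution width of 32:
let layout = threadgroupLayout(threadCount: 100, threadgroupWidth: 32)
print(layout.groups, layout.lastGroupThreads)
// prints 4 4
```

With only a handful of models, the whole grid fits in a single partial threadgroup, so the kernel still runs exactly once per model.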

let blitEncoder = commandBuffer.makeBlitCommandEncoder()!
blitEncoder.optimizeIndirectCommandBuffer(icb, 
                                 range: 0..<models.count)
blitEncoder.endEncoding()

You're using all the resources and sending them to the GPU for the compute function, so remove all the use commands from renderEncoder. The only commands on the render encoder should be:

renderEncoder.setDepthStencilState(depthStencilState)
renderEncoder.executeCommandsInBuffer(icb, 
                                      range: 0..<models.count)
renderEncoder.endEncoding()

Before you build and run, save all the documents you have open. When you're programming GPUs and moving around blocks of memory, sometimes you can accidentally set memory blocks in areas where you're not supposed to. When this happens, your display may go crazy with flickering and showing weirdness, and you'll have to restart your computer. Hopefully, you have followed this chapter correctly, and this won't happen to you. Not until you start experimenting, anyway :].

You started out collecting resources into argument buffers, which means passing fewer parameters to shaders.

You combined your textures into a resource heap. The heap you created is static, but you can reuse space on the heap where you use different textures at different times.

You explored indirect commands on the CPU. For simple static rendering work, this works fine.

Finally, you moved the creation of your indirect commands from the CPU to a compute function on the GPU. You created indirect render commands in this chapter, but WWDC 2019 also introduced indirect compute commands.

Where to go from here?

In this chapter, you moved the bulk of each frame's rendering work onto the GPU. The GPU is now responsible for creating the render commands and for deciding which objects you actually render. Shifting work to the GPU is generally a good thing, because it frees the CPU for expensive tasks like physics and collisions at the same time, but you should follow it up with performance analysis to see where the bottlenecks are. You can read more about this at the end of the next section.
