6. Coordinate Spaces

Written by Marius Horga & Caroline Begbie

To easily find a point on a grid, you need a coordinate system. For example, if the grid happens to be your iPhone 13 screen, the center point might be `x: 195, y: 422`. However, that point may be different depending on what space it’s in.

In the previous chapter, you learned about matrices. By multiplying a vertex’s position by a particular matrix, you can convert the vertex position to a different coordinate space. There are typically six spaces a vertex travels through as it makes its way down the pipeline:

• Object
• World
• Camera
• Clip
• NDC (Normalized Device Coordinate)
• Screen

Since this is starting to read like a description of Voyager leaving our solar system, let’s have a quick conceptual look at each coordinate space before attempting the conversions.

Object Space

If you’re familiar with the Cartesian coordinate system, you know that it uses a pair of coordinates to map a point’s location. The following image shows a 2D grid with the possible vertices of the dog mapped using Cartesian coordinates.

The positions of the vertices are in relation to the dog’s origin, which is located at `(0, 0)`. The vertices in this image are located in object space (or local or model space). In the previous chapter, `Triangle` held an array of vertices in object space, describing the position of each point of the triangle.

World Space

In the following image, the direction arrows mark the world’s origin at `(0, 0, 0)`. So, in world space, the dog is at `(1, 0, 1)` and the cat is at `(-1, 0, -2)`.

Camera Space

Enough about the cat. Let’s move on to the dog. For him, the center of the universe is the person holding the camera. So, in camera space (or view space), the camera is at `(0, 0, 0)` and the dog is approximately at `(-3, -2, 7)`. When the camera moves, it stays at `(0, 0, 0)`, but the positions of the dog and cat move relative to the camera.

Clip Space

The main reason for doing all this math is to project with perspective. In other words, you want to project a three-dimensional scene onto a two-dimensional surface. Clip space is a distorted cube that’s ready for flattening.

NDC (Normalized Device Coordinate) Space

Projection into clip space creates a half cube of `w` size. During rasterization, the GPU divides `x`, `y` and `z` by `w` to produce normalized coordinates between `-1` and `1` for the `x`- and `y`-axes and between `0` and `1` for the `z`-axis.

Screen Space

Now that the GPU has a normalized cube, it flattens the result into two dimensions and converts everything into screen coordinates, ready to display on the device’s screen.

Converting Between Spaces

To convert from one space to another, you can use transformation matrices. In the following image, the vertex on the dog’s ear is `(-1, 4, 0)` in object space. But in world space, the origin is different, so the vertex — judging from the image — is at about `(0.75, 1.5, 1)`.
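To make the conversion concrete, here’s a minimal sketch in Python (rather than the chapter’s Swift and simd types) of a translation matrix moving that object-space vertex into world space. The model matrix values are hypothetical, chosen so the result matches the position read off the image:

```python
# Column-major 4x4 translation applied to a homogeneous point,
# mirroring what a model matrix does to each vertex.

def translation_matrix(tx, ty, tz):
    # Matrix stored as a list of columns; the translation sits in
    # the last column, as in simd's matrix_float4x4.
    return [
        [1, 0, 0, 0],
        [0, 1, 0, 0],
        [0, 0, 1, 0],
        [tx, ty, tz, 1],
    ]

def transform(matrix, point):
    # matrix * point, column-major: each row sums over the columns.
    return [
        sum(matrix[col][row] * point[col] for col in range(4))
        for row in range(4)
    ]

# Vertex on the dog's ear in object space (w = 1 for positions).
vertex = [-1, 4, 0, 1]
# Hypothetical model matrix moving the dog so the ear lands at
# the world-space position suggested by the image.
model = translation_matrix(1.75, -2.5, 1)
world = transform(model, vertex)
print(world[:3])  # [0.75, 1.5, 1]
```

The same `transform` routine works for any 4×4 matrix, which is why chaining rotations and scales is just more matrix multiplications.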

Coordinate Systems

Different graphics APIs use different coordinate systems. You already found out that Metal’s NDC (Normalized Device Coordinates) uses `0` to `1` on the `z`-axis. You also may already be familiar with OpenGL, which uses `-1` to `1` on the `z`-axis.
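If you ever port projection code between the two APIs, the depth remap is a one-liner. A small sketch (in Python, for illustration only):

```python
# Remapping a depth value from OpenGL's NDC z range of [-1, 1]
# to Metal's [0, 1] -- a common adjustment when porting
# projection matrices between the two APIs.

def gl_z_to_metal_z(z_gl):
    return z_gl * 0.5 + 0.5

print(gl_z_to_metal_z(-1.0))  # 0.0 (near plane)
print(gl_z_to_metal_z(1.0))   # 1.0 (far plane)
```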

The Starter Project

With a better understanding of coordinate systems and spaces, you’re ready to start creating matrices.

Uniforms

Constant values that are the same across all vertices or fragments are generally referred to as uniforms. The first step is to create a uniform structure to hold the conversion matrices. After that, you’ll apply the uniforms to every vertex.

``````
#import <simd/simd.h>
``````
``````
typedef struct {
  matrix_float4x4 modelMatrix;
  matrix_float4x4 viewMatrix;
  matrix_float4x4 projectionMatrix;
} Uniforms;
``````

The Model Matrix

Your train vertices are currently in object space. To convert these vertices to world space, you’ll use `modelMatrix`. By changing `modelMatrix`, you’ll be able to translate, scale and rotate your train.

``````
var uniforms = Uniforms()
``````
``````
let translation = float4x4(translation: [0.5, -0.4, 0])
let rotation =
  float4x4(rotation: [0, 0, Float(45).degreesToRadians])
uniforms.modelMatrix = translation * rotation
``````
``````
renderEncoder.setVertexBytes(
  &uniforms,
  length: MemoryLayout<Uniforms>.stride,
  index: 11)
``````
``````
#import "Common.h"
``````
``````
vertex VertexOut vertex_main(
  VertexIn in [[stage_in]],
  constant Uniforms &uniforms [[buffer(11)]])
{
  float4 position = uniforms.modelMatrix * in.position;
  VertexOut out {
    .position = position
  };
  return out;
}
``````

View Matrix

To convert between world space and camera space, you set a view matrix. Depending on how you want to move the camera in your world, you can construct the view matrix appropriately. The view matrix you’ll create here is a simple one, best for FPS (First Person Shooter) style games.
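As a sketch of why the `.inverse` you’re about to use works: for a camera that has only been translated, the view matrix is simply a translation by the negated offset, so moving the camera right makes everything else appear to move left. This Python illustration (not the book’s Swift code) stores matrices as lists of columns, like simd:

```python
# A pure-translation camera transform and its inverse, which
# serves as the view matrix.

def translation_matrix(tx, ty, tz):
    # Columns of a 4x4 matrix; translation in the last column.
    return [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 0], [tx, ty, tz, 1]]

def invert_translation(m):
    # For a pure translation, the inverse just negates the offset.
    tx, ty, tz, _ = m[3]
    return translation_matrix(-tx, -ty, -tz)

def transform(matrix, point):
    # matrix * point, column-major.
    return [sum(matrix[col][row] * point[col] for col in range(4))
            for row in range(4)]

camera = translation_matrix(0.8, 0, 0)   # camera moved right by 0.8
view = invert_translation(camera)        # the view matrix
# The world origin, seen from the camera, ends up at x = -0.8.
print(transform(view, [0, 0, 0, 1])[:3])
```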

``````
uniforms.viewMatrix = float4x4(translation: [0.8, 0, 0]).inverse
``````
``````
float4 position = uniforms.modelMatrix * in.position;
``````
``````
float4 position = uniforms.viewMatrix * uniforms.modelMatrix
  * in.position;
``````

``````
renderEncoder.setVertexBytes(
  &uniforms,
  length: MemoryLayout<Uniforms>.stride,
  index: 11)
``````
``````
timer += 0.005
uniforms.viewMatrix = float4x4.identity
let translationMatrix = float4x4(translation: [0, -0.6, 0])
let rotationMatrix = float4x4(rotationY: sin(timer))
uniforms.modelMatrix = translationMatrix * rotationMatrix
``````

Projection

It’s time to apply some perspective to your render to give your scene some depth.

Projection Matrix

➤ Open Renderer.swift, and add this code to `mtkView(_:drawableSizeWillChange:)`:

``````
let aspect =
  Float(view.bounds.width) / Float(view.bounds.height)
let projectionMatrix =
  float4x4(
    projectionFov: Float(70).degreesToRadians,
    near: 0.1,
    far: 100,
    aspect: aspect)
uniforms.projectionMatrix = projectionMatrix
``````
``````
mtkView(
  metalView,
  drawableSizeWillChange: metalView.bounds.size)
``````
``````
float4 position =
  uniforms.projectionMatrix * uniforms.viewMatrix
  * uniforms.modelMatrix * in.position;
``````

``````
uniforms.viewMatrix = float4x4.identity
``````
``````
uniforms.viewMatrix = float4x4(translation: [0, 0, -3]).inverse
``````

``````
renderEncoder.setTriangleFillMode(.lines)
``````

Perspective Divide

Now that you’ve converted your vertices from object space through world space, camera space and clip space, the GPU takes over to convert to NDC coordinates (that’s `-1` to `1` in the `x` and `y` directions and `0` to `1` in the `z` direction). The ultimate aim is to scale all the vertices from clip space into NDC space, and by using the fourth `w` component, that task gets a lot easier.
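Here’s what that perspective divide looks like in isolation, sketched in Python with made-up clip-space values:

```python
# The perspective divide the GPU performs during rasterization:
# each clip-space coordinate is divided by its own w, producing
# normalized device coordinates.

def perspective_divide(clip):
    x, y, z, w = clip
    return [x / w, y / w, z / w]

# Hypothetical clip-space position; after projection, w carries
# the depth-dependent scale factor.
clip_position = [2.0, -1.0, 5.0, 10.0]
print(perspective_divide(clip_position))  # [0.2, -0.1, 0.5]
```

Vertices farther from the camera end up with a larger `w`, so after the divide they land closer to the center of the cube, which is exactly the perspective foreshortening you see on screen.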

NDC to Screen

Finally, the GPU converts from normalized coordinates to whatever the device screen size is. You may already have done something like this at some time in your career when converting between normalized coordinates and screen coordinates.

``````
converted.x = point.x * screenWidth/2  + screenWidth/2
converted.y = point.y * screenHeight/2 + screenHeight/2
``````
``````
converted = matrix * point
``````
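Plugging numbers into the formulas above, a quick Python sketch with a hypothetical 390×844-point screen:

```python
# NDC-to-screen mapping: NDC (-1, -1) maps to one corner of the
# screen and (1, 1) to the opposite corner.

def ndc_to_screen(x, y, width, height):
    return (x * width / 2 + width / 2, y * height / 2 + height / 2)

print(ndc_to_screen(0, 0, 390, 844))    # (195.0, 422.0): the screen center
print(ndc_to_screen(-1, -1, 390, 844))  # (0.0, 0.0)
```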

Refactoring the Model Matrix

Currently, you set all the matrices in `Renderer`. Later, you’ll create a `Camera` structure to calculate the view and projection matrices.

``````
struct Transform {
  var position: float3 = [0, 0, 0]
  var rotation: float3 = [0, 0, 0]
  var scale: Float = 1
}
``````
``````
extension Transform {
  var modelMatrix: matrix_float4x4 {
    let translation = float4x4(translation: position)
    let rotation = float4x4(rotation: rotation)
    let scale = float4x4(scaling: scale)
    let modelMatrix = translation * rotation * scale
    return modelMatrix
  }
}
``````
``````
protocol Transformable {
  var transform: Transform { get set }
}
``````
``````
extension Transformable {
  var position: float3 {
    get { transform.position }
    set { transform.position = newValue }
  }
  var rotation: float3 {
    get { transform.rotation }
    set { transform.rotation = newValue }
  }
  var scale: Float {
    get { transform.scale }
    set { transform.scale = newValue }
  }
}
``````
``````
class Model: Transformable {
``````
``````
var transform = Transform()
``````
``````
let translation = float4x4(translation: [0.5, -0.4, 0])
let rotation =
  float4x4(rotation: [0, 0, Float(45).degreesToRadians])
uniforms.modelMatrix = translation * rotation
``````
``````
let translationMatrix = float4x4(translation: [0, -0.6, 0])
let rotationMatrix = float4x4(rotationY: sin(timer))
uniforms.modelMatrix = translationMatrix * rotationMatrix
``````
``````
model.position.y = -0.6
model.rotation.y = sin(timer)
uniforms.modelMatrix = model.transform.modelMatrix
``````

Key Points

• Coordinate spaces map different coordinate systems. To convert from one space to another, you can use matrix multiplication.
• Model vertices start off in object space. These are generally held in the file that comes from your 3D app, such as Blender, but you can procedurally generate them too.
• The model matrix converts object space vertices to world space. These are the positions that the vertices hold in the scene’s world. The origin at `[0, 0, 0]` is the center of the scene.
• The view matrix moves vertices into camera space. Generally, your matrix will be the inverse of the position of the camera in world space.
• The projection matrix applies three-dimensional perspective to your vertices.

Where to Go From Here?

You’ve covered a lot of mathematical concepts in this chapter without diving too far into the underlying mathematical principles. To get started in computer graphics, you can fill your transform matrices and continue multiplying them at the usual times, but to be sufficiently creative, you’ll need to understand some linear algebra. A great place to start is Grant Sanderson’s Essence of Linear Algebra at https://bit.ly/3iYnkN1. This video series treats vectors and matrices visually. You’ll also find some additional references in references.markdown in the resources folder for this chapter.
