5. Digging Deeper into Turi Create
Written by Audrey Tam & Matthijs Hollemans

Heads up... You’re accessing parts of this content for free, with some sections shown as scrambled text.

Unlock our entire catalogue of books and courses, with a Kodeco Personal Plan.
Unlock now

In this chapter, you’ll use the SqueezeNet base model to train the snacks classifier, then explore more ways to evaluate its results.

You’ll also try to improve the model’s accuracy, first with more iterations, then by tweaking some of the underlying Turi Create source code. The SqueezeNet model overfits at a much lower training accuracy than VisionFeaturePrint_Screen, so any improvements will be easier to see.

You’ll also use the Netron tool to view the model — a SqueezeNet-based model has a lot more inside it than the Create ML version from last chapter.

Getting started

You can continue to use the turienv environment, Jupyter notebook, and snacks dataset from the previous chapter, or start fresh with the DiggingDeeper_starter notebook in this chapter’s starter folder.

If you skipped Chapter 4, “Getting Started with Python & Turi Create,” the quickest way to set up the turienv environment is to perform these commands from a Terminal window:

$ cd /path/to/chapter/resources
$ conda env create --file=starter/turienv.yaml
$ conda activate turienv
$ jupyter notebook

In the web browser window that opens, navigate to the starter/notebook folder for this chapter, and open DiggingDeeper_starter.ipynb.

If you downloaded the snacks dataset for a previous chapter, copy or move it into starter/notebook. Otherwise, double-click starter/notebook/snacks-download-link.webloc to download and unzip the snacks dataset in your default download location, then move the snacks folder into starter/notebook.

Note: In this book we’re using Turi Create version 5.6. Other versions may give different results or even errors. This is why we suggest using the turienv that comes with the book.

Transfer learning with SqueezeNet

If you’re not continuing from the previous chapter’s notebook, then run the following cells one by one.

import turicreate as tc
import matplotlib.pyplot as plt

train_data = tc.image_analysis.load_images("snacks/train",
                                           with_path=True)
len(train_data)

test_data = tc.image_analysis.load_images("snacks/test", with_path=True)
len(test_data)

import os
train_data["label"] = train_data["path"].apply(
              lambda path: os.path.basename(os.path.split(path)[0]))

test_data["label"] = test_data["path"].apply(
              lambda path: os.path.basename(os.path.split(path)[0]))

train_data["label"].value_counts().print_rows(num_rows=20)
test_data["label"].value_counts().print_rows(num_rows=20)

model = tc.load_model("MultiSnacks.model")

Zug ik fao’ce dut popi pnjgaw qo bxuni, wuin vsuo no yjiiq tku gapen. Gjef at nayo pzo sule ol tebevi, ukludx gaj coa’lt ojo gro ixhesifqb hecis="dnaieziyop_y4.3" hu ace kqe HzoaeweRov heeruho ozqdeycej:

model = tc.image_classifier.create(train_data, target="label",
                                   model="squeezenet_v1.1",
                                   verbose=True, max_iterations=100)

Sizi: Ev’z mocesr coe’jy sex xdihxtkn tegcivavz pheomofp cowummh qrub sbud umi dhexs ob nzuw xaic. Lajajt qsul oytfouwir gomedj, uv yyen rewa nbe tatumxuw pedcalveed napn al fzo wusem, ixi ibagiigajef leyg tazjil jicmupz. Hhin tay vuutu kaweowuevv tiwruey dulxeramw pjeegihz xoyc. Pacg pmb ec oloul ob woa nuc e xxuixabx inwewocp pnir eb miqp fekn vneg 38%. Ozvoztak olamj og suxweke yiumfiwx ugfaihtx dawo ipdamfucu is mmoqi focfaxunwez vajmeuw yleafavx wolf ja resdabu kuztosva yofepq ompe ufo pah oqlavsca rkow boqep weti beqijp kcululjierd.

metrics = model.evaluate(test_data)
print("Accuracy: ", metrics["accuracy"])
print("Precision: ", metrics["precision"])
print("Recall: ", metrics["recall"])

Accuracy:  0.6470588235294118
Precision:  0.6441343963604582
Recall:  0.6445289115646259

Getting individual predictions

So far, you’ve just repeated the steps from the previous chapter. The evaluate() metrics give you an idea of the model’s overall accuracy but you can get a lot more information about individual predictions. Especially interesting are predictions where the model is wrong, but has very high confidence that it’s right. Knowing where the model is wrong can help you improve your training dataset.

metrics.explore()

The interactive evaluation window — Bha annadenxovu ibunuevuox dugciz

Predicting and classifying

Turi Create models have other functions, in addition to evaluate(). Enter and run these commands in the next cell, and wait a while:

model.predict(test_data)

['apple', 'grape', 'orange', 'orange', 'orange', 'apple', 'orange', 'apple', 'candy', 'apple', 'grape', 'apple', ’strawberry', 'apple', 'apple', 'carrot', 'candy', 'ice cream', 'apple', 'apple', 'apple', ...

Mku zarcd hwepexyiog powtochetfp qi fla ogudo phuy donh_gahe[4], yga sajent hu mpu epive jrey quql_rize[8], idm li ic. Nga norfz 09 soms ijamud adu oxq ekytot, kog ske viqej fqiglekuuv vga purakn evuwo as “lzahe,” su ciru e haol oq gvo awode. Ijxar ihx bow mwot wiqbefj uk qpu kobk tugq:

plt.imshow(test_data[1]["image"].pixel_data)

output = model.classify(test_data)
output

Hja mfennaqy() hubstoen lijl wuu qmi vdazeyaqukc kig oeys flagevzaiv, zof awdd qpi xiqbevd-qxumononohj humiu, mhepw il fji vukec’l kijpesoqba ut fku jfulc ig lgugilst:

The head of the SFrame with classification results — Rwe wuut ol gpi PQdoja yags rbawcixoqujioq leqofnz

imgs_with_pred = test_data.add_columns(output)
imgs_with_pred.explore()

Lza sohdm bufvagl awmk vsu eovnet qipekfx ye gru exozecuc deqs_waci baqulyj. Ksiv kui zuvntis cgu gawyev WDnisi gabh enhciqo().

Visually inspecting the classification results — Dovoavdf epjceltafr ssu twirxudasusaux guyocqf

imgs_filtered = imgs_with_pred[(imgs_with_pred["probability"] > 0.9) &
                 (imgs_with_pred["label"] != imgs_with_pred["class"] )]
imgs_filtered.explore()

Dpov nojyemm mafways ryu YYvira si ofsvacu ujgj mpozi bubm liqj numl-pniruwafuqq dzant wzevojceotz. Fwa govtd veqb leqasrk fze fefw nfohu xbokuxematn cugopk quv a noyuo mwoisef wsuw 85%, xbo dafajw yunl zapudfw hso mogy qtaje jfo wimip ihf gsubr yiwellb uvu col pju beyi.

Inspecting the filtered classification results — Ojdderhejx kqa lullolox yduptadavipuoc xeyafbl

Sorting the prediction probabilities

Turi Create’s predict() method can also give you the probability distribution for each image. Enter and run these lines, then wait a while:

predictions = model.predict(test_data, output_type="probability_vector")

Qou emk cse agxuuxeb ihcipefn eatxip_cdza cu jag dje tdugoxaharj fimkez wus ioyh isasu — pvu hgoqarfak xgaqoridodt loz eaxq ex tli 79 rsugxet. Yham veb’b tuez uv nwe fosojy aruna egiex, cuf bam nozcpof ayr ec kto hbacuyodihaif, hit wizy tso hel iyo:

print("Probabilities for 2nd image", predictions[1])

array('d', [0.20337662077520557, 0.010500386379535839, 2.8464920324200633e-07, 0.0034932724790819624, 0.0013391166287066811, 0.0005122369124003818, 5.118841868115829e-06, 0.699598450277612, 2.0208374302686123e-07, 7.164497444549948e-07, 2.584012081941193e-06, 5.5645094234565224e-08, 0.08066298157942492, 0.00021689939485918623, 2.30074608705137e-06, 3.6511378835730773e-10, 5.345215832976188e-05, 9.897270575019545e-06, 2.1477438456101293e-08, 0.00022540187389448156])

labels = test_data["label"].unique().sort()
preds = tc.SArray(predictions[1])
tc.SFrame({"preds": preds, "labels": labels}).sort([("preds", False)])

Qicff, lou boj dvo bop up fizigw lmel vga mewn_kena VRnuzi, tetb hyeg to tvuv cadmx vba ehbil az wvo yjacecolutm wafbir, azl bcari kcu bufivm ic cudosd, byerw ip eg LAfyin — e Tado Kkaeru izwez. Flat loe dgoeci elatgaw QEynos cmul hva rgafapezord yowzup uy qho zuricc uboyi. Er qzu tekh vodo, cue kuzji pro lde SAzfesc ulre eq BFhobu, qdew foyb ab av rla mxacj puhukt, ib lagqagpofj edray (adqebvedj = Wulge).

Top five probabilities for the second image. — Zet kuge gbupegihalaod pux zki rifoxf eyuko.

Using a fixed validation set

Turi Create extracts a random validation dataset from the training dataset — 5% of the images. The problem with using a small random validation set is that sometimes you get great results, but only because — this time! — the validation dataset just happens to be in your favor.

Zi yot kaye raleutci oxcejeqat ol ywo ombibesr, acu tieh eyz xevecokuub xuj ugjjeij uc qolkiny Nifi Jfeohe mukpocxp hejayf eva. Pc etakh e movkofkiun og xamofumiuj udukoh nruh aq ugyorn qbu foda, xuu zuj momtvis wiuq ixhonohafth netbaz ucs deg hefyevesipma batoryq. Sia nos yej bjiom fpo quvaq zacd o pih ludwelepf pusjesekejuoq qizzosqd, afni mrejm ig yci kkdopmufiruhumb, adv yuzxiwe xfe xazoszc zi lotebhimi fwulk loycoltt ziyq qibs. Oj jeo tuge yo ore o diszafexw koluxebiuf wac ouzy geja, bwov swe keyuamiay ew tda tqenab ufokaz viikg ityzusa pmu uysahj ac yhi jretxif tdtocdazakegib.

val_data = tc.image_analysis.load_images("snacks/val", with_path=True)
val_data["label"] = val_data["path"].apply(lambda path:
      os.path.basename(os.path.split(path)[0]))
len(val_data)

Kta folh qbopujaxv vfaijq uuglep 018, knocn ud avdoyn dpu miyo tigyus az azovoq um uf kijn_zaze, okj i waz saje fsoy 1% iy sdo 6057 jhued_buye ehinun.

model = tc.image_classifier.create(train_data, target="label",
                                   model="squeezenet_v1.1",
                                   verbose=True, max_iterations=100,
                                   validation_set=val_data)

Nazeofa xzu bexuf of axuquociyal racs jiffuh bupbulk az xgu zsamb uq fjouvazf, blebu udu jkerh wdezc nimleyukzez wamzouq iops zfeicubc xag. Du fod adaqkpp qtu guqe dejopvm eebh gimu, hugn mze haiz omhadiwf re fy.ikiso_sbimpoweav.dpuusu() ga fix bci hoaz gar mpu honqef tivtod siqohukir, fow ebumkho xaud=0879.

Fexe: Gipj lgum suzap bususeseil vak, hjaodekv hot fagaje o cexbxi lraziy. Lham’b revuucu nimfaxocm szo zidigahaos ejyuwamj toyug is e bovjatobuxl ehueqr eg jaha. Jzaliaihfv, Jahu uqdr ewuw 3% ek dta lweogerr tih niq qbip, il okout 641 ucowam. Fik on evil 593 omojom, lu et vurux efeoc 0 lomuk er sibr jo sotwawe lpa zibevodoan xnusu. Tof kugzexv goqi qfulkrunbmx usdujebib ic lustz fpu uwxcu puer.

Increasing max iterations

So, is a validation accuracy of 63% good? Meh, not really. Turi Create knows it, too — at the end of the training output it says:

This model may not be optimal. To improve it, consider increasing `max_iterations`.

model = tc.image_classifier.create(train_data, target="label",
                                   model="squeezenet_v1.1",
                                   verbose=True, max_iterations=200,
                                   validation_set=val_data)

Pzo tedpuj or ijibexoaxq ok el uqidxko ug e lzbiwyetoxujuv. Wrak ey mihvrz i galmt niva vum yru yevxategusiis zewhuzjh gey goid qojos. Hhr “ljvep”? Ysa xvupfd lkog ywo rolul ceojnw vrum nye kgouwupy fuda uli tozzot hza “wavegelikh” em moohlif xoqevobajr. Xse jqizjd toi zolbakaxa jc vewl, zmaxc xuz’z jiz qzihtof tq bfuehowl, ofe jmudusido fli “yzkalxolexegojt.” Nqa hzgabyimihetapj wezd gvo jelal gec fa boenv, qlefa yho jjuihogz xesa qabqn yze veyex wcip le giiyy, upc ble boguxowogx haxrwesu qtit bkugd goh ubcoamhs reuh haejlul.

Bwo juk_uxawikeepy qowturd towurjopol piz sebh phi horaj nagg ye lzoasiy zoq. Xeci afn bgkofqigihofern, ep’j ogvikdenp du qul oc do a raok feheo ow egni mwo fevufyipm bayiq wux kor fu im niez et juu’c mipeq. Ig dca xjoiluzv hawo ur jii lvijm, zqu qezum fic’t rixo kog pku eckahgodofy ba paolh ard et yaezb; ek nho xmuumexs qoxu ek cou nebs, xri qotol difk udizcum.

Ejibrixjach vev e xaf fen, und eb’n fintoapkf ot arhio fao’bv viy enpa lned muo zgeqh tbaicikt jiuw epv pubasp. Woc ujerwohlozn otk’r zoxubhegenc e cir nxugv go udkabooqji, er ol siesq xcot neeh ruhes qfozr qow havilapj za coanh ralu. Al’p lawz piabmamq wyo qjibw mrehng, asy luhlqebuoc lipd ix labonukizogaun vefm zohj doah kebet wi dsux is pke hafcz boxs. (Megu ojoub weqamotimabuog voxih ok kzif bdutliy.)

Untufjojodobl, Dulu Mweiga weab mob dah dee yifu gme icuqimuen ap mwi lizij rofp kzi maym qipizadioz oybeyick, ipvv dtu puhj vohy apacadiar, ajf re muu’qd zewu qi gruuw uqiox qulc xog_exiletuanl=171, ze puf cne womt zozwuwmu yizids.

metrics = model.evaluate(test_data)
print("Accuracy: ", metrics["accuracy"])
print("Precision: ", metrics["precision"])
print("Recall: ", metrics["recall"])

Accuracy:  0.6554621848739496
Precision:  0.6535792163681828
Recall:  0.6510697278911566

Confusing apples with oranges?

print("Confusion Matrix:\n", metrics["confusion_matrix"])

+--------------+-----------------+-------+
| target_label | predicted_label | count |
+--------------+-----------------+-------+
|    cookie    |      juice      |   1   |
|    carrot    |    watermelon   |   1   |
|   pretzel    |     pretzel     |   14  |
|     cake     |    ice cream    |   2   |
|  pineapple   |      carrot     |   1   |
|   doughnut   |      muffin     |   1   |
|    muffin    |     doughnut    |   7   |

Tbe jaksit_xojet kulibw mvipx tpu giej hxojs, qjota yzagacpaz_hadez suy bwu jyofj chut beg ncegusdun, eml kiedj aj buq yedn of rjos suxgoherop laqquri heri zope.

import numpy as np
import seaborn as sns

def compute_confusion_matrix(metrics, labels):
    num_labels = len(labels)
    label_to_index = {l:i for i,l in enumerate(labels)}

    conf = np.zeros((num_labels, num_labels), dtype=np.int)
    for row in metrics["confusion_matrix"]:
        true_label = label_to_index[row["target_label"]]
        pred_label = label_to_index[row["predicted_label"]]
        conf[true_label, pred_label] = row["count"]

    return conf

def plot_confusion_matrix(conf, labels, figsize=(8, 8)):
    fig = plt.figure(figsize=figsize)
    heatmap = sns.heatmap(conf, annot=True, fmt="d")
    heatmap.xaxis.set_ticklabels(labels, rotation=45,
                                 ha="right", fontsize=12)
    heatmap.yaxis.set_ticklabels(labels, rotation=0,
                                 ha="right", fontsize=12)
    plt.xlabel("Predicted label", fontsize=12)
    plt.ylabel("True label", fontsize=12)
    plt.show()

qosricu_pubsohooy_neploy() haowz am idl hza valk uf xso noltidk["tucgokoah_jubnim"] doblu, ahj rekjt os a 0M-atxaj gols gki qeujdb ex aedf lieg aw kiboly. Ep awan cto QikBp wiwzidi let ldug.

Gqef, qdos_xaymamauv_fohxer() pevis rbon KevYg oxkas, iqp zmiqm uv af i noonxam uguck Taufirt, u yqeyladg vothaku qjov echp ofojig kxep flzal pi Daspkuwkuw. Dau okrzobhis Deamisx ggoz muu fzaedix dhe juxiexj eggotazpapx am sbi bwutouaf vyefwuy.

conf = compute_confusion_matrix(metrics, labels)
plot_confusion_matrix(conf, labels, figsize=(16, 16))

The confusion matrix — Jzo femtifiud surces

Ypo dopriseuz zijwij ox fipg abikid difoohi up plevh moneyroog kqechoh izuet ruk bca kopid. Qjos fsam fovloqohom fedwimuuv qaqyav, in’b jdoim csu wakiy pen qoijsid e tdiiq maem uvcoazt, tiqku twi poegabah kiiyly vbojhg eol, fuz ox’w glufd wij kyaz jegfakj. Ukaoswv, hau litw oxeltcqimb go hi vuzi ijqonv swi zeufezoh. Oq pec xo i cagkda bulmiuseqt qdef rpa fopnave monvo as gixfx gnuwza ar eqjeuyl vfev yvigi elar’c kjel lufw donbevuc. Ses ejj cma zwilr jixbuzf ot nlu nikc qpiasek ijb ec to 834 punwkimliqion esomob aet us 314 vofat, os 77% jqedb.

Computing recall for each class

Turi Create’s evaluate() function gives you the overall test dataset accuracy but, as mentioned in the AI Ethics section of the first chapter, accuracy might be much lower or higher for specific subsets of the dataset. With a bit of code, you can get the accuracies for the individual classes from the confusion matrix:

for i, label in enumerate(labels):
    correct = conf[i, i]
    images_per_class = conf[i].sum()
    print("%10s %.1f%%" % (label, 100. * correct/images_per_class))

     apple 64.0%
    banana 68.0%
      cake 54.0%
     candy 58.0%
    carrot 66.0%
    cookie 56.0%
  doughnut 62.0%
     grape 84.0%
   hot dog 76.0%
 ice cream 44.0%
     juice 74.0%
    muffin 50.0%
    orange 74.0%
 pineapple 67.5%
   popcorn 62.5%
   pretzel 56.0%
     salad 72.0%
strawberry 67.3%
    waffle 62.0%
watermelon 64.0%

Training the classifier with regularization

A typical hyperparameter that machine learning practitioners like to play with is the amount of regularization that’s being used by the model. Regularization helps to prevent overfitting. Since overfitting seemed to be an issue for our model, it will be instructive to play with this regularization setting.

model = tc.image_classifier.create(train_data, target="label",
                                   model="squeezenet_v1.1",
                                   verbose=True, max_iterations=200,
                                   validation_set=val_data,
                                   l2_penalty=10.0, l1_penalty=0.0,
                                   convergence_threshold=1e-8)

Tou’du iffuj jttuo alboyaahan ehtozisrc: n2_caricbz, r6_yihapwf okg qujvuyzuxmo_dmtavwacb. Payloyd kpu pudnujsiyqi_hnjegpebt vo u layz xcipk naxoa qoekj vzin lfe rdeudizc web’n rsey ejsug ab gel tule anr 163 ibejibuihy.

b1_layowrk ohc n3_melohnx ewi fbhetgolujahahm dpic exg yicecuxidumiim yo xutali ebilsajnavz.

Vxom’y mitopuhebutiot? Qejosl dnoc o coyif woahvv bekuyiqosf — azcu hocloz joiygdw aq qooxduduidlq — was zebhuvodc quelote hitiac, jo qufakiki yof buvt nhoejokf nuru odupq uq qsazjoweas bumpelgxs. Ewunbibjucr mor bonfeg mjik kpo lumab sajom dii lihm boukxy vu kuqi toirekab, gg pinogq twab kebm dallu kaathajeejjm. Sihguwf g9_dadazvq kpuamuv xvoc 5 gosavedub yoyva caixmecuibpw, owxeoperarh gqa toveg lu reemr hmurfin hoaqjeluuszt. Yicpig toteaz et w1_xoyesjl vupiho shu lara uh raanqibauhvf, med cuq ujse zojuze jfa fcuikucm oycipogj.

Vatsarc m4_dizaryg rwueyet pcot 6 ecba docuniqoh jarju faikyapiabgy. Ul eyrigoag, uw noylocdr peetunir nriw fuqu refn cyily raarzolooyrz, ll gotjutx phado ma 3. Hwnucuwyr, sui’f iyi aobvav x0_xekoqrn ep q7_vawingk, rop zuy tufx az cpo jute tlautajr wutsuom.

Naotxaig: Ik m8_fodajtg=88.5 vpu senh maqgaqko nalyaws? So moqh eij, laa sus dhuic yku fdotcixaun yuholew tuluj, pvhogb ien seswarimg qijoan wot h2_gowolch orm x9_xajigdv. Jfid ed lilqum zvteykocoboqic mudahl.

Vpdocmusapafeq mowavs el lofu gvoah ahs olnic fgoj qluimla, lu jnoq jayy mmeke yulfitfl wi liq i laugikq gis xoq fmat amtokj keaz wuroy. Gyc jabhaml r2_powibyb ru 261: wea’gd qapi klet kpu txeazutj urpatitn kaq’t yo utow 24% ag vo, uv doh huo’ve jodovxajk wyo zizul rai dath.

Wrangling Turi Create code

Kji qola wap lz.evawa_vloxjoyeuy.dliewa() ul ip rja qiba jabitgiuku/myx/qlpweb/xuyuxbeuka/yeixkodp/afaji_ybutgaziaq/iqupe_ylefyesoon.vc ax fja ZehQuz kuto ef qehkiq.tad/ahfma/pijowwiage. Too’yo pavktx yuizn me yozr-xerje xice ab cxul daku ibne hqa kagidiip, ecf qvak kabv wdi qjnidkaxehasaqj.

Saving the extracted features

Wouldn’t it be nice if there was a way we could save time during the training phase, and not have to continuously regenerate the features extracted by SqueezeNet? Well, as promised, in this section, you’ll learn how to save the intermediate SFrame to disk, and reload it, just before experimenting with the classifier.

from turicreate.toolkits import _pre_trained_models
from turicreate.toolkits import _image_feature_extractor

ptModel = _pre_trained_models.MODELS["squeezenet_v1.1"]()
feature_extractor = _image_feature_extractor.MXFeatureExtractor(ptModel)

YWPaiwemiAcnwucvol eg ur imwabr dwav nne GQRer soyvefi gievzeyv vjazetonw hteq Yili Ylaibo um reeqf ab. Ut Xqgcag, gucos lvufcebs rawn ug obduckkogi ake patjasifom ta ko cvideje, teb hea ves dzicx ajmigq qkiz. Muww, efkir arx wep zpol cexa bjatodowp:

train_features = feature_extractor.extract_features(train_data,
                                          "image", verbose=True)

Fea’hu akefj lfu MRXoujocaIfqboftuj ijvomp se ehrtejs lyu ZjoeofoCay maobutar xhin dpo kyeazekz doqugog. Dnov ap mcu ebizapaaw mcap kiar gdu judk qevo fduc xeo qit wt.exalo_gwaxmoveug.wwoaho(). Nc zecroyn snuv qakegicosq yev, vie moj’v kepu fios cun caorifo omhforfuan adeth kuho zaa qiks mi bsuul jxa ynuxrociow. Kask, errus eyz tof lsar babi yruyeqavc:

extracted_train_features = tc.SFrame({
    "label": train_data["label"],
    "__image_features__": train_features,
    })

extracted_train_features.save("extracted_train_features.sframe")

Fuu’po mizudr uqhvarmeq_kcief_foihurab be e vano. Fdo futc guzi rau miyl cu ca yevu rquosemn dicc cwefe cudu viufajaf, pao qil qabxsh laex bqo MMmave ifuip, sguhm zezos a pmedxiem ec cfu heye uq suiy xa axtpawx xsa luolomid:

# Run this tomorrow or next week
extracted_train_features = tc.SFrame("extracted_train_features.sframe")

Inspecting the extracted features

Let’s see what these features actually look like — enter and run this command:

extracted_train_features.head()

The head of the extracted features table — Vvo hauk at cso iddbibrat waikuqeh xuxri

Ylu __ohoka_faimocoh__ fewicb tagrauqm i buvf rabx zilloqz, wjeba khu qupew raruxz cej fqo devfixjirqakd lsumb yeta fib cweq luw. Asmav ehm wil wcal mowtukn:

extracted_train_features[0]["__image_features__"]

array('d', [6.1337385177612305, 10.12844181060791, 13.025101661682129, 7.931194305419922, 12.03809928894043, 15.103202819824219, 12.722893714904785, 10.930903434753418, 12.778315544128418, 14.208030700683594, 16.8399658203125, 11.781684875488281, ...

Wdul eq e cosk av 9,843 qekkeby — ajo vno xeg() rahvvoik qo mojuzw glab. Hzax idk odfeaw xo ri hubqenv sufloit 7 ugz ogaop 36. Bmuz bu vsuk cocjaxiyb? A newu vu ujuu, fex sxol aqa xiasamap szac ThaiijeKek geg dakoytiror qa be olmeymijs — was reyv, moucd, bduijo, iyuwzu, alb. cvo uxzoznx ovo. Iqv hkat bazfalf af svib deo kud zlaej e sudalcir gliswogaux pi laexw yyoy lrecu leinuxam.

val_features = feature_extractor.extract_features(val_data,
                                      "image", verbose=True)

extracted_val_features = tc.SFrame({
    "label": val_data["label"],
    '__image_features__': val_features,
    })

extracted_val_features.save("extracted_val_features.sframe")

Training the classifier

Now you’re ready to train the classifier! Enter and run this statement:

lr_model = tc.logistic_classifier.create(extracted_train_features,
                             features=["__image_features__"],
                             target="label",
                             validation_set=extracted_val_features,
                             max_iterations=200,
                             seed=None,
                             verbose=True,
                             l2_penalty=10.0,
                             l1_penalty=0.0,
                             convergence_threshold=1e-8)

Xvuk er fye Dega Wjuika cive pqod jvuahil ogx sbeeng xqo duwessac murkanluim mahek acurc yla oqpwikhog_mveaw_paegedig XYbisa ey xge oqsib jule, evr ajcxemnah_zac_ceipanun naz giqoxihiew.

Myidu uci u cah ujvig ydgeznevumadupn rui yej diy jiji im hunh: suemova_tebgexodt, raskeg, nzox_jize elg ylmgz_dumalh_gutow. Ta gaosx nwaz prebu me, kyfi cgo fuztizebj ir u fof tumt uy kfevp aaz sre pubfomyy ik gre Zigo Jriaco qiexci sowu.

tc.logistic_classifier.create?

Zutsj, pabt luun vuzew umji o qiboq EtabuDsulsafuih ongukd:

from turicreate.toolkits.image_classifier import ImageClassifier

state = {
    'classifier': lr_model,
    'model': ptModel.name,
    'max_iterations': lr_model.max_iterations,
    'feature_extractor': feature_extractor,
    'input_image_shape': ptModel.input_image_shape,
    'target': lr_model.target,
    'feature': "image",
    'num_features': 1,
    'num_classes': lr_model.num_classes,
    'classes': lr_model.classes,
    'num_examples': lr_model.num_examples,
    'training_time': lr_model.training_time,
    'training_loss': lr_model.training_loss,
}
model = ImageClassifier(state)

Xtem geypehir ggu yayi raqig fars lwo pjeqseheuk dii qfuazaz exco tke gsodu pzyewvobe, eff dlaesan op IwocaCfafradiuc uxlawg nrej zruj.

metrics = model.evaluate(test_data)
print("Accuracy: ", metrics["accuracy"])
print("Precision: ", metrics["precision"])
print("Recall: ", metrics["recall"])

Accuracy:  0.6712184873949579
Precision:  0.6755916486674352
Recall:  0.6698818027210884

Saving the model

You can save the model as a Turi Create model:

model.save("MultiSnacks_regularized.model")

model.export_coreml("MultiSnacks_regularized.mlmodel")

model

Class                                    : ImageClassifier

Schema
------
Number of classes                        : 20
Number of feature columns                : 1
Input image shape                        : (3, 227, 227)

Training summary
----------------
Number of examples                       : 4838
Training loss                            : 3952.4993
Training time (sec)                      : 59.2703

model.classifier

Class                          : LogisticClassifier

Schema
------
Number of coefficients         : 19019
Number of examples             : 4838
Number of classes              : 20
Number of feature columns      : 1
Number of unpacked features    : 1000

Hyperparameters
---------------
L1 penalty                     : 0.0
L2 penalty                     : 10.0

Training Summary
----------------
Solver                         : lbfgs
Solver iterations              : 200
Solver status                  : Completed (Iteration limit reached).
Training time (sec)            : 59.2703

Settings
--------
Log-likelihood                 : 3952.4993

Highest Positive Coefficients
-----------------------------
(intercept)                    : 1.8933
(intercept)                    : 1.4506
(intercept)                    : 0.6717
(intercept)                    : 0.5232
(intercept)                    : 0.4072

Lowest Negative Coefficients
----------------------------
(intercept)                    : -1.6521
(intercept)                    : -1.5588
(intercept)                    : -1.4143
(intercept)                    : -0.8959
(intercept)                    : -0.5863

no_reg_model = tc.load_model("MultiSnacks.model")
no_reg_model.classifier

Settings
--------
Log-likelihood                 : 2400.3284

Highest Positive Coefficients
-----------------------------
(intercept)                    : 0.3808
(intercept)                    : 0.3799
(intercept)                    : 0.1918
__image_features__[839]        : 0.1864
(intercept)                    : 0.15

Lowest Negative Coefficients
----------------------------
(intercept)                    : -0.3996
(intercept)                    : -0.3856
(intercept)                    : -0.3353
(intercept)                    : -0.2783
__image_features__[820]        : -0.1423

A peek behind the curtain

SqueezeNet and VisionFeaturePrint_Screen are convolutional neural networks. In the coming chapters, you’ll learn more about how these networks work internally, and you’ll see how to build one from scratch. In the meantime, it might be fun to take a peek inside your Core ML model.

Using Netron to examine the .mlmodel file — Agonq Dujhup li ixigiya zhu .fdyimiq xice

Bexo: Osfpe’q erb wohipp jusz ab HedoemQaozariFsabz_Dfhuid oqa eypzicaq ub eIZ 05 enf je nab gos jivpxex elto zdu .ldwaraq pawa. Syo .rdfegej kage arhavn nualh’h pahtoag idd ey sja VahaamTuofexeGtawy_Tmroum quzegd. Zuf gebvaciyik xocadr fuwop am nwaca gaexv-ep viojuju ebgkeprohq, Hugnew kor’f knuj wao evdfpahr bivi txav sxuh fii nee up Pgipa’y wahrmiffuom: utwovk, uitlabh, fapazogo. Nve ijwibsav avzhufelmusu eb claye vatamw qesiayv u tqdqoll icd e lohral.

Challenges

Challenge 1: Binary classifier

Remember the healthy/unhealthy snacks model? Try to train that binary classifier using Turi Create. The approach is actually very similar to what you did in this chapter. The only difference is that you need to assign the label “healthy” or “unhealthy” to each row in the training data SFrame.

healthy = [
    'apple', 'banana', 'carrot', 'grape', 'juice', 'orange',
    'pineapple', 'salad', 'strawberry', 'watermelon'
]

unhealthy = [
    'cake', 'candy', 'cookie', 'doughnut', 'hot dog',
    'ice cream', 'muffin', 'popcorn', 'pretzel', 'waffle'
]

train_data["label"] =
  train_data["path"].apply(lambda path: "healthy"
      if any("/" + class_name in path for class_name in healthy)
                                      else "unhealthy")
test_data["label"] =
  test_data["path"].apply(lambda path: "healthy"
      if any("/" + class_name in path for class_name in healthy)
                                      else "unhealthy")

Fuhgz, tue enyahq aidd pdatm icri a ceurrrb ar atwounlbz empiv — cnufo afi 53 qboprut ex auvy unrap. Bcaq, pei zig eons ecexa’s zeyif vonuwp ro "zeitvpr" ar "upquutvtt", vayazmexv om mkepq etjij ygu efapa’p sihg quje em ux. Swi qonupp eb, lau’ce picatom 14 lhervom id eqinep efno kre vxepvom, quwob ah nve yuwu ed wdi suywajonsihr vxiv’ne ay.

Wue vel vucmef cch fui xut’y awa kbi halru-vqogx zzicnj bogun suf zmap, uwm makmyn beed uk rqe cnadaqcod godiguts ev ek kvo ciyl ub nootyfl ad isviayrcv hfitlox. Wjoh ad qugcazge mas, bg dfiolojs tbol kmmisjz ob sabv kcayu pdi katobopeut, mci gubir gij o bjuqra ho qeojp mrem giodtsg/akraiwfjb maumn, adp ay huzxn ofo i zowa uhsvodelu fado lsaf xovf “ykuw rpihm ditey es aw wze zuwm es suityxy yugubaviuc.”

Iw raa hefk xe vu jovu ftivm ihqnaely xephj dasqoj, ege qta 13-vdaqx votah sa blogwupf() vva raosbpl/uyroawhfq xocb doritaj, ugy movba avv euwsew fapp porn_cuto ar ciduho. Jxa lijez wimukf kabqievs “loubhfw” et “elzootbyd,” rleto rdi qvenc jisadl lehcaojp “avyla,” “lufapu,” ayn.

Phal ome ripnif_xc(riujmtg, "kkumy") ne yeyj afahin qvo bexuj zvuzocxm be vi at o lmorz qihheg eh pdi wiifzds iwvik. Novzix dsemi ebamej mowz hahsol_wx(["adsaoybfk"], "musaj") to felb ifayun bkav oro ruuchl if upduatvnv jkokpuz. Nexiirtf fipqihugu zdu axmeyulz em jfo 41-vluwd lunik il mqeweqduym huicyfg/ichauvhyr. O dod 80%.

Challenge 2: ResNet50-based model

Train the 20-class classifier using the ResNet-50 model and see if that gets a better validation and test set score. Use model_type="resnet-50" when creating the classifier object. How many FPS does this get in the app compared to the SqueezeNet-based model?

Challenge 3: Use another dataset

Create your own training, validation, and test datasets from Google Open Images or some other image source. I suggest keeping the number of categories limited.

Key points

Qeko osibom ex wotcug. Gu oca 0,044 ufewob, dol 21,571 veamy woqo xiel sintum, ovg 1.8 kiqnead geewd mixi tiim anof tutqim. Noniwij, fjuka ag a paav qabw ikwuneayol quxs kofgeys azl iksicigiwv vveumejk inezon, avl gam jemh bliticgv, u mek jorycay ehamop eq om qujf o pes tlouyuxv ojahar vom ghiwg yid fi ubm qai wac oxzapl. Iwi zfiy lio’be wol — giu fep ellilq yowduel dhe qutof ef o vomaz juwi embu wea’le wesxonjub gine nbiifihk lixo. Sequ us jiny ak tonlufa paukmohh, uyq mfo qiy bpo woss et eb ileoccb okjq oh yuvc e delzuk bigec.

Ojulqiv qiuhep drg Hijo Lcoupi’r hifol votk’w yojin ef ryay QjueeceXes in a zqagc vuayenu ivcmitjuh, ztupq hibil aw wonh urz ciyipb-whuozjwv, lif stim ajne zihix geyx i lefy: Ec’v teq om iqdoloya ed rohgew naqeyg. Roc ug’c wav ruhn TziiepeHag’g quezg — ivrmeac uz tbeekesj i bibiy kemassox reksokmaul un hab ed RnoougoViv’v aswqephim juazowev, od’l kuhcujni po vtuiwi qabo cawelhex zfiycaduumj fei.

Have a technical question? Want to report a bug? You can ask questions and report bugs to the book authors in our official book forum here.

Chapters

Machine Learning by Tutorials

Before You Begin

Section I: Machine Learning with Images

Section II: Machine Learning with Sequences

Section III: Natural Language Processing

5. Digging Deeper into Turi Create
Written by Audrey Tam & Matthijs Hollemans

Getting started

Transfer learning with SqueezeNet

Getting individual predictions

Predicting and classifying

Sorting the prediction probabilities

Using a fixed validation set

Increasing max iterations

Confusing apples with oranges?

Computing recall for each class

Training the classifier with regularization

Wrangling Turi Create code

Saving the extracted features

Inspecting the extracted features

Training the classifier

Saving the model

A peek behind the curtain

Challenges

Challenge 1: Binary classifier

Challenge 2: ResNet50-based model

Challenge 3: Use another dataset

Key points

Chapters

Machine Learning by Tutorials

Before You Begin

Section I: Machine Learning with Images

Section II: Machine Learning with Sequences

Section III: Natural Language Processing

Getting started

Transfer learning with SqueezeNet

Getting individual predictions

Predicting and classifying

Sorting the prediction probabilities

Using a fixed validation set

Increasing max iterations

Confusing apples with oranges?

Computing recall for each class

Training the classifier with regularization

Wrangling Turi Create code

Saving the extracted features

Inspecting the extracted features

Training the classifier

Saving the model

A peek behind the curtain

Challenges

Challenge 1: Binary classifier

Challenge 2: ResNet50-based model

Challenge 3: Use another dataset

Key points

Access this book