
Rules of Machine Learning:

Best Practices for ML Engineering

Martin Zinkevich

This document is intended to help those with a basic knowledge of machine learning get the benefit of best practices in machine learning from around Google. It presents a style for machine learning, similar to the Google C++ Style Guide and other popular guides to practical programming. If you have taken a class in machine learning, or built or worked on a machine-learned model, then you have the necessary background to read this document.

Terminology
Overview
Before Machine Learning
    Rule #1: Don't be afraid to launch a product without machine learning.
    Rule #2: Make metrics design and implementation a priority.
    Rule #3: Choose machine learning over a complex heuristic.
ML Phase I: Your First Pipeline
    Rule #4: Keep the first model simple and get the infrastructure right.
    Rule #5: Test the infrastructure independently from the machine learning.
    Rule #6: Be careful about dropped data when copying pipelines.
    Rule #7: Turn heuristics into features, or handle them externally.
Monitoring
    Rule #8: Know the freshness requirements of your system.
    Rule #9: Detect problems before exporting models.
    Rule #10: Watch for silent failures.
    Rule #11: Give feature columns owners and documentation.
Your First Objective
    Rule #12: Don't overthink which objective you choose to directly optimize.
    Rule #13: Choose a simple, observable and attributable metric for your first objective.
    Rule #14: Starting with an interpretable model makes debugging easier.
    Rule #15: Separate Spam Filtering and Quality Ranking in a Policy Layer.
ML Phase II: Feature Engineering
    Rule #16: Plan to launch and iterate.
    Rule #17: Start with directly observed and reported features as opposed to learned features.
    Rule #18: Explore with features of content that generalize across contexts.
    Rule #19: Use very specific features when you can.
    Rule #20: Combine and modify existing features to create new features in human-understandable ways.
    Rule #21: The number of feature weights you can learn in a linear model is roughly proportional to the amount of data you have.
    Rule #22: Clean up features you are no longer using.
Human Analysis of the System
    Rule #23: You are not a typical end user.
    Rule #24: Measure the delta between models.
    Rule #25: When choosing models, utilitarian performance trumps predictive power.
    Rule #26: Look for patterns in the measured errors, and create new features.
    Rule #27: Try to quantify observed undesirable behavior.
    Rule #28: Be aware that identical short-term behavior does not imply identical long-term behavior.
Training-Serving Skew
    Rule #29: The best way to make sure that you train like you serve is to save the set of features used at serving time, and then pipe those features to a log to use them at training time.
    Rule #30: Importance weight sampled data, don't arbitrarily drop it!
    Rule #31: Beware that if you join data from a table at training and serving time, the data in the table may change.
    Rule #32: Reuse code between your training pipeline and your serving pipeline whenever possible.
    Rule #33: If you produce a model based on the data until January 5th, test the model on the data from January 6th and after.
    Rule #34: In binary classification for filtering (such as spam detection or determining interesting e-mails), make small short-term sacrifices in performance for very clean data.
    Rule #35: Beware of the inherent skew in ranking problems.
    Rule #36: Avoid feedback loops with positional features.
    Rule #37: Measure Training/Serving Skew.
ML Phase III: Slowed Growth, Optimization Refinement, and Complex Models
    Rule #38: Don't waste time on new features if unaligned objectives have become the issue.
    Rule #39: Launch decisions will depend upon more than one metric.
    Rule #40: Keep ensembles simple.
    Rule #41: When performance plateaus, look for qualitatively new sources of information to add rather than refining existing signals.
    Rule #42: Don't expect diversity, personalization, or relevance to be as correlated with popularity as you think they are.
    Rule #43: Your friends tend to be the same across different products. Your interests tend not to be.
Related Work
Acknowledgements
Appendix
    YouTube Overview
    Google Play Overview
    Google Plus Overview

Terminology

The following terms will come up repeatedly in our discussion of effective machine learning:

Instance: The thing about which you want to make a prediction. For example, the instance might be a web page that you want to classify as either "about cats" or "not about cats".
Label: An answer for a prediction task, either the answer produced by a machine learning system, or the right answer supplied in training data. For example, the label for a web page might be "about cats".
Feature: A property of an instance used in a prediction task. For example, a web page might have a feature "contains the word 'cat'".
Feature Column: A set of related features, such as the set of all possible countries in which users might live. An example may have one or more features present in a feature column. A feature column is referred to as a "namespace" in the VW system (at Yahoo/Microsoft), or a "field". ("Feature column" is Google-specific terminology.)
Example: An instance (with its features) and a label.
Model: A statistical representation of a prediction task. You train a model on examples, then use the model to make predictions.
Metric: A number that you care about. May or may not be directly optimized.
Objective: A metric that your algorithm is trying to optimize.
Pipeline: The infrastructure surrounding a machine learning algorithm. Includes gathering the data from the front end, putting it into training data files, training one or more models, and exporting the models to production.

Overview

To make great products:

    do machine learning like the great engineer you are, not like the great machine learning expert you aren't.

Most of the problems you will face are, in fact, engineering problems. Even with all the resources of a great machine learning expert, most of the gains come from great features, not great machine learning algorithms. So, the basic approach is:
1. make sure your pipeline is solid end to end
2. start with a reasonable objective
3. add common-sense features in a simple way
4. make sure that your pipeline stays solid.
This approach will make lots of money and/or make lots of people happy for a long period of time. Diverge from this approach only when there are no more simple tricks to get you any farther. Adding complexity slows future releases.

Once you've exhausted the simple tricks, cutting-edge machine learning might indeed be in your future. See the section on Phase III machine learning projects.

This document is arranged in four parts:
1. The first part should help you understand whether the time is right for building a machine learning system.
2. The second part is about deploying your first pipeline.
3. The third part is about launching and iterating while adding new features to your pipeline, how to evaluate models, and training-serving skew.
4. The final part is about what to do when you reach a plateau.
Afterwards, there is a list of related work and an appendix with some background on the systems commonly used as examples in this document.

Before Machine Learning

Rule #1: Don't be afraid to launch a product without machine learning.
Machine learning is cool, but it requires data. Theoretically, you can take data from a different problem and then tweak the model for a new product, but this will likely underperform basic heuristics. If you think that machine learning will give you a 100% boost, then a heuristic will get you 50% of the way there.

For instance, if you are ranking apps in an app marketplace, you could use the install rate or number of installs. If you are detecting spam, filter out publishers that have sent spam before. Don't be afraid to use human editing either. If you need to rank contacts, rank the most recently used highest (or even rank alphabetically). If machine learning is not absolutely required for your product, don't use it until you have data.

Rule #2: First, design and implement metrics.
Before formalizing what your machine learning system will do, track as much as possible in your current system. Do this for the following reasons:

1. It is easier to gain permission from the system's users earlier on.
2. If you think that something might be a concern in the future, it is better to get historical data now.
3. If you design your system with metric instrumentation in mind, things will go better for you in the future. Specifically, you don't want to find yourself grepping for strings in logs to instrument your metrics!
4. You will notice what things change and what stays the same. For instance, suppose you want to directly optimize one-day active users. However, during your early manipulations of the system, you may notice that dramatic alterations of the user experience don't noticeably change this metric.

The Google Plus team measures expands per read, reshares per read, plus-ones per read, comments/read, comments per user, reshares per user, etc., which they use in computing the goodness of a post at serving time. Also, note that an experiment framework, where you can group users into buckets and aggregate statistics by experiment, is important. See Rule #12.

By being more liberal about gathering metrics, you can gain a broader picture of your system. Notice a problem? Add a metric to track it! Excited about some quantitative change on the last release? Add a metric to track it!

Rule #3: Choose machine learning over a complex heuristic.
A simple heuristic can get your product out the door. A complex heuristic is unmaintainable. Once you have data and a basic idea of what you are trying to accomplish, move on to machine learning. As in most software engineering tasks, you will want to be constantly updating your approach, whether it is a heuristic or a machine-learned model, and you will find that the machine-learned model is easier to update and maintain (see Rule #16).

ML Phase I: Your First Pipeline

Focus on your system infrastructure for your first pipeline. While it is fun to think about all the imaginative machine learning you are going to do, it will be hard to figure out what is happening if you don't first trust your pipeline.

Rule #4: Keep the first model simple and get the infrastructure right.
The first model provides the biggest boost to your product, so it doesn't need to be fancy. But you will run into many more infrastructure issues than you expect. Before anyone can use your fancy new machine learning system, you have to determine:

1. How to get examples to your learning algorithm.
2. A first cut as to what "good" and "bad" mean to your system.
3. How to integrate your model into your application. You can either apply the model live, or precompute the model on examples offline and store the results in a table. For example, you might want to pre-classify web pages and store the results in a table, but you might want to classify chat messages live.

Choosing simple features makes it easier to ensure that:
1. The features reach your learning algorithm correctly.
2. The model learns reasonable weights.
3. The features reach your model in the server correctly.
Once you have a system that does these three things reliably, you have done most of the work. Your simple model provides you with baseline metrics and a baseline behavior that you can use to test more complex models. Some teams aim for a "neutral" first launch: a first launch that explicitly de-prioritizes machine learning gains, to avoid getting distracted.

Rule #5: Test the infrastructure independently from the machine learning.
Make sure that the infrastructure is testable, and that the learning parts of the system are encapsulated so that you can test everything around it. Specifically:
1. Test getting data into the algorithm. Check that feature columns that should be populated are populated. Where privacy permits, manually inspect the input to your training algorithm. If possible, check statistics in your pipeline in comparison to elsewhere, such as RASTA.
2. Test getting models out of the training algorithm. Make sure that the model in your training environment gives the same score as the model in your serving environment (see Rule #37).

Machine learning has an element of unpredictability, so make sure that you have tests for the code for creating examples in training and serving, and that you can load and use a fixed model during serving. Also, it is important to understand your data: see Practical Advice for Analysis of Large, Complex Data Sets.
Rule #6: Be careful about dropped data when copying pipelines.
Often we create a pipeline by copying an existing pipeline (i.e. cargo cult programming), and the old pipeline drops data that we need for the new pipeline. For example, the pipeline for Google Plus What's Hot drops older posts (because it is trying to rank fresh posts). This pipeline was copied to use for Google Plus Stream, where older posts are still meaningful, but the pipeline was still dropping old posts. Another common pattern is to only log data that was seen by the user. Thus, this data is useless if we want to model why a particular post was not seen by the user, because all the negative examples have been dropped. A similar issue occurred in Play. While working on Play Apps Home, a new pipeline was created that also contained examples from two other landing pages (Play Games Home and Play Home Home) without any feature to disambiguate where each example came from.

Rule #7: Turn heuristics into features, or handle them externally.
Usually the problems that machine learning is trying to solve are not completely new. There is an existing system for ranking, or classifying, or whatever problem you are trying to solve. This means that there are a bunch of rules and heuristics. These same heuristics can give you a lift when tweaked with machine learning. Your heuristics should be mined for whatever information they have, for two reasons. First, the transition to a machine-learned system will be smoother. Second, usually those rules contain a lot of the intuition about the system you don't want to throw away. There are four ways you can use an existing heuristic:
1. Preprocess using the heuristic. If the feature is incredibly awesome, then this is an option. For example, if, in a spam filter, the sender has already been blacklisted, don't try to relearn what "blacklisted" means. Block the message. This approach makes the most sense in binary classification tasks.
2. Create a feature. Directly creating a feature from the heuristic is great. For example, if you use a heuristic to compute a relevance score for a query result, you can include the score as the value of a feature. Later on you may want to use machine learning techniques to massage the value (for example, converting the value into one of a finite set of discrete values, or combining it with other features), but start by using the raw value produced by the heuristic.
3. Mine the raw inputs of the heuristic. If there is a heuristic for apps that combines the number of installs, the number of characters in the text, and the day of the week, then consider pulling these pieces apart, and feeding these inputs into the learning separately. Some techniques that apply to ensembles apply here (see Rule #40).
4. Modify the label. This is an option when you feel that the heuristic captures information not currently contained in the label. For example, if you are trying to maximize the number of downloads, but you also want quality content, then maybe the solution is to multiply the label by the average number of stars the app received. There is a lot of space here for leeway. See the section on "Your First Objective".
Do be mindful of the added complexity when using heuristics in an ML system. Using old heuristics in your new machine learning algorithm can help to create a smooth transition, but think about whether there is a simpler way to accomplish the same effect.
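As a minimal sketch of options 2 and 4 above, here is one way the heuristic output and the label adjustment might look. The function and field names here are hypothetical placeholders, not taken from any real system:

    # Sketch: folding an existing heuristic into an ML pipeline.
    # heuristic_relevance() stands in for your legacy hand-tuned rule.

    def heuristic_relevance(query, doc):
        # Pretend this is the old hand-tuned relevance rule.
        return 1.0 if query in doc["title"] else 0.1

    def build_example(query, doc, downloaded, avg_stars):
        features = {
            # Option 2: expose the raw heuristic output as a feature value
            # and let the learner decide how much to trust it.
            "heuristic_relevance": heuristic_relevance(query, doc),
            "num_installs": doc["num_installs"],
        }
        # Option 4: modify the label, e.g. weight a download by the app's
        # average star rating so quality shows up in the objective.
        label = float(downloaded) * avg_stars
        return features, label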

Monitoring

In general, practice good alerting hygiene, such as making alerts actionable and having a dashboard page.

Rule #8: Know the freshness requirements of your system.
How much does performance degrade if you have a model that is a day old? A week old? A quarter old? This information can help you to understand the priorities of your monitoring. If you lose 10% of your revenue if the model is not updated for a day, it makes sense to have an engineer watching it continuously. Most ad serving systems have new advertisements to handle every day, and must update daily. For instance, if the ML model for Google Play Search is not updated, it can have an impact on revenue in under a month. Some models for What's Hot in Google Plus have no post identifier in their model, so they can export these models infrequently. Other models that have post identifiers are updated much more frequently. Also notice that freshness can change over time, especially when feature columns are added or removed from your model.

Rule #9: Detect problems before exporting models.
Many machine learning systems have a stage where you export the model to serving. If there is an issue with an exported model, it is a user-facing issue. If there is an issue before, then it is a training issue, and users will not notice.

Do sanity checks right before you export the model. Specifically, make sure that the model's performance is reasonable on held-out data. Or, if you have lingering concerns with the data, don't export a model. Many teams continuously deploying models check the area under the ROC curve (or AUC) before exporting. Issues about models that haven't been exported require an e-mail alert, but issues on a user-facing model may require a page. So better to wait and be sure before impacting users.

Rule #10: Watch for silent failures.
This is a problem that occurs more for machine learning systems than for other kinds of systems. Suppose that a particular table that is being joined is no longer being updated. The machine learning system will adjust, and behavior will continue to be reasonably good, decaying gradually. Sometimes tables are found that were months out of date, and a simple refresh improved performance more than any other launch that quarter! The coverage of a feature may also change due to implementation changes: for example, a feature column could be populated in 90% of the examples, and suddenly drop to 60% of the examples. Play once had a table that was stale for 6 months, and refreshing the table alone gave a boost of 2% in install rate. If you track statistics of the data, as well as manually inspect the data on occasion, you can reduce these kinds of failures.
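A minimal sketch of the kind of data-statistics tracking that catches this (the example format and the 10-point alert threshold are illustrative assumptions, not recommendations):

    # Sketch: alert when the coverage of a feature column drops sharply
    # between yesterday's and today's training data.

    def coverage(examples, column):
        # Fraction of examples in which `column` has any feature populated.
        populated = sum(1 for ex in examples if ex.get(column))
        return populated / max(len(examples), 1)

    def coverage_alerts(yesterday, today, columns, max_drop=0.10):
        alerts = []
        for col in columns:
            old, new = coverage(yesterday, col), coverage(today, col)
            if old - new > max_drop:
                alerts.append("%s: coverage fell from %.0f%% to %.0f%%"
                              % (col, 100 * old, 100 * new))
        return alerts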

Rule #11: Give feature columns owners and documentation.
If the system is large, and there are many feature columns, know who created or is maintaining each feature column. If you find that the person who understands a feature column is leaving, make sure that someone has the information. Although many feature columns have descriptive names, it's good to have a more detailed description of what the feature is, where it came from, and how it is expected to help.

Your First Objective

You have many metrics, or measurements about the system that you care about, but your machine learning algorithm will often require a single objective, a number that your algorithm is "trying" to optimize. I distinguish here between objectives and metrics: a metric is any number that your system reports, which may or may not be important. See also Rule #2.

Rule #12: Don't overthink which objective you choose to directly optimize.
You want to make money, make your users happy, and make the world a better place. There are tons of metrics that you care about, and you should measure them all (see Rule #2). However, early in the machine learning process, you will notice them all going up, even those that you do not directly optimize. For instance, suppose you care about number of clicks, time spent on the site, and daily active users. If you optimize for number of clicks, you are likely to see the time spent increase.

So, keep it simple and don't think too hard about balancing different metrics when you can still easily increase all the metrics. Don't take this rule too far though: do not confuse your objective with the ultimate health of the system (see Rule #39). And, if you find yourself increasing the directly optimized metric, but deciding not to launch, some objective revision may be required.

Rule #13: Choose a simple, observable and attributable metric for your first objective.
Often you don't know what the true objective is. You think you do, but as you stare at the data and side-by-side analysis of your old system and new ML system, you realize you want to tweak it. Further, different team members often can't agree on the true objective. The ML objective should be something that is easy to measure and is a proxy for the "true" objective. (There is often no "true" objective; see Rule #39.) So train on the simple ML objective, and consider having a "policy layer" on top that allows you to add additional logic (hopefully very simple logic) to do the final ranking.

The easiest thing to model is a user behavior that is directly observed and attributable to an action of the system:
1. Was this ranked link clicked?
2. Was this ranked object downloaded?
3. Was this ranked object forwarded/replied to/e-mailed?
4. Was this ranked object rated?
5. Was this shown object marked as spam/pornography/offensive?
Avoid modeling indirect effects at first:
1. Did the user visit the next day?
2. How long did the user visit the site?
3. What were the daily active users?
Indirect effects make great metrics, and can be used during A/B testing and during launch decisions.
Finally, don't try to get the machine learning to figure out:
1. Is the user happy using the product?
2. Is the user satisfied with the experience?
3. Is the product improving the user's overall well-being?
4. How will this affect the company's overall health?
These are all important, but also incredibly hard. Instead, use proxies: if the user is happy, they will stay on the site longer. If the user is satisfied, they will visit again tomorrow. Insofar as well-being and company health are concerned, human judgement is required to connect any machine-learned objective to the nature of the product you are selling and your business plan, so we don't end up here.

Rule #14: Starting with an interpretable model makes debugging easier.
Linear regression, logistic regression, and Poisson regression are directly motivated by a probabilistic model. Each prediction is interpretable as a probability or an expected value. This makes them easier to debug than models that use objectives (zero-one loss, various hinge losses, et cetera) that try to directly optimize classification accuracy or ranking performance. For example, if probabilities in training deviate from probabilities predicted in side-by-sides or by inspecting the production system, this deviation could reveal a problem.

For example, in linear, logistic, or Poisson regression, there are subsets of the data where the average predicted expectation equals the average label (1-moment calibrated, or just "calibrated"). This is true assuming that you have no regularization and that your algorithm has converged; it is approximately true in general. If you have a feature which is either 1 or 0 for each example, then the set of examples where that feature is 1 is calibrated. Also, if you have a feature that is 1 for every example, then the set of all examples is calibrated.
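As an illustration, a minimal sketch of that calibration check on one slice of the data (the example format is an assumption):

    # Sketch: 1-moment calibration check on the slice of examples where a
    # given binary feature is 1. For an unregularized, converged logistic
    # model, mean prediction should be close to mean label on the slice.

    def calibration_on_slice(examples, feature):
        slice_ = [ex for ex in examples if ex["features"].get(feature) == 1]
        if not slice_:
            return None
        mean_pred = sum(ex["prediction"] for ex in slice_) / len(slice_)
        mean_label = sum(ex["label"] for ex in slice_) / len(slice_)
        return mean_pred, mean_label  # a large gap here is worth debugging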

With simple models, it is easier to deal with feedback loops (see Rule #36). Often, we use these probabilistic predictions to make a decision: e.g. rank posts in decreasing expected value (i.e. probability of click/download/etc.). However, remember when it comes time to choose which model to use, the decision matters more than the likelihood of the data given the model (see Rule #27).

Rule #15: Separate Spam Filtering and Quality Ranking in a Policy Layer.
Quality ranking is a fine art, but spam filtering is a war. The signals that you use to determine high-quality posts will become obvious to those who use your system, and they will tweak their posts to have these properties. Thus, your quality ranking should focus on ranking content that is posted in good faith. You should not discount the quality ranking learner for ranking spam highly. Similarly, "racy" content should be handled separately from Quality Ranking.

Spam filtering is a different story. You have to expect that the features that you need to generate will be constantly changing. Often, there will be obvious rules that you put into the system (if a post has more than three spam votes, don't retrieve it, et cetera). Any learned model will have to be updated daily, if not faster. The reputation of the creator of the content will play a great role.

At some level, the output of these two systems will have to be integrated. Keep in mind, filtering spam in search results should probably be more aggressive than filtering spam in e-mail messages. Also, it is a standard practice to remove spam from the training data for the quality classifier.

ML Phase II: Feature Engineering

In the first phase of the lifecycle of a machine learning system, the important issue is to get the training data into the learning system, get any metrics of interest instrumented, and create a serving infrastructure. After you have a working end-to-end system with unit and system tests instrumented, Phase II begins.

In the second phase, there is a lot of low-hanging fruit. There are a variety of obvious features that could be pulled into the system. Thus, the second phase of machine learning involves pulling in as many features as possible and combining them in intuitive ways. During this phase, all of the metrics should still be rising. There will be lots of launches, and it is a great time to pull in lots of engineers that can join up all the data that you need to create a truly awesome learning system.

Rule #16: Plan to launch and iterate.
Don't expect that the model you are working on now will be the last one that you will launch, or even that you will ever stop launching models. Thus consider whether the complexity you are adding with this launch will slow down future launches. Many teams have launched a model per quarter or more for years. There are three basic reasons to launch new models:
1. you are coming up with new features,
2. you are tuning regularization and combining old features in new ways, and/or
3. you are tuning the objective.

Regardless, giving a model a bit of love can be good: looking over the data feeding into the example can help find new signals as well as old, broken ones. So, as you build your model, think about how easy it is to add or remove or recombine features. Think about how easy it is to create a fresh copy of the pipeline and verify its correctness. Think about whether it is possible to have two or three copies running in parallel. Finally, don't worry about whether feature 16 of 35 makes it into this version of the pipeline. You'll get it next quarter.

Rule #17: Start with directly observed and reported features as opposed to learned features.
This might be a controversial point, but it avoids a lot of pitfalls. First of all, let's describe what a learned feature is. A learned feature is a feature generated either by an external system (such as an unsupervised clustering system) or by the learner itself (e.g. via a factored model or deep learning). Both of these can be useful, but they can have a lot of issues, so they should not be in the first model.

If you use an external system to create a feature, remember that the system has its own objective. The external system's objective may be only weakly correlated with your current objective. If you grab a snapshot of the external system, then it can become out of date. If you update the features from the external system, then the meanings may change. If you use an external system to provide a feature, be aware that this approach requires a great deal of care.

The primary issue with factored models and deep models is that they are non-convex. Thus, there is no guarantee that an optimal solution can be approximated or found, and the local minima found on each iteration can be different. This variation makes it hard to judge whether the impact of a change to your system is meaningful or random. By creating a model without deep features, you can get an excellent baseline performance. After this baseline is achieved, you can try more esoteric approaches.

Rule #18: Explore with features of content that generalize across contexts.
Often a machine learning system is a small part of a much bigger picture. For example, if you imagine a post that might be used in What's Hot, many people will plus-one, reshare, or comment on a post before it is ever shown in What's Hot. If you provide those statistics to the learner, it can promote new posts that it has no data for in the context it is optimizing. YouTube Watch Next could use number of watches, or co-watches (counts of how many times one video was watched after another was watched) from YouTube search. You can also use explicit user ratings. Finally, if you have a user action that you are using as a label, seeing that action on the document in a different context can be a great feature. All of these features allow you to bring new content into the context. Note that this is not about personalization: figure out if someone likes the content in this context first, then figure out who likes it more or less.

Rule #19: Use very specific features when you can.
With tons of data, it is simpler to learn millions of simple features than a few complex features. Identifiers of documents being retrieved and canonicalized queries do not provide much generalization, but align your ranking with your labels on head queries. Thus, don't be afraid of groups of features where each feature applies to a very small fraction of your data, but overall coverage is above 90%. You can use regularization to eliminate the features that apply to too few examples.

Rule #20: Combine and modify existing features to create new features in human-understandable ways.
There are a variety of ways to combine and modify features. Machine learning systems such as TensorFlow allow you to preprocess your data through transformations. The two most standard approaches are "discretizations" and "crosses".

Discretization consists of taking a continuous feature and creating many discrete features from it. Consider a continuous feature such as age. You can create a feature which is 1 when age is less than 18, another feature which is 1 when age is between 18 and 35, et cetera. Don't overthink the boundaries of these histograms: basic quantiles will give you most of the impact.

Crosses combine two or more feature columns. A feature column, in TensorFlow's terminology, is a set of homogeneous features (e.g. {male, female}, {US, Canada, Mexico}, et cetera). A cross is a new feature column with features in, for example, {male, female} × {US, Canada, Mexico}. This new feature column will contain the feature (male, Canada). If you are using TensorFlow and you tell TensorFlow to create this cross for you, this (male, Canada) feature will be present in examples representing male Canadians. Note that it takes massive amounts of data to learn models with crosses of three, four, or more base feature columns.
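For concreteness, a minimal sketch of a bucketized feature and a cross using the tf.feature_column API; the boundaries, vocabularies, and hash bucket size are illustrative choices, not recommendations:

    import tensorflow as tf

    # Discretization: turn continuous "age" into one-hot bucket features.
    age = tf.feature_column.numeric_column("age")
    age_buckets = tf.feature_column.bucketized_column(
        age, boundaries=[18, 25, 35, 50, 65])

    # Cross: combine two categorical feature columns, e.g. gender x country,
    # so the model can learn a weight for (male, Canada) specifically.
    gender = tf.feature_column.categorical_column_with_vocabulary_list(
        "gender", ["male", "female"])
    country = tf.feature_column.categorical_column_with_vocabulary_list(
        "country", ["US", "Canada", "Mexico"])
    gender_x_country = tf.feature_column.crossed_column(
        [gender, country], hash_bucket_size=1000)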

Crosses that produce very large feature columns may overfit. For instance, imagine that you are doing some sort of search, and you have a feature column with words in the query, and you have a feature column with words in the document. You can combine these with a cross, but you will end up with a lot of features (see Rule #21). When working with text there are two alternatives. The most draconian is a dot product. A dot product in its simplest form simply counts the number of common words between the query and the document. This feature can then be discretized. Another approach is an intersection: thus, we will have a feature which is present if and only if the word "pony" is in the document and the query, and another feature which is present if and only if the word "the" is in the document and the query.

Rule #21: The number of feature weights you can learn in a linear model is roughly proportional to the amount of data you have.
There are fascinating statistical learning theory results concerning the appropriate level of complexity for a model, but this rule is basically all you need to know. I have had conversations in which people were doubtful that anything can be learned from one thousand examples, or that you would ever need more than 1 million examples, because they get stuck in a certain method of learning. The key is to scale your learning to the size of your data:
1. If you are working on a search ranking system, and there are millions of different words in the documents and the query and you have 1000 labeled examples, then you should use a dot product between document and query features, TF-IDF, and a half-dozen other highly human-engineered features. 1000 examples, a dozen features.
2. If you have a million examples, then intersect the document and query feature columns, using regularization and possibly feature selection. This will give you millions of features, but with regularization you will have fewer. Ten million examples, maybe a hundred thousand features.
3. If you have billions or hundreds of billions of examples, you can cross the feature columns with document and query tokens, using feature selection and regularization. You will have a billion examples, and 10 million features.
Statistical learning theory rarely gives tight bounds, but gives great guidance for a starting point. In the end, use Rule #28 to decide what features to use.


Rule #22: Clean up features you are no longer using.
Unused features create technical debt. If you find that you are not using a feature, and that combining it with other features is not working, then drop it out of your infrastructure. You want to keep your infrastructure clean so that the most promising features can be tried as fast as possible. If necessary, someone can always add back your feature.

Keep coverage in mind when considering what features to add or keep. How many examples are covered by the feature? For example, if you have some personalization features, but only 8% of your users have any personalization features, it is not going to be very effective.

At the same time, some features may punch above their weight. For example, if you have a feature which covers only 1% of the data, but 90% of the examples that have the feature are positive, then it will be a great feature to add.

Human Analysis of the System

Before going on to the third phase of machine learning, it is important to focus on something that is not taught in any machine learning class: how to look at an existing model, and improve it. This is more of an art than a science, and yet there are several anti-patterns that it helps to avoid.

Rule #23: You are not a typical end user.
This is perhaps the easiest way for a team to get bogged down. While there are a lot of benefits to fishfooding (using a prototype within your team) and dogfooding (using a prototype within your company), employees should look at whether the performance is correct. While a change which is obviously bad should not be used, anything that looks reasonably near production should be tested further, either by paying laypeople to answer questions on a crowdsourcing platform, or through a live experiment on real users.

There are two reasons for this. The first is that you are too close to the code. You may be looking for a particular aspect of the posts, or you are simply too emotionally involved (e.g. confirmation bias). The second is that your time is too valuable. Consider the cost of nine engineers sitting in a one-hour meeting, and think of how many contracted human labels that buys on a crowdsourcing platform.

If you really want to have user feedback, use user experience methodologies. Create user personas (one description is in Bill Buxton's Sketching User Experiences) early in a process and do usability testing (one description is in Steve Krug's Don't Make Me Think) later. User personas involve creating a hypothetical user. For instance, if your team is all male, it might help to design a 35-year-old female user persona (complete with user features), and look at the results it generates rather than 10 results for 25-to-40-year-old males. Bringing in actual people to watch their reaction to your site (locally or remotely) in usability testing can also get you a fresh perspective.

Rule #24: Measure the delta between models.
One of the easiest, and sometimes most useful, measurements you can make before any users have looked at your new model is to calculate just how different the new results are from production. For instance, if you have a ranking problem, run both models on a sample of queries through the entire system, and look at the size of the symmetric difference of the results (weighted by ranking position). If the difference is very small, then you can tell without running an experiment that there will be little change. If the difference is very large, then you want to make sure that the change is good. Looking over queries where the symmetric difference is high can help you to understand qualitatively what the change was like. Make sure, however, that the system is stable. Make sure that a model when compared with itself has a low (ideally zero) symmetric difference.
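A minimal sketch of such a comparison for one query; the 1/rank weighting is just one reasonable position discount, not the only choice:

    # Sketch: position-weighted symmetric difference between the result
    # lists of the production model and the candidate model for one query.

    def weighted_symmetric_difference(prod_results, new_results):
        def weights(results):
            # Items near the top of the ranking count more.
            return {doc: 1.0 / (rank + 1) for rank, doc in enumerate(results)}
        prod_w, new_w = weights(prod_results), weights(new_results)
        only_prod = sum(w for doc, w in prod_w.items() if doc not in new_w)
        only_new = sum(w for doc, w in new_w.items() if doc not in prod_w)
        return only_prod + only_new

Averaged over a sample of queries, a value near zero means the new model barely changes anything; comparing a model with itself should give zero.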

Rule #25: When choosing models, utilitarian performance trumps predictive power.
Your model may try to predict click-through rate. However, in the end, the key question is what you do with that prediction. If you are using it to rank documents, then the quality of the final ranking matters more than the prediction itself. If you predict the probability that a document is spam and then have a cutoff on what is blocked, then the precision of what is allowed through matters more. Most of the time, these two things should be in agreement: when they do not agree, it will likely be on a small gain. Thus, if there is some change that improves log loss but degrades the performance of the system, look for another feature. When this starts happening more often, it is time to revisit the objective of your model.

Rule #26: Look for patterns in the measured errors, and create new features.
Suppose that you see a training example that the model got "wrong". In a classification task, this could be a false positive or a false negative. In a ranking task, it could be a pair where a positive was ranked lower than a negative. The most important point is that this is an example that the machine learning system knows it got wrong and would like to fix if given the opportunity. If you give the model a feature that allows it to fix the error, the model will try to use it.

On the other hand, if you try to create a feature based upon examples the system doesn't see as mistakes, the feature will be ignored. For instance, suppose that in Play Apps Search, someone searches for "free games". Suppose one of the top results is a less relevant gag app. So you create a feature for "gag apps". However, if you are maximizing number of installs, and people install a gag app when they search for free games, the "gag apps" feature won't have the effect you want.

Once you have examples that the model got wrong, look for trends that are outside your current feature set. For instance, if the system seems to be demoting longer posts, then add post length. Don't be too specific about the features you add. If you are going to add post length, don't try to guess what "long" means, just add a dozen features and let the model figure out what to do with them (see Rule #21). That is the easiest way to get what you want.

Rule #27: Try to quantify observed undesirable behavior.
Some members of your team will start to be frustrated with properties of the system they don't like which aren't captured by the existing loss function. At this point, they should do whatever it takes to turn their gripes into solid numbers. For example, if they think that too many "gag apps" are being shown in Play Search, they could have human raters identify gag apps. (You can feasibly use human-labelled data in this case because a relatively small fraction of the queries account for a large fraction of the traffic.) If your issues are measurable, then you can start using them as features, objectives, or metrics. The general rule is "measure first, optimize second".

Rule #28: Be aware that identical short-term behavior does not imply identical long-term behavior.
Imagine that you have a new system that looks at every doc_id and exact_query, and then calculates the probability of click for every doc for every query. You find that its behavior is nearly identical to your current system in both side-by-sides and A/B testing, so given its simplicity, you launch it. However, you notice that no new apps are being shown. Why? Well, since your system only shows a doc based on its own history with that query, there is no way to learn that a new doc should be shown.

The only way to understand how such a system would work long-term is to have it train only on data acquired when the model was live. This is very difficult.

Training-Serving Skew

Training-serving skew is a difference between performance during training and performance during serving. This skew can be caused by:
- a discrepancy between how you handle data in the training and serving pipelines, or
- a change in the data between when you train and when you serve, or
- a feedback loop between your model and your algorithm.
We have observed production machine learning systems at Google with training-serving skew that negatively impacts performance. The best solution is to explicitly monitor it so that system and data changes don't introduce skew unnoticed.

Rule #29: The best way to make sure that you train like you serve is to save the set of features used at serving time, and then pipe those features to a log to use them at training time.

Even if you can't do this for every example, do it for a small fraction, such that you can verify the consistency between serving and training (see Rule #37). Teams that have made this measurement at Google were sometimes surprised by the results. The YouTube home page switched to logging features at serving time with significant quality improvements and a reduction in code complexity, and many teams are switching their infrastructure as we speak.
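A minimal sketch of the idea; the request format and the compute_features/score_fn/log_line hooks are placeholders for your own serving stack, not a real API:

    import json
    import random

    LOG_FRACTION = 0.01  # logging even 1% of requests is enough to verify consistency

    def serve_and_log(request, compute_features, score_fn, log_line):
        features = compute_features(request)   # the exact serving-time features
        score = score_fn(features)
        if random.random() < LOG_FRACTION:
            # Training later reads these records instead of recomputing the
            # features, removing one source of training-serving skew.
            log_line(json.dumps({"request_id": request.get("id"),
                                 "features": features,
                                 "score": score}))
        return score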

Rule #30: Importance weight sampled data, don't arbitrarily drop it!
When you have too much data, there is a temptation to take files 1-12, and ignore files 13-99. This is a mistake: dropping data in training has caused issues in the past for several teams (see Rule #6). Although data that was never shown to the user can be dropped, importance weighting is best for the rest. Importance weighting means that if you decide that you are going to sample example X with a 30% probability, then give it a weight of 10/3. With importance weighting, all of the calibration properties discussed in Rule #14 still hold.
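A minimal sketch of importance-weighted sampling, matching the 30% example above (the example format is an assumption):

    import random

    def sample_with_importance_weight(examples, keep_prob=0.3):
        # Instead of silently dropping 70% of the data, keep a 30% sample and
        # up-weight each kept example by 1 / keep_prob (10/3 here), so every
        # example's expected contribution is unchanged and the calibration
        # properties of Rule #14 still hold.
        for ex in examples:
            if random.random() < keep_prob:
                ex = dict(ex)
                ex["weight"] = ex.get("weight", 1.0) / keep_prob
                yield ex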

Rule #31: Beware that if you join data from a table at training and serving time, the data in the table may change.
Say you join doc ids with a table containing features for those docs (such as number of comments or clicks). Between training and serving time, features in the table may be changed. Your model's prediction for the same document may then differ between training and serving. The easiest way to avoid this sort of problem is to log features at serving time (see Rule #32). If the table is changing only slowly, you can also snapshot the table hourly or daily to get reasonably close data. Note that this still doesn't completely resolve the issue.

Rule #32: Reuse code between your training pipeline and your serving pipeline whenever possible.
Batch processing is different than online processing. In online processing, you must handle each request as it arrives (e.g. you must do a separate lookup for each query), whereas in batch processing, you can combine tasks (e.g. making a join). At serving time, you are doing online processing, whereas training is a batch processing task. However, there are some things that you can do to reuse code. For example, you can create an object that is particular to your system where the result of any queries or joins can be stored in a very human-readable way, and errors can be tested easily. Then, once you have gathered all the information, during serving or training, you run a common method to bridge between the human-readable object that is specific to your system, and whatever format the machine learning system expects. This eliminates a source of training-serving skew. As a corollary, try not to use two different programming languages between training and serving; that decision will make it nearly impossible for you to share code.

Rule #33: If you produce a model based on the data until January 5th, test the model on the data from January 6th and after.
In general, measure performance of a model on the data gathered after the data you trained the model on, as this better reflects what your system will do in production. If you produce a model based on the data until January 5th, test the model on the data from January 6th. You will expect that the performance will not be as good on the new data, but it shouldn't be radically worse. Since there might be daily effects, you might not predict the average click rate or conversion rate, but the area under the curve, which represents the likelihood of giving the positive example a score higher than a negative example, should be reasonably close.
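A minimal sketch of that evaluation, assuming each example carries an ISO date string and using scikit-learn's roc_auc_score purely for illustration:

    from sklearn.metrics import roc_auc_score

    def temporal_split(examples, cutoff):
        # Train on everything up to and including the cutoff date; evaluate on
        # everything after it, which mirrors what the model faces in production.
        train = [ex for ex in examples if ex["date"] <= cutoff]
        test = [ex for ex in examples if ex["date"] > cutoff]
        return train, test

    def next_day_auc(score_fn, test_examples):
        labels = [ex["label"] for ex in test_examples]
        scores = [score_fn(ex["features"]) for ex in test_examples]
        return roc_auc_score(labels, scores)  # should stay reasonably close to holdout AUC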

Rule #34: In binary classification for filtering (such as spam detection or determining interesting e-mails), make small short-term sacrifices in performance for very clean data.
In a filtering task, examples which are marked as negative are not shown to the user. Suppose you have a filter that blocks 75% of the negative examples at serving. You might be tempted to draw additional training data from the instances shown to users. For example, if a user marks an e-mail as spam that your filter let through, you might want to learn from that.

But this approach introduces sampling bias. You can gather cleaner data if instead during serving you label 1% of all traffic as "held out", and send all held-out examples to the user. Now your filter is blocking at least 74% of the negative examples. These held-out examples can become your training data.

Note that if your filter is blocking 95% of the negative examples or more, this becomes less viable. Even so, if you wish to measure serving performance, you can make an even tinier sample (say 0.1% or 0.001%). Ten thousand examples is enough to estimate performance quite accurately.
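A minimal sketch of the 1% held-out traffic idea; the hashing scheme, field names, and threshold are assumptions:

    import hashlib

    HOLDOUT_FRACTION = 0.01  # 1% of traffic bypasses the filter

    def is_holdout(request_id, fraction=HOLDOUT_FRACTION):
        # Deterministic hash so a given request always lands in the same bucket.
        h = int(hashlib.sha256(str(request_id).encode()).hexdigest(), 16)
        return (h % 10000) < fraction * 10000

    def filter_decision(request_id, spam_score, threshold=0.9):
        if is_holdout(request_id):
            # Shown to the user regardless of the score; the user's reaction
            # becomes unbiased, "clean" training data.
            return "deliver"
        return "block" if spam_score > threshold else "deliver"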

Rule #35: Beware of the inherent skew in ranking problems.
When you switch your ranking algorithm radically enough that different results show up, you have effectively changed the data that your algorithm is going to see in the future. This kind of skew will show up, and you should design your model around it. There are multiple different approaches. These approaches are all ways to favor data that your model has already seen.
1. Have higher regularization on features that cover more queries as opposed to those features that are on for only one query. This way, the model will favor features that are specific to one or a few queries over features that generalize to all queries. This approach can help prevent very popular results from leaking into irrelevant queries. Note that this is opposite the more conventional advice of having more regularization on feature columns with more unique values.
2. Only allow features to have positive weights. Thus, any good feature will be better than a feature that is "unknown".
3. Don't have document-only features. This is an extreme version of #1. For example, even if a given app is a popular download regardless of what the query was, you don't want to show it everywhere. Not having document-only features keeps that simple.

(The reason you don't want to show a specific popular app everywhere has to do with the importance of making all the desired apps reachable. For instance, if someone searches for "bird watching app", they might download "angry birds", but that certainly wasn't their intent. Showing such an app might improve download rate, but leave the user's needs ultimately unsatisfied.)

Rule #36: Avoid feedback loops with positional features.
The position of content dramatically affects how likely the user is to interact with it. If you put an app in the first position it will be clicked more often, and you will be convinced it is more likely to be clicked. One way to deal with this is to add positional features, i.e. features about the position of the content in the page. You train your model with positional features, and it learns to weight, for example, the feature "1st-position" heavily. Your model thus gives less weight to other factors for examples with "1st-position=true". Then at serving you don't give any instances the positional feature, or you give them all the same default feature, because you are scoring candidates before you have decided the order in which to display them.

Note that it is important to keep any positional features somewhat separate from the rest of the model because of this asymmetry between training and testing. Having the model be the sum of a function of the positional features and a function of the rest of the features is ideal. For example, don't cross the positional features with any document feature.
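A minimal sketch of that training/serving asymmetry (the feature names are illustrative):

    def training_features(base_features, shown_position):
        # At training time, record where the item was actually shown so the
        # model can absorb position bias into a dedicated positional feature.
        feats = dict(base_features)
        feats["position"] = shown_position
        return feats

    def serving_features(base_features):
        # At serving time the order is not decided yet, so every candidate
        # gets the same default position (or the feature is omitted entirely).
        feats = dict(base_features)
        feats["position"] = 0
        return feats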

Rule #37: Measure Training/Serving Skew.
There are several things that can cause skew in the most general sense. Moreover, you can divide it into several parts:
1. The difference between the performance on the training data and the holdout data. In general, this will always exist, and it is not always bad.
2. The difference between the performance on the holdout data and the "next-day" data. Again, this will always exist. You should tune your regularization to maximize the next-day performance. However, large drops in performance between holdout and next-day data may indicate that some features are time-sensitive and possibly degrading model performance.
3. The difference between the performance on the "next-day" data and the live data. If you apply a model to an example in the training data and the same example at serving, it should give you exactly the same result (see Rule #5). Thus, a discrepancy here probably indicates an engineering error.
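For item 3, a minimal sketch of the consistency check, assuming the serving-time features and scores were logged as in Rule #29:

    def score_skew(logged_records, training_score_fn, tolerance=1e-6):
        # Compare the score logged at serving time with the score the training
        # stack produces for the exact same features; any nonzero gap usually
        # points to an engineering bug rather than a modeling issue.
        mismatches = []
        for rec in logged_records:
            train_score = training_score_fn(rec["features"])
            if abs(train_score - rec["score"]) > tolerance:
                mismatches.append((rec.get("request_id"), rec["score"], train_score))
        return mismatches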

ML Phase III: Slowed Growth, Optimization Refinement, and Complex Models

There will be certain indications that the second phase is reaching a close. First of all, your monthly gains will start to diminish. You will start to have tradeoffs between metrics: you will see some rise and others fall in some experiments. This is where it gets interesting. Since the gains are harder to achieve, the machine learning has to get more sophisticated.

A caveat: this section has more blue-sky rules than earlier sections. We have seen many teams go through the happy times of Phase I and Phase II machine learning. Once Phase III has been reached, teams have to find their own path.

Rule #38: Don't waste time on new features if unaligned objectives have become the issue.
As your measurements plateau, your team will start to look at issues that are outside the scope of the objectives of your current machine learning system. As stated before, if the product goals are not covered by the existing algorithmic objective, you need to change either your objective or your product goals. For instance, you may optimize clicks, plus-ones, or downloads, but make launch decisions based in part on human raters.

Rule #39: Launch decisions are a proxy for long-term product goals.
Alice has an idea about reducing the logistic loss of predicting installs. She adds a feature. The logistic loss drops. When she does a live experiment, she sees the install rate increase. However, when she goes to a launch review meeting, someone points out that the number of daily active users drops by 5%. The team decides not to launch the model. Alice is disappointed, but now realizes that launch decisions depend on multiple criteria, only some of which can be directly optimized using ML.

The truth is that the real world is not Dungeons and Dragons: there are no "hit points" identifying the health of your product. The team has to use the statistics it gathers to try to effectively predict how good the system will be in the future. They need to care about engagement, 1-day active users (DAU), 30-day active users, revenue, and advertisers' return on investment. These metrics that are measurable in A/B tests in themselves are only a proxy for more long-term goals: satisfying users, increasing users, satisfying partners, and profit, which even then you could consider proxies for having a useful, high-quality product and a thriving company five years from now.

The only easy launch decisions are when all metrics get better (or at least do not get worse). If the team has a choice between a sophisticated machine learning algorithm and a simple heuristic, and the simple heuristic does a better job on all these metrics, it should choose the heuristic. Moreover, there is no explicit ranking of all possible metric values. Specifically, consider the following two scenarios:

Experiment    Daily Active Users    Revenue/Day
A             1 million             $4 million
B             2 million             $2 million

If the current system is A, then the team would be unlikely to switch to B. If the current system is B, then the team would be unlikely to switch to A. This seems in conflict with rational behavior; however, predictions of changing metrics may or may not pan out, and thus there is a large risk involved with either change. Each metric covers some risk with which the team is concerned.

Moreover, no metric covers the team's ultimate concern: "where is my product going to be five years from now?"

Individuals, on the other hand, tend to favor one objective that they can directly optimize. Most machine learning tools favor such an environment. An engineer banging out new features can get a steady stream of launches in such an environment. There is a type of machine learning, multi-objective learning, which starts to address this problem. For instance, one can formulate a constraint satisfaction problem that has lower bounds on each metric, and optimizes some linear combination of metrics. However, even then, not all metrics are easily framed as machine learning objectives: if a document is clicked on or an app is installed, it is because the content was shown. But it is far harder to figure out why a user visits your site. How to predict the future success of a site as a whole is AI-complete, as hard as computer vision or natural language processing.

Rule #40: Keep ensembles simple.
Unified models that take in raw features and directly rank content are the easiest models to debug and understand. However, an ensemble of models (a "model" which combines the scores of other models) can work better. To keep things simple, each model should either be an ensemble only taking the input of other models, or a base model taking many features, but not both. If you have models on top of other models that are trained separately, then combining them can result in bad behavior.

Use a simple model for ensembling that takes only the output of your "base" models as inputs. You also want to enforce properties on these ensemble models. For example, an increase in the score produced by a base model should not decrease the score of the ensemble. Also, it is best if the incoming models are semantically interpretable (for example, calibrated) so that changes of the underlying models do not confuse the ensemble model. Also, enforce that an increase in the predicted probability of an underlying classifier does not decrease the predicted probability of the ensemble.
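One simple way to get that monotonicity, sketched below, is to restrict the ensemble to non-negative weights over calibrated base-model probabilities (the weights shown are illustrative, not from any real system):

    def ensemble_score(base_probs, weights):
        # base_probs: calibrated probabilities from each base model.
        # Non-negative weights summing to at most 1 keep the output a
        # probability and guarantee that raising any base score can never
        # lower the ensemble score.
        assert all(w >= 0 for w in weights.values()), "weights must be non-negative"
        return sum(w * base_probs[name] for name, w in weights.items())

    # Illustrative use:
    # ensemble_score({"ctr_model": 0.12, "quality_model": 0.40},
    #                {"ctr_model": 0.7, "quality_model": 0.3})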

Rule #41: When performance plateaus, look for qualitatively new sources of information to add rather than refining existing signals.
You've added some demographic information about the user. You've added some information about the words in the document. You have gone through template exploration, and tuned the regularization. You haven't seen a launch with more than a 1% improvement in your key metrics in a few quarters. Now what?

It is time to start building the infrastructure for radically different features, such as the history of documents that this user has accessed in the last day, week, or year, or data from a different property. Use wikidata entities or something internal to your company (such as Google's knowledge graph). Use deep learning. Start to adjust your expectations on how much return you expect on investment, and expand your efforts accordingly. As in any engineering project, you have to weigh the benefit of adding new features against the cost of increased complexity.

Rule #42: Don't expect diversity, personalization, or relevance to be as correlated with popularity as you think they are.
Diversity in a set of content can mean many things, with the diversity of the source of the content being one of the most common. Personalization implies each user gets their own results. Relevance implies that the results for a particular query are more appropriate for that query than any other. Thus all three of these properties are defined as being different from the ordinary.

The problem is that the ordinary tends to be hard to beat.

Note that if your system is measuring clicks, time spent, watches, +1s, reshares, et cetera, you are measuring the popularity of the content. Teams sometimes try to learn a personal model with diversity. To personalize, they add features that would allow the system to personalize (some features representing the user's interest) or diversify (features indicating if this document has any features in common with other documents returned, such as author or content), and find that those features get less weight (or sometimes a different sign) than they expect.

This doesn't mean that diversity, personalization, or relevance aren't valuable. As pointed out in the previous rule, you can do post-processing to increase diversity or relevance. If you see longer-term objectives increase, then you can declare that diversity/relevance is valuable, aside from popularity. You can then either continue to use your post-processing, or directly modify the objective based upon diversity or relevance.

Rule #43: Your friends tend to be the same across different products. Your interests tend not to be.
Teams at Google have gotten a lot of traction from taking a model predicting the closeness of a connection in one product, and having it work well on another. Your friends are who they are. On the other hand, I have watched several teams struggle with personalization features across product divides. Yes, it seems like it should work. For now, it doesn't seem like it does. What has sometimes worked is using raw data from one property to predict behavior on another. Also, keep in mind that even knowing that a user has a history on another property can help. For instance, the presence of user activity on two products may be indicative in and of itself.

Related Work

There are many documents on machine learning at Google as well as externally:
- Machine Learning Crash Course: an introduction to applied machine learning.
- Machine Learning: A Probabilistic Perspective by Kevin Murphy, for an understanding of the field of machine learning.
- Practical Advice for the Analysis of Large, Complex Data Sets: a data science approach to thinking about data sets.
- Deep Learning by Ian Goodfellow et al., for learning nonlinear models.
- The Google paper on technical debt, which has a lot of general advice.
- TensorFlow documentation.

Acknowledgements

Thanks to David Westbrook, Peter Brandt, Samuel Ieong, Chenyu Zhao, Li Wei, Michalis Potamias, Evan Rosen, Barry Rosenberg, Christine Robson, James Pine, Tal Shaked, Tushar Chandra, Mustafa Ispir, Jeremiah Harmsen, Konstantinos Katsiapis, Glen Anderson, Dan Duckworth, Shishir Birmiwal, Gal Elidan, Su Lin Wu, Jaihui Liu, Fernando Pereira, and Hrishikesh Aradhye for many corrections, suggestions, and helpful examples for this document. Also, thanks to Kristen Lefevre, Suddha Basu, and Chris Berg who helped with an earlier version. Any errors, omissions, or (gasp!) unpopular opinions are my own.

Appendix

There are a variety of references to Google products in this document. To provide more context, I give a short description of the most common examples below.

YouTube Overview
YouTube is a streaming video service. Both the YouTube Watch Next and YouTube Home Page teams use ML models to rank video recommendations. Watch Next recommends videos to watch after the currently playing one, while Home Page recommends videos to users browsing the home page.

Google Play Overview
Google Play has many models solving a variety of problems. Play Search, Play Home Page Personalized Recommendations, and Users Also Installed apps all use machine learning.

Google Plus Overview
Google Plus uses machine learning in a variety of situations: ranking posts in the "stream" of posts being seen by the user, ranking "What's Hot" posts (posts that are very popular now), ranking people you know, et cetera.
