Published inTDS ArchiveStop using 0.5 as the threshold for your binary classifierLearn how to set the optimal threshold for your Machine Learning model.Nov 29, 2022534Nov 29, 2022534
Published inTDS ArchiveCan I Trust My Model’s Probabilities? A Deep Dive into Probability CalibrationA practical guide on probability calibrationNov 10, 202218Nov 10, 202218
Published inTDS ArchiveDeploying a Data Science Platform on AWS: Parallelizing Experiments (Part III)A step-by-step guide to deploy a Data Science platform on AWS with open-source softwareNov 1, 2022175Nov 1, 2022175
Published inTDS ArchiveDeploying a Data Science Platform on AWS: Running containerized experiments (Part II)A step-by-step guide to deploy a Data Science platform on AWS with open-source softwareOct 26, 202210Oct 26, 202210
Published inTDS ArchiveDeploying a Data Science Platform on AWS: Setting Up AWS Batch (Part I)A step-by-step guide to deploy a Data Science platform on AWS with open-source softwareOct 7, 202245Oct 7, 202245
Tips and Tricks to Use Jupyter Notebooks EffectivelyA few things to make you 10x more productive with Jupyter.Aug 8, 202227Aug 8, 202227
Published inTDS ArchiveIntroducing Snapshot Testing for Jupyter Notebooksnbsnapshot is an open-source package that benchmarks notebook’s outputs to detect issues automatically.Jul 5, 2022331Jul 5, 2022331
Published inTDS ArchiveFrom Jupyter to Kubernetes: Refactoring and Deploying Notebooks Using Open-Source ToolsA step-by-step guide to going from a messy notebook to a pipeline running in KubernetesJun 23, 202261Jun 23, 202261
Published inTDS ArchiveAnalyze and plot 5.5M records in 20s with BigQuery and PloomberDevelop scalable pipelines on Google Cloud using open-source software.May 23, 202274May 23, 202274
Published inTDS ArchiveA Gentle Introduction to Open-Source ContributionsA step-by-step guide for contributing to an open-source projectMay 10, 2022351May 10, 2022351