Published inTDS ArchiveStop using 0.5 as the threshold for your binary classifierLearn how to set the optimal threshold for your Machine Learning model.Nov 29, 20224Nov 29, 20224
Published inTDS ArchiveCan I Trust My Model’s Probabilities? A Deep Dive into Probability CalibrationA practical guide on probability calibrationNov 10, 2022Nov 10, 2022
Published inTDS ArchiveDeploying a Data Science Platform on AWS: Parallelizing Experiments (Part III)A step-by-step guide to deploy a Data Science platform on AWS with open-source softwareNov 1, 2022Nov 1, 2022
Published inTDS ArchiveDeploying a Data Science Platform on AWS: Running containerized experiments (Part II)A step-by-step guide to deploy a Data Science platform on AWS with open-source softwareOct 26, 2022Oct 26, 2022
Published inTDS ArchiveDeploying a Data Science Platform on AWS: Setting Up AWS Batch (Part I)A step-by-step guide to deploy a Data Science platform on AWS with open-source softwareOct 7, 2022Oct 7, 2022
Tips and Tricks to Use Jupyter Notebooks EffectivelyA few things to make you 10x more productive with Jupyter.Aug 8, 2022Aug 8, 2022
Published inTDS ArchiveIntroducing Snapshot Testing for Jupyter Notebooksnbsnapshot is an open-source package that benchmarks notebook’s outputs to detect issues automatically.Jul 5, 20221Jul 5, 20221
Published inTDS ArchiveFrom Jupyter to Kubernetes: Refactoring and Deploying Notebooks Using Open-Source ToolsA step-by-step guide to going from a messy notebook to a pipeline running in KubernetesJun 23, 20221Jun 23, 20221
Published inTDS ArchiveAnalyze and plot 5.5M records in 20s with BigQuery and PloomberDevelop scalable pipelines on Google Cloud using open-source software.May 23, 2022May 23, 2022
Published inTDS ArchiveA Gentle Introduction to Open-Source ContributionsA step-by-step guide for contributing to an open-source projectMay 10, 20221May 10, 20221