Published inTowards Data ScienceStop using 0.5 as the threshold for your binary classifierLearn how to set the optimal threshold for your Machine Learning model.Nov 29, 20224Nov 29, 20224
Published inTowards Data ScienceCan I Trust My Model’s Probabilities? A Deep Dive into Probability CalibrationA practical guide on probability calibrationNov 10, 2022Nov 10, 2022
Published inTowards Data ScienceDeploying a Data Science Platform on AWS: Parallelizing Experiments (Part III)A step-by-step guide to deploy a Data Science platform on AWS with open-source softwareNov 1, 2022Nov 1, 2022
Published inTowards Data ScienceDeploying a Data Science Platform on AWS: Running containerized experiments (Part II)A step-by-step guide to deploy a Data Science platform on AWS with open-source softwareOct 26, 2022Oct 26, 2022
Published inTowards Data ScienceDeploying a Data Science Platform on AWS: Setting Up AWS Batch (Part I)A step-by-step guide to deploy a Data Science platform on AWS with open-source softwareOct 7, 2022Oct 7, 2022
Tips and Tricks to Use Jupyter Notebooks EffectivelyA few things to make you 10x more productive with Jupyter.Aug 8, 2022Aug 8, 2022
Published inTowards Data ScienceIntroducing Snapshot Testing for Jupyter Notebooksnbsnapshot is an open-source package that benchmarks notebook’s outputs to detect issues automatically.Jul 5, 20221Jul 5, 20221
Published inTowards Data ScienceFrom Jupyter to Kubernetes: Refactoring and Deploying Notebooks Using Open-Source ToolsA step-by-step guide to going from a messy notebook to a pipeline running in KubernetesJun 23, 20221Jun 23, 20221
Published inTowards Data ScienceAnalyze and plot 5.5M records in 20s with BigQuery and PloomberDevelop scalable pipelines on Google Cloud using open-source software.May 23, 2022May 23, 2022
Published inTowards Data ScienceA Gentle Introduction to Open-Source ContributionsA step-by-step guide for contributing to an open-source projectMay 10, 20221May 10, 20221