By Manu Joseph, Problem Solver, Practitioner, Researcher at Thoucentric Analytics. Interpretability is the degree to which a human can understand…
Continue Readingmodel
How to Use Out-of-Fold Predictions in Machine Learning
Last Updated on December 6, 2019Machine learning algorithms are typically evaluated using resampling techniques such as k-fold cross-validation. During the…
Continue ReadingDeep Learning Tutorial Demonstrates How to Simplify Distributed Deep Learning Model Inference Using Delta Lake and Apache Spark™
On October 10th, our team hosted a live webinar—Simple Distributed Deep Learning Model Inference—with Xiangrui Meng, Software Engineer at Databricks.…
Continue ReadingSniffing Out Errors
From this you gain an understanding of where your model is succeeding and what can be amended to improve performance.…
Continue ReadingA Gentle Introduction to Model Selection for Machine Learning
Given easy-to-use machine learning libraries like scikit-learn and Keras, it is straightforward to fit many different machine learning models on…
Continue ReadingHow to Save and Reuse Data Preparation Objects in Scikit-Learn
Last Updated on November 20, 2019It is critical that any data preparation performed on a training dataset is also performed…
Continue ReadingHow to Connect Model Input Data With Predictions for Machine Learning
Fitting a model to a training dataset is so easy today with libraries like scikit-learn. A model can be fit…
Continue ReadingStop explaining black box machine learning models for high stakes decisions and use interpretable models instead
The paper is a mix of technical and philosophical arguments and comes with two main takeaways for me: firstly, a…
Continue ReadingRaymond Willey
Predicting Heart Disease MortalityBuilding a machine learning model that can identify high-risk…Integrating Python & TableauWhen performing in-depth analyses on large and…
Continue ReadingBenjamin Obi Tayo Ph.D.
Data Science Minimum: 10 Essential Skills You Need to Know to Start Doing Data ScienceMedium Partner Program’s New Model for Calculating…
Continue Reading9 Practical Actions to Improve Machine Learning for Fraud Prevention
Start with simpler, more transparent, and explainable and bias-free models and graduate to complicated models over time. MODELS – Experiment…
Continue ReadingA Gentle Introduction to Expectation-Maximization (EM Algorithm)
Last Updated on November 1, 2019 Maximum likelihood estimation is an approach to density estimation for a dataset by searching…
Continue ReadingProbabilistic Model Selection with AIC, BIC, and MDL
Model selection is the problem of choosing one from among a set of candidate models. It is common to choose…
Continue ReadingWhy is Machine Learning Deployment Hard?
By Alexandre Gonfalonieri, AI Consultant. After several AI projects, I realized that deploying Machine Learning (ML) models at scale is…
Continue ReadingA Gentle Introduction to Logistic Regression With Maximum Likelihood Estimation
Last Updated on October 28, 2019 Logistic regression is a model for binary classification predictive modeling. The parameters of a…
Continue ReadingA Gentle Introduction to Linear Regression With Maximum Likelihood Estimation
Last Updated on October 25, 2019 Linear regression is a classical model for predicting a numerical quantity. The parameters of…
Continue ReadingBuild your First Linear Regression Model in Qlik Sense
Think about it before you read the answer. The best line is the one that minimizes the distance of all…
Continue ReadingA Gentle Introduction to Bayesian Belief Networks
Probabilistic models can define relationships between variables and be used to calculate probabilities. For example, fully conditional models may require…
Continue ReadingProductionizing Machine Learning: From Deployment to Drift Detection
Try this notebook to reproduce the steps outlined below and watch our on-demand webinar to learn more. In many articles and…
Continue ReadingMany Heads Are Better Than One: The Case For Ensemble Learning
By Jay Budzik, ZestFinance. “The interests of truth require a diversity of opinions. ” —J. S. MillBanks and lenders are…
Continue ReadingCommon Machine Learning Obstacles
Sponsored Post. By Seth DeLand, Product Marketing Manager, Data Analytics, MathWorksEngineers and scientists who are modeling with machine learning…
Continue ReadingHow to Implement the Inception Score (IS) for Evaluating GANs
Generative Adversarial Networks, or GANs for short, is a deep learning neural network architecture for training a generator model for…
Continue ReadingGuest Blog: Using Databricks, MLflow, and Amazon SageMaker at Brandless to Bring Recommendation Systems to Production
This is a guest blog from Adam Barnhard, Head of Data at Brandless, Inc. , and Bing Liang, Data Scientist at Brandless, Inc. Launched…
Continue ReadingAutoML on Databricks: Augmenting Data Science from Data Prep to Operationalization
Thousands of data science jobs are going unfilled today as global demand for the talent greatly outstrips supply. Every day,…
Continue ReadingHow (Not) To Scale Deep Learning in 6 Easy Steps
However, there’s an important flaw. The final evaluation on the held-out 10% validation data shows that true accuracy is more…
Continue Reading