Machine learning models require all input and output variables to be numeric. This means that if your data contains categorical…
Continue Readingcategorical
WiDS Mysore
Getting started with kaggleKaggle is the world’s largest data science community to help you achieve your data science goals. Kaggle has…Python…
Continue Reading3 Ways to Encode Categorical Variables for Deep Learning
Machine learning and deep learning models, like those in Keras, require all input and output variables to be numeric. This…
Continue ReadingEarly Detection of Sepsis Using Physiological Data
Early Detection of Sepsis Using Physiological Datakaran sindwaniBlockedUnblockFollowFollowingJul 5What is Sepsis ?Sepsis is a potentially life-threatening condition caused by the body’s…
Continue ReadingSpectral encoding of categorical features
In tis case we can use Spectral Graph Theory methods to create low dimensional embedding of the categorical features. The…
Continue ReadingPreparing Tabular Data for Neural Networks
You can use a “multi-hot vector” which is exactly the same as a one-hot vector except more than one entry…
Continue Reading3 Awesome Visualization Techniques for every dataset
Some reusable ideas of graphs that can help us to find information about the data FAST. In this post, I…
Continue ReadingA step-by-step guide for creating advanced Python data visualizations with Seaborn / Matplotlib
A step-by-step guide for creating advanced Python data visualizations with Seaborn / MatplotlibAlthough there’re tons of great visualization tools in…
Continue ReadingGuide to Machine Learning in R for Beginners: Intro to Machine Learning
Guide to Machine Learning in R for Beginners: Intro to Machine LearningThis is part 1 of my Beginner’s series on Machine…
Continue ReadingData Preprocessing: A Practical Guide
Download the dataset from this link. This dataset has been published as a part of Kaggle competition. It has three .…
Continue ReadingHow to Perform Exploratory Data Analysis with Seaborn
How to Perform Exploratory Data Analysis with SeabornLorraine LiBlockedUnblockFollowFollowingFeb 18Exploratory Data Analysis (EDA) is an approach to analyzing datasets to summarize…
Continue ReadingPredicting Animal Shelter Outcomes
Predicting Animal Shelter OutcomesA guide to handling categorical variables in supervised machine learningRebecca VickeryBlockedUnblockFollowFollowingFeb 18Photo by Berkay Gumustekin on UnsplashI have been working…
Continue ReadingTop 10 Presentations from rstudio::conf 2019 – The Best R Conference of the Year!
This year’s conference, rstudio::conf 2019, promised an ever bigger and better show than 2018. And in this article, I have…
Continue ReadingOne-Hot Encoding is making your Tree-Based Ensembles worse, here’s why?
One-Hot Encoding is making your Tree-Based Ensembles worse, here’s why?Optimizing Tree-Based ModelsRakesh RaviBlockedUnblockFollowFollowingJan 10Image CreditsI am a Master’s Student in Data…
Continue ReadingGuide to Machine Learning in R for Beginners : Intro to Machine Learning
Quantitative variables typically have measurement units, such as pounds, dollars, years, volts, gallons, megabytes, inches, degrees, miles per hour, pounds…
Continue ReadingIntroduction to Data Preprocessing in Machine Learning
We can’t say that Blue<Green as it doesn’t make any sense to compare the colors as they don’t have any…
Continue ReadingA Starter Pack to Exploratory Data Analysis with Python, pandas, seaborn, and scikit-learn
In the categorical family, we have nominal and ordinal data, while in the Quantitative family, we have interval and ratio..It…
Continue Reading