Overview Understand what is Categorical Data Encoding Learn different encoding techniques and when to use them Introduction The performance…
Continue Readingencoding
1.1 Billion Taxi Rides using OmniSciDB and a MacBook Pro
Many believe that for near-instant analytics on billions of records youd need dedicated Linux clusters, several GPUs or proprietary Cloud…
Continue ReadingOrdinal and One-Hot Encodings for Categorical Data
Machine learning models require all input and output variables to be numeric. This means that if your data contains categorical…
Continue ReadingOne-Hot Encoding vs. Label Encoding using Scikit-Learn
What is One-Hot Encoding? When should you use One-Hot Encoding over Label Encoding? These are typical data science interview questions…
Continue ReadingRaheel Shaikh
Cross Validation Explained: Evaluating estimator performance. Improve your ML model using cross…The ABC of Machine LearningWhat you need to know before you…
Continue ReadingCounting to infinity at compile time
If we can do 2 we can do 4, and 8, and 16. By this point we’re encoding the operation…
Continue Reading1.1 Billion Taxi Rides with MapD & AWS EC2
$ vi create_trips_table.sql CREATE TABLE trips ( trip_id INTEGER, vendor_id VARCHAR(3) ENCODING DICT, pickup_datetime TIMESTAMP, dropoff_datetime TIMESTAMP, store_and_fwd_flag VARCHAR(1) ENCODING…
Continue Reading1.1 Billion Taxi Rides with MapD 3.0 & 2 GPU-Powered p2.8xlarge EC2 Instances
| |===============================+======================+======================| | 0 Tesla K80 On | 0000:00:17.0 Off | 0 | | N/A 62C P0 70W / 149W…
Continue Reading1.1 Billion Taxi Rides with MapD & 4 Nvidia Titan Xs
$ vi create_trips_table.sql CREATE TABLE trips ( trip_id INTEGER, vendor_id VARCHAR(3) ENCODING DICT, pickup_datetime TIMESTAMP, dropoff_datetime TIMESTAMP, store_and_fwd_flag VARCHAR(1) ENCODING…
Continue Reading1.1 Billion Taxi Rides with MapD & 8 Nvidia Pascal Titan Xs
$ vi create_trips_table.sql CREATE TABLE trips ( trip_id INTEGER, vendor_id VARCHAR(3) ENCODING DICT, pickup_datetime TIMESTAMP, dropoff_datetime TIMESTAMP, store_and_fwd_flag VARCHAR(1) ENCODING…
Continue Reading1.1 Billion Taxi Rides with MapD & 8 Nvidia Tesla K80s
| |===============================+======================+======================| | 0 Tesla K80 On | 0000:06:00.0 Off | Off | | N/A 48C P0 67W / 149W…
Continue ReadingBase 32 and base 64 encoding
There’s no firm convention for whether to use upper or lower case letters.Base 64 encodingThe common use for base 64…
Continue ReadingGetting Data ready for modelling: Feature engineering, Feature Selection, Dimension Reduction (Part 1)
Encoding: So, What and Why is Encoding?Most algorithms we use work with numerical values whereas more often than not categorical…
Continue Reading