The complete guide to clustering analysis: k-means and hierarchical clustering by hand and in R. An efficient way to install and load R packages. R…

# test

## Underfitting vs. Overfitting (vs. Best Fitting) in Machine Learning

The Challenge of Underfitting and Overfitting in Machine Learning. You’ll inevitably face this question in a data scientist interview: Can…

## Using PractRand to test an RNG

Yesterday I wrote about my experience using NIST STS to test an entropy extractor, a filtering procedure that produces unbiased…

## Testing entropy extractor with NIST STS

Around this time last year I wrote about the entropy extractor used in μRNG. It takes three biased random bit…

## How to Fix k-Fold Cross-Validation for Imbalanced Classification

Last Updated on January 13, 2020. Model evaluation involves using the available dataset to fit a model and estimate its performance…

## What is the Chi-Square Test and How Does it Work? An Intuitive Explanation with R Code

Step 1: First, import the data. Step 2: Validate it for correctness in R. Output: #Count…

## DSAT – First Ever Adaptive Learning Platform for Data Science Professionals

We believe that a single product similar to GMAT can revolutionize this entire industry. After the successful launch of Datamin,…

## Testing Rupert Miller’s suspicion

I was reading Rupert Miller’s book Beyond ANOVA when I ran across this line: I never use the Kolmogorov-Smirnov test…

## Data Engineering Blog

Transparent Schema Registry for Kafka Streams: Painlessly test Kafka Streams with Avro. Fluent Kafka Streams Tests: A Java test DSL for Kafka Streams. Running R on AWS Lambda: R is…

## Testing Cliff RNG with DIEHARDER

My previous post introduced the Cliff random number generator. The post showed how to find starting seeds where the generator…

## Hypothesis testing for dummies

Don’t worry, Python is here to save us. We can easily test this using the stats library from scipy in…

## Inferential Statistics: Understanding Hypothesis Testing Using Chi-Square Test

Well, we have multiple statistical techniques, like descriptive statistics, where we measure the data's central value, how it is spread…

## Log Book — Guide to Hypothesis Testing

This is a guide to hypothesis testing. I have tried to cover the basics of…

## Python Tutorial For Researchers Who use R

Installation, Loading Data, Visualization, Linear Regression, Rpy2. Jun Wu, Jul 2. This tutorial is aimed at…

## A quick run-through of Holt-Winters, Seasonal ARIMA and FB Prophet

Gregory Felton, Jun 27. Repository: gianfelton/Comparing-Holt-Winters-SARIMA-and-FBProphet. This is a simple notebook comparing the output of Holt-Winters,…

## Hypothesis Testing — An Introduction

To solve such problems we always start with a null hypothesis, and we assume that the null hypothesis is true…
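As a rough illustration of starting from a null hypothesis and assuming it is true, here is a minimal sketch of an exact two-sided binomial test. The coin-flip scenario, the function name `binomial_two_sided_p`, and the counts used are hypothetical and do not come from the original post.

```python
from math import comb


def binomial_two_sided_p(successes, trials, p_null=0.5):
    """Exact two-sided p-value for a binomial count, assuming the null is true.

    Sums the probabilities of every outcome that is no more likely
    than the observed one under the null success probability p_null.
    """
    pmf = [
        comb(trials, k) * p_null**k * (1 - p_null) ** (trials - k)
        for k in range(trials + 1)
    ]
    observed = pmf[successes]
    # Small relative tolerance guards against floating-point ties.
    return sum(p for p in pmf if p <= observed * (1 + 1e-9))


# Hypothetical data: 60 heads in 100 flips, null hypothesis "the coin is fair".
p_value = binomial_two_sided_p(60, 100)
print(f"p-value under the null: {p_value:.4f}")
```

A small p-value means the observed count would be unusual if the null hypothesis were true. Whether that is small enough to reject the null depends on the significance level chosen before looking at the data.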

## Statistics For Real World Data

Some useful statistical tools for imperfect data. Ryan Farmar, Jun 14. Introduction: In any introductory statistics course, you’ll pretty much always…

## Predicting Titanic Survivors (A Kaggle Competition)

We’ll find out! Let’s get started! 1.0 Importing the Data: The first step in the process is always to load in the data as…

## Top 10 Statistics Mistakes Made by Data Scientists

The model you built looked great in R&D but performs horribly in production. The model you said will do wonders…

## The ultimate guide to A/B testing. Part 1: experiment design

Maybe it would have been even worse if the game didn’t have that new mode. In this case, the best…

## Introduction to concise and expressive REST API testing framework — WebTau

Mykola Golubyev, May 28. Introduction: WebTau (short for web test automation) is a tool and…

## Singapore Flat Price Predictor

I will be running multiple experiments and doing some comparisons with the base model. I will cover that in the…

## 1st Place Solution for Intel Scene Classification Challenge

Hosted by Analytics Vidhya. Afzal Sayed, Jun 2. Introduction. Problem: You are provided with a dataset of ~25k…

## Top 10 reasons to write unit tests

Mike Willson, May 31. First off, what is a unit test? From Wikipedia: unit testing is a software testing…

## What Does a Lady Tasting Tea Have to Do with Science?

Even if the lady had no ability to distinguish milk-first from tea-first, she would be expected to get a couple…
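The point about guessing can be checked with a short calculation. This sketch assumes the classic form of the experiment, which the excerpt does not spell out: eight cups, four of each kind, with the lady asked to pick the four milk-first cups. The function name `guess_distribution` is hypothetical.

```python
from math import comb


def guess_distribution(cups_per_kind=4):
    """Probability of k correct milk-first picks under pure guessing.

    Assumes 2 * cups_per_kind cups in total and that the taster
    selects exactly cups_per_kind of them (a hypergeometric model).
    """
    n = cups_per_kind
    total = comb(2 * n, n)  # number of equally likely selections
    return {k: comb(n, k) * comb(n, n - k) / total for k in range(n + 1)}


dist = guess_distribution()
expected_correct = sum(k * p for k, p in dist.items())
p_all_correct = dist[4]

for k, p in dist.items():
    print(f"{k} correct: {p:.4f}")
print(f"expected correct picks when guessing: {expected_correct:.1f}")
```

Under these assumptions a pure guesser gets two of the four picks right on average, while getting all four right happens in only 1 of the 70 possible selections.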
