Data science is largely an enigma to the enterprise. Although there’s an array of self-service options to automate its various…
Continue Readingcommon
Expected length of longest common DNA substrings
If we have two unrelated sequences of DNA, how long would we expect the longest common substring of the the…
Continue ReadingPreston Badeer
The Easiest Way to Get Fake DataThis simple method covers most use casesA great rule of thumb for writing code, especially in…
Continue ReadingWilsen Tjhung
PHP 5 vs. PHP 7Let’s discuss the main differences between PHP 5 and PHP 7 in this blog. 1. PerformanceCallbacks vs.…
Continue ReadingThe Data Fabric for Machine Learning – Part 2: Building a Knowledge-Graph
In the last articles of the series:The Data Fabric for Machine Learning. Part 1. How the new advances in…
Continue ReadingCausal vs. Statistical Inference
I hope not, I suspect that it is reasonable to expect that chocolate does not cause one to be a…
Continue ReadingFilter, Aggregate and Join in Pandas, Tidyverse, Pyspark and SQL
Filter, Aggregate and Join in Pandas, Tidyverse, Pyspark and SQLYu ZhouBlockedUnblockFollowFollowingNov 18, 2018Alike but different (Source)The Blessing and CurseOne of the most…
Continue ReadingExploring the Tokyo Neighbourhoods: Data-Science in Real Life
(Source: Louie Martinez)As a part of the final IBM Capstone Project, we get a tang of what data scientists go…
Continue ReadingThe Data Fabric for Machine Learning. Part 2: Building a Knowledge-Graph.
The Data Fabric for Machine Learning. Part 2: Building a Knowledge-Graph. Favio VázquezBlockedUnblockFollowFollowingApr 4Before being able to develop a Data…
Continue ReadingNumber Theory — History & Overview
Number Theory — History & OverviewPart I — What Is Number Theory & Why Is It Relevant Today?Jesus NajeraBlockedUnblockFollowFollowingMar 3Math is the Universe’s natural tongue. Since…
Continue ReadingThe Reason Why ‘www8’ Exists
The Reason Why ‘www8’ ExistsAlvin TaiBlockedUnblockFollowFollowingFeb 11Many years ago, I made a decision for my website, alvintai. com, to omit the…
Continue ReadingPredicting Friendship
Let’s take a look at five potential ways of answering those questions. Measure 1: Common NeighborsThe most intuitive and simplest…
Continue ReadingEconomics, power laws, and hacking
Increasing costs impact some players more than others. Those who know about power laws and know how to prioritize are…
Continue ReadingI wrote a Python program to calculate the most commonly used words in subreddits. Here’s what I found…
I used a set instead of a regular list for more efficient lookup time O(1) vs O(n)Yikes, I’m still adding…
Continue Reading