permutations — 362,880. If we sample from this permutations, pick a permutation, say 612934578 and apply the following rule:-Pick every number from…

Continue Reading# times

## The Incredible Shrinking Bernoulli

The Incredible Shrinking BernoulliSimulating Hacker News inter-arrival times distribution with the flip of a coinJean-Frederic PlanteBlockedUnblockFollowFollowingJun 29Joey Kyber via PexelsBernoulli counting processBernoulli distributions…

Continue Reading## A Beginners Introduction into MapReduce

A Beginners Introduction into MapReduceDima ShulgaBlockedUnblockFollowFollowingApr 7Many times, as Data Scientists, we have to deal with huge amount of data.…

Continue Reading## How machine learning is accelerating last-mile, and last-meter, delivery

While much of the logistics industry’s efforts to accelerate delivery times focuses on optimizing routes, it turns out that’s not…

Continue Reading## Beer, Bravado & Bitbucket: Using data to drive CODE decisions

All I have so far is an idea. Sure, I could load the Pull Request page a couple times before…

Continue Reading## Comparison of Linked Data Triplestores: Developing the Methodology

It’s impossible to tell but almost certainly the latter. This is of course equally true for query 7. One interesting…

Continue Reading## Predicting Kickstarter Campaign Success with Gradient Boosted Decision Trees: A Machine Learning Classification Problem

Predicting Kickstarter Campaign Success with Gradient Boosted Decision Trees: A Machine Learning Classification ProblemRiley PredumBlockedUnblockFollowFollowingFeb 2I was surfing data. world…

Continue Reading## The Mythical 10X Programmer

I posit a few reasons:In any given real-life scenario, some developers bring more to the table than others, and some…

Continue Reading## 1.2 Billion Taxi Rides on AWS RDS running PostgreSQL

$ time ( ./initialize_database.sh; ./import_trip_data.sh; ./import_uber_trip_data.sh; cat analysis/prepare_analysis.sql tlc_statistics/create_statistics_tables.sql | psql trips; cd tlc_statistics; ruby import_statistics_data.rb ) The following were…

Continue Reading## Following an idea to its logical conclusion

If you double the length of the sides of a Euclidean rectangle 15 times, you do double the area 15…

Continue Reading## Descriptive Statistics

A data can have one or more than one mode.If there is only one number that appears maximum number of…

Continue Reading## Improving Patient Flows With Data Science And Analytics

We now access to more complete data sets that in our experience can be in upwards of the billions of…

Continue Reading## Fit to Print: Finding the medium in the message

This tool can take a new article, find its best stylistic match and highlight some changes that might bring the…

Continue Reading