permutations — 362,880. If we sample from this permutations, pick a permutation, say 612934578 and apply the following rule:-Pick every number from…
Continue Readingtimes
The Incredible Shrinking Bernoulli
The Incredible Shrinking BernoulliSimulating Hacker News inter-arrival times distribution with the flip of a coinJean-Frederic PlanteBlockedUnblockFollowFollowingJun 29Joey Kyber via PexelsBernoulli counting processBernoulli distributions…
Continue ReadingA Beginners Introduction into MapReduce
A Beginners Introduction into MapReduceDima ShulgaBlockedUnblockFollowFollowingApr 7Many times, as Data Scientists, we have to deal with huge amount of data.…
Continue ReadingHow machine learning is accelerating last-mile, and last-meter, delivery
While much of the logistics industry’s efforts to accelerate delivery times focuses on optimizing routes, it turns out that’s not…
Continue ReadingBeer, Bravado & Bitbucket: Using data to drive CODE decisions
All I have so far is an idea. Sure, I could load the Pull Request page a couple times before…
Continue ReadingComparison of Linked Data Triplestores: Developing the Methodology
It’s impossible to tell but almost certainly the latter. This is of course equally true for query 7. One interesting…
Continue ReadingPredicting Kickstarter Campaign Success with Gradient Boosted Decision Trees: A Machine Learning Classification Problem
Predicting Kickstarter Campaign Success with Gradient Boosted Decision Trees: A Machine Learning Classification ProblemRiley PredumBlockedUnblockFollowFollowingFeb 2I was surfing data. world…
Continue ReadingThe Mythical 10X Programmer
I posit a few reasons:In any given real-life scenario, some developers bring more to the table than others, and some…
Continue Reading1.2 Billion Taxi Rides on AWS RDS running PostgreSQL
$ time ( ./initialize_database.sh; ./import_trip_data.sh; ./import_uber_trip_data.sh; cat analysis/prepare_analysis.sql tlc_statistics/create_statistics_tables.sql | psql trips; cd tlc_statistics; ruby import_statistics_data.rb ) The following were…
Continue ReadingFollowing an idea to its logical conclusion
If you double the length of the sides of a Euclidean rectangle 15 times, you do double the area 15…
Continue ReadingDescriptive Statistics
A data can have one or more than one mode.If there is only one number that appears maximum number of…
Continue ReadingImproving Patient Flows With Data Science And Analytics
We now access to more complete data sets that in our experience can be in upwards of the billions of…
Continue ReadingFit to Print: Finding the medium in the message
This tool can take a new article, find its best stylistic match and highlight some changes that might bring the…
Continue Reading