You can also leave any feedback and questions via my personal website ???? Right after the AI talk has finished, I asked a…
Continue Readinggroup
Hypothesis testing for dummies
Don’t worry, Python is here to save us. We can easily test this using the stats library from scipy in…
Continue ReadingHomomorphic encryption
A function that satisfiesf(x*y) = f(x)*f(y)is called a homomorphism. The symbol “*” can stand for any operation, and it need…
Continue ReadingHow Do We Survive in PUBG?
What kind of strategies can help them survive until the end?This project is also my first attempt to go through…
Continue ReadingUnsupervised Learning: Clustering
That’s where unsupervised learning comes in. So what is unsupervised learning?There are three types of unsupervised learning: clustering (what we’re…
Continue ReadingDo children eat more food when they prepared their own healthy and balanced meal?
Do children eat more food when they prepared their own healthy and balanced meal?Conducted a Two-Sample T-Test in R to compare…
Continue ReadingIdeas: Design Methodologies for Data Sprints
Photo by William Iven on UnsplashIdeas: Design Methodologies for Data SprintsAnne GibbonBlockedUnblockFollowFollowingMay 30I recently spent four days at a research lab with…
Continue ReadingDifference-in-Differences Analyses with Natural Experiments
One approach is to look for a natural experiment, in which treatment and control groups occur naturally, and apply a…
Continue ReadingEvolution of Traditional Statistical Tests in the Age of Data
Well it’s simple we use a theorem and make an assumption based on it. *Enter* The Central Limit TheoremIn particular,…
Continue ReadingNatural Language Processing — Event Extraction
Natural Language Processing — Event ExtractionExtracting events from news articlesRodrigo NaderBlockedUnblockFollowFollowingMay 2The amount of text generated every day is mind-blowing. Millions of data…
Continue ReadingAI Fairness — Explanation of Disparate Impact Remover
AI Fairness — Explanation of Disparate Impact RemoverIntroduction to AI FairnessStacey RonaghanBlockedUnblockFollowFollowingApr 22AI Fairness is an important topic for machine learning practitioners. We must…
Continue ReadingGroups in categories
The first time I saw a reference to a “group in a category” I misread it as something in the…
Continue ReadingWhat is an isogeny?
The previous post said that isogenies between elliptic curves are the basis for a quantum-resistant encryption method, but we didn’t…
Continue ReadingSome properties of ASCII characters
Some properties of ASCII charactersAnthony AbeoBlockedUnblockFollowFollowingApr 20When writing programs that deal with characters and strings, some of the methods programmers…
Continue ReadingStock Clustering with Time Series Clustering in R
Stock Clustering with Time Series Clustering in RYin-Ta PanBlockedUnblockFollowFollowingAug 9, 2018IMPORTANT: THIS IS NOT INVESTMENT ADVICE. As a newbie in stock…
Continue ReadingUsing Regular Expression in Genetics with Python
finds the preceding character or character group zero or one times. If it is a requirement to be specific or…
Continue ReadingHow neuroscientists analyze data from transparent fish brains : part 2, clustering neural data.
Finding ensembles of neurons with the same activity is an extremely important question for neuroscientists, since groups of synchronous neurons…
Continue ReadingMatch Markdown links with advanced regex features
Here is the trick: explicitly look for space using [ ], between square brackets. Let’s update our regular expression with…
Continue ReadingMongoDB: Optimizing Aggregation
MongoDB: Optimizing AggregationElvis RozarioBlockedUnblockFollowFollowingJan 29Recently I went through a couple of code reviews involving MongoDB aggregations. The aggregations were giving…
Continue ReadingHas China’s Investments Succeeded in Changing Country Governance Abroad?
Has China’s Investments Succeeded in Changing Country Governance Abroad?Rio RinaldiBlockedUnblockFollowFollowingJan 23Photo by chuttersnap on UnsplashChina’s foreign aid program now rivals some…
Continue Reading50-node Presto Cluster on Amazon EMR
$ hive CREATE EXTERNAL TABLE trips_orc ( trip_id INT, vendor_id STRING, pickup_datetime TIMESTAMP, dropoff_datetime TIMESTAMP, store_and_fwd_flag STRING, rate_code_id SMALLINT, pickup_longitude…
Continue Reading1.1 Billion Taxi Rides: EC2 versus EMR
SELECT passenger_count, year(pickup_datetime), count(*) FROM trips_orc GROUP BY passenger_count, year(pickup_datetime); The following completed in 65 seconds..SELECT passenger_count, year(pickup_datetime) trip_year, round(trip_distance),…
Continue ReadingGroups of order 2019
from itertools import product identity = (0, 1) h_list = [1, 255, 417] def elem(x): g, h = x g_ok…
Continue ReadingBuilding Data Pipelines With Kafka
In this case we assign a set of consumers to a consumer group id, Kafka will make sure that a…
Continue ReadingGetting Started with Group Programming
We also decided to combine “research” work into the “simple” bucket because we felt it would be better for one…
Continue Reading