news outlets and therefore were not included in the initial analysis.**123 articles (roughly a quarter of all in the database)…
Continue ReadingWhy every data scientist shall read “The Book of Why” by Judea Pearl
Despite numerous algorithms I had acquired, my puzzle remains.Puzzle That Algorithm Itself Cannot SolveIf you are not the kind of…
Continue ReadingShrinkage Estimators: Shrinking statistical estimates
For those interested in optimizing portfolios, look at OptimalPortfolio.I must agree, the name shrinkage is quite a strange one, but…
Continue ReadingData Scientists: Why are they so expensive to hire?
Therefore, a mere 23 student class size from one university and roughly around 700 graduating students from all universities offering…
Continue ReadingTop Highlights from the Amazing Machine Learning Tutorials Presented at NeurIPS (NIPS) 2018
Table of Contents Automatic Machine Learning Common Pitfalls for Studying the Human Side of Machine Learning Statistical Learning Theory:…
Continue ReadingData Completeness in the 2016 Elections Performance Index
Following the recent update to the index with data from 2016, we are dedicating a series of posts to exploring…
Continue ReadingReview: DeepMask (Instance Segmentation)
DeepMask is the CNN approach for instance segmentation.Image Classification: Classify the main object category within an image.Object Detection: Identify the…
Continue ReadingNet upvote prediction and subreddit-based sentence completion for Reddit comments:
The computational complexity of training this seq2seq model is higher than training the word embedding neural language model, so in…
Continue ReadingA different kind of (deep) learning: part 2
A different kind of (deep) learning: part 2Self Supervised learning: generative approachesGidi ShperberBlockedUnblockFollowFollowingDec 19IntroIn the previous post, we’ve discussed some self…
Continue ReadingIs a good idea to start an Airbnb business in Seattle?
The listings.csv contains 3818 rows and 93 columns data, with fruitful of features describing each property informaion.Before merging them together,…
Continue ReadingGenerating New Ideas for Machine Learning Projects Through Machine Learning
prediction for a mass neural network', 'learning of human activity recognition from analysis of text', "an nba player 's approach…
Continue ReadingSupport Vector Machine: Kernel Trick; Mercer’s Theorem
And what the heck is Kernel Trick ?From the previous post about support vectors, we have already seen (please check to…
Continue ReadingA Data Science Secret — A business Perspective is all you need
Look for extra data sources.For example, Crime data in a city might help banks provide a better credit line to…
Continue ReadingImproving The Wrong Error Is a Futile Effort.
They will end up not doing its task properly.You decide to build another cat classifier, and its goal is to…
Continue ReadingMachine Learning and Techniques
Training data can be generalized and that the model can be used on new data with some accuracy.Algorithms under supervised…
Continue ReadingOn the importance of proper data handling (part 1)
A 208×208 tile size works well for small objects, for medium 416×416 and for large 832×832.Large tile for the large…
Continue ReadingThe US and China aren’t in a “cold war” so stop calling it that
In recent months the New York Times has reported that “a cold war is being waged across the world’s most advanced…
Continue ReadingChina’s tech boom has inspired a wave of internet-related art
They are each other’s past, as well as each other’s future.” The creative, complex responses to censorship seen in work…
Continue ReadingChina’s tech giants want to go global. Just one thing might stand in their way.
The ambitions charted in recent government plans are far-ranging: to excel in areas like 5G mobile technology, seed breeding, and…
Continue ReadingHow to Think About Data in 2019
How to Think About Data in 2019It is tangible human beings, not abstract “data”, that power the online economyThe EconomistBlockedUnblockFollowFollowingJan 11Photo: Jekaterina…
Continue ReadingMultimodal Deep Learning
The most common method in practice is to combine high-level embeddings from the different inputs by concatenating them and then…
Continue ReadingObject Detection: An End to End Theoretical Perspective
I will be sure to correct myself and post.References:http://cs231n.github.io/transfer-learning/#tfhttp://cs231n.stanford.edu/slides/2017/cs231n_2017_lecture11.pdfEfficient Graph-Based Image Segmentation -http://people.cs.uchicago.edu/~pff/papers/seg-ijcv.pdfRich feature hierarchies for accurate object detection and…
Continue ReadingQuality inspection in manufacturing using deep learning based computer vision
Quality inspection in manufacturing using deep learning based computer visionImproving yield by removing bad quality material with image recognitionPartha DekaBlockedUnblockFollowFollowingDec 18Author:…
Continue ReadingBuilding sentence embeddings via quick thoughts
Same as skip-gram, both quick thoughts and skip-gram build leverage classifier to learn vectors.#a skip-thoughts, #b quick-thoughts (Logeswaran et al., 2018)Giving…
Continue ReadingThe Data Question
If someone were to ask me ‘the data question’ today, I would answer that more relevant training data always seems…
Continue Reading