Building an End-To-End Data Science Project
Learnings from my Data Scientist Ideal Profiles project. It is often said that the majority of a Data…
LDA on the Texts of Harry Potter
Feel free to contact me with any questions! In this post, I’ll describe topic modeling with Latent Dirichlet Allocation and compare…
GPS Trajectories Clustering in Python
In the second, we will show how to use and customize the algorithm in Python. Neuroimage Algorithms and GPS Trajectories Clustering. Instead…
No One Wants Your Neural Network
They’ll accept your model, if it means that one of those things happens, but that doesn’t mean they want it. Before…
Random forests explained intuitively
One quick example I use very frequently to explain how random forests work is the way a company has…
4 Pillars of Analytics
Data acquisition, processing, surfacing and actioning are key to an effective analytics initiative. Julien Kervizic, Dec 7. There are four…
Improving Data Quality with Product Similarity Search
In addition, it can help enrich product data: if some product data lack certain attribute values, it is possible to…
Meetup on Airflow and Cloud Composer
Later, we will load it into our data warehouse cluster on the cloud, then we will process it by…
Latent Semantic Indexing (LSI)
Karthik Pansetty, Dec 5. Latent Semantic Indexing (LSI), also known as Latent Semantic Analysis (LSA), is a text processing technique…
Data Engineer VS Data Scientist
This pyramid illustrates well the process necessary to use data in a company. At the base, software developers will be working…
Double Q-Learning, the Easy Way
He pointed out that the poor performance is caused by large overestimation of action values due to the use of…
Canada and France plan an international panel to assess AI’s dangers
Deep Transfer Learning for Natural Language Processing: Text Classification with Universal Embeddings
We’ve had some recent successes with word embeddings including methods like Word2Vec, GloVe and FastText, all of which I have…
Avoiding Parking Tickets in San Francisco Using Data Analytics
This is a short(er) write-up that summarizes and expands upon findings…
Basic NLP on the Texts of Harry Potter: Topic Modeling with Latent Dirichlet Allocation
Greg Rafferty, Dec 6. I’m Greg Rafferty, a data…
Land of the “Super Founders”: A Data-Driven Approach to Uncover the Secrets of Billion Dollar Startups
I Spent 300 Hours Gathering Data…
Finding our way: thoughts about information architecture’s history
Zach Lantz, Dec 4. The Great Library of Alexandria. “The most common definition [of the word…
Predicting the task duration based on a range
We discussed that we can fit the estimates (both for the Agile and Waterfall projects) to a Log-Normal distribution, which…
Data Studio with BigQuery: 2018's best practices
We don’t want that. At Google Next 18, the Developer Advocate for Data Studio, Minhaz Kazi, and I gave a talk on…
Animating Your Data Visualizations Like a Boss Using R
When conveying motion, each frame is a different plot, built using some relevant subset of the aggregate data. The…
Facial recognition has to be regulated to protect the public, says AI report
The speed at which facial recognition has grown comes down to the rapid development of a type of machine learning…
Clustering Ethereum Addresses
Categorizing addresses using patterns in transaction activity. Will Price, Dec 6. Introduction: Ethereum users may be anonymous, but their addresses are unique…
Datacleaning fighter jets
I already downloaded and cleaned data, and it was an experience I didn’t want to repeat. I couldn’t find a way…
TagOverflow: Correlating Tags in Stackoverflow
We imported the whole dump of StackOverflow into Neo4j, ran the algorithms and then visualized the results using Neo4j Browser…