NLP Kaggle CompetitionIntroductory Notebook and Exploratory Data AnalysisTara BoyleBlockedUnblockFollowFollowingFeb 4The Quora Insincere Questions Classification competition is a natural language processing task…
Continue Readingnumber
Supervised Learning: Basics of Classification and Main Algorithms
Supervised Learning: Basics of Classification and Main AlgorithmsVictor RomanBlockedUnblockFollowFollowingJan 31IntroductionAs stated in the first article of this series, Classification is…
Continue Reading10 Tips for Choosing the Optimal Number of Clusters
The cValid package can be used to simultaneously compare multiple clustering algorithms, to identify the best clustering approach and the…
Continue Reading6 Effective Email Marketing Metrics to Measure Success
Allow me to introduce you to the Click Through Rate (CTR). CTR is the number of clicks on links within…
Continue ReadingWhy I didn’t hire an event manager for my wedding
Why I didn’t hire an event manager for my weddingYasith LokugeBlockedUnblockFollowFollowingJan 19rsvp and automated mobile notification system using webtask and AWSAfter reading…
Continue ReadingStatistics is the Grammar of Data Science — Part 1
????Machine Learning libraries like Tensorflow or scikit-learn hide almost all the complex mathematics away from the user. That means that…
Continue ReadingPerpetual Currying in JavaScript
Perpetual Currying in JavaScriptParam SinghBlockedUnblockFollowFollowingJan 17Vatican Museum StairsOne of the best characteristics of JavaScript is it being a Functional Programming Languages…
Continue ReadingUsing Ascending Timers in Android Studio
Using Ascending Timers in Android StudioEvan LiuBlockedUnblockFollowFollowingJan 15The use of countdown timers in mobile apps and computer programs adds an extra…
Continue ReadingCurse of Dimensionality
Curse of DimensionalityBadreesh ShettyBlockedUnblockFollowFollowingJan 15In Machine Learning, we often have high-dimensional data. If we’re recording 60 different metrics for each…
Continue ReadingHow Smart is Your News Source?
How Smart is Your News Source?Text Data Analysis of 21 Different News OutletsMichael TaubergBlockedUnblockFollowFollowingJan 12I think it’s more important than ever to…
Continue ReadingProblem Solving With SQL
Maybe this was the only way you knew how to solve this problem. Take a moment and think about how…
Continue ReadingHow to use OpenCorporates and Companies House APIs
How to use OpenCorporates and Companies House APIsCarmen Aguilar GarcíaBlockedUnblockFollowFollowingJan 8Bitcoin — CC0A few days after starting my new job, one of my…
Continue ReadingThe Most Important Data Science Tool for Market and Customer Segmentation
The Most Important Data Science Tool for Market and Customer SegmentationUse K-means and let AI advise you how many segments…
Continue ReadingAirbnb Rental Listings Dataset Mining
One can perhaps attribute the success of Airbnb in NYC to the high rates charged by the hotels, which are…
Continue ReadingA Bayesian approach to estimate the effect of a content and a weekday on the post published on a Facebook page
A Bayesian approach to estimate the effect of a content and a weekday on the post published on a Facebook pageGulzina…
Continue ReadingA Billion Taxi Rides: AWS S3 versus HDFS
$ hive CREATE EXTERNAL TABLE trips_orc_s3 ( trip_id INT, vendor_id STRING, pickup_datetime TIMESTAMP, dropoff_datetime TIMESTAMP, store_and_fwd_flag STRING, rate_code_id SMALLINT, pickup_longitude…
Continue ReadingCheck sums and error detection
s = “H88CMK9BVJ10” n = decode(s) r = n % 37 print(r) print(encode(n, checksum=True)) This produces 32 H88CMK9BVJ10* As we…
Continue ReadingSpectral graph clustering and optimal number of clusters estimation
Next, we will provide an implementation for the eigengap heuristic computing of the optimal number of clusters in a dataset…
Continue ReadingAn introduction to context-oriented programming in Kotlin
Usually, functional-oriented languages have better scoping rules (pedant’s comment: again, not all procedural languages are C, so there are some…
Continue ReadingCode Challenge: Traversing Data Structures in Swift
We can use fast enumeration to traverse an Array in linear time — O(n):let sequence : Array<Int> = [1, 2, 3, 4,…
Continue ReadingPlanet Beehive
After all our goal today is to map out each activity on the map and making it comparable on the…
Continue ReadingCode doodling: isPrime
Then I’ll run the code on my laptop and time the results for each algorithm testing for primes in the…
Continue ReadingAn overview of topics extraction in Python with LDA
An example of a topic is shown below:flower * 0,2 | rose * 0,15 | plant * 0,09 |…Illustration of…
Continue ReadingTopic Mining on Amazon Reviews
topics review.What can we do with Topic Mining?We have seen how you can implement topic modelling, but often the question is:…
Continue ReadingThe complete guide for topics extraction with LDA (Latent Dirichlet Allocation) in Python
An example of a topic is shown below:flower * 0,2 | rose * 0,15 | plant * 0,09 |…There are…
Continue Reading