A basic idea in numerical integration is that if a method integrates polynomials exactly, it should do well on polynomial-like…

Continue Reading# points

## Logistic trajectories

This post is a follow-on to the post on how to make the logistic bifurcation diagram below. That post plotting…

Continue Reading## Advantages of redundant coordinates

Barycentric coordinates make some things much simpler. For example, the coordinates of the three vertices are (1, 0, 0), (0,…

Continue Reading## Predicting environmental carcinogens with logistic regression, knn, gradient boosting and molecular fingerprinting

Predicting environmental carcinogens with logistic regression, knn, gradient boosting and molecular fingerprintingBalancing imbalanced data, exploring accuracy metrics, and an introduction…

Continue Reading## Here’s how you can accelerate your Data Science on GPU

Here’s how you can accelerate your Data Science on GPUGeorge SeifBlockedUnblockFollowFollowingJul 3Data Scientists need computing power. Whether you’re processing a big…

Continue Reading## “Where today?” — Planning my Singapore trip with clusters

“Where today?” — Planning my Singapore trip with clustersOutlining travel plans with R, k-medoids and Google MapsJuan De Dios SantosBlockedUnblockFollowFollowingJul 1Planning is not easy.…

Continue Reading## Linear Algebra. Points matching with SVD in 3D space

Linear Algebra. Points matching with SVD in 3D spaceAndrey NikishaevBlockedUnblockFollowFollowingJun 30ProblemWe need to find best rotation & translation params between two…

Continue Reading## Suicide in the 21st Century (Part 2)

If you didn’t catch part 1 you can find it below:Suicide in the 21st Century (Part 1)Suicide is not contagious,…

Continue Reading## An overview of different unsupervised learning techniques

Well, there are several ways like cross-validation, information criteria, the information theoretic jump method, the silhouette method, and the G-means…

Continue Reading## Mobility Data, Feature Engineering and Hierarchical Clustering

One concept that beautifully captures the level of randomness in a sequence of events is found in the domain of…

Continue Reading## When Machine Learning Solutions Are Not Possible!

When Machine Learning Solutions Are Not Possible!Five Scenarios Every Data Scientist Should Consider before Proposing Machine Learning Solutions. Rasoul BanaeeyanBlockedUnblockFollowFollowingJun…

Continue Reading## Best clustering algorithms for anomaly detection

How to use it?”Now we have the clusters…How can we detect anomalies in the test data?The approach I’ve followed to classify…

Continue Reading## SVM: Feature Selection and Kernels

(Source: https://towardsdatascience. com/support-vector-machine-vs-logistic-regression-94cc2975433f)SVM: Feature Selection and KernelsPier Paolo IppolitoBlockedUnblockFollowFollowingJun 2A Support Vector Machine (SVM) is a supervised machine learning algorithm that…

Continue Reading## Modern Astrophysics At The Forefront of Data Science

So, what can we do with this data?Figure 3. Simplified diagram showing exoplanet transiting in front of the host star…

Continue Reading## An Easy Introduction to SQL for Data Scientists

Inserting data into our table can be done using a command called INSERT followed by the table name and a…

Continue Reading## Using machine learning to understand customers behavior

You were using Euclidean distance. It is the square root of the sum of squared differences between corresponding elements of…

Continue Reading## Outlier Detection and Treatment: A Beginner's Guide

Outlier Detection and Treatment: A Beginner's GuideSwetha LakshmananBlockedUnblockFollowFollowingMay 8One of the most important steps in data pre-processing is outlier detection…

Continue Reading## Extracting and Analyzing 1000 Basketball Games using Pandas and Chartify

We will narrow our scope to some specific fields for this project: GameId: This is not crucial for analysis but database-wise…

Continue Reading## K-Means Clustering in SAS

K-Means Clustering in SASDhilip SubramanianBlockedUnblockFollowFollowingMay 1What is Clustering?“Clustering is the process of dividing the datasets into groups, consisting of similar data-points”.…

Continue Reading## Getting started with Visualizations in Python

First things first, don’t even think of relating it with Bar graphs. -Histograms are very different from Bar graphs, in the…

Continue Reading## Plotting business locations on maps using multiple Plotting libraries in Python

GeoPandas was able to plot all the points. However, I found two major drawbacks here which include not having an…

Continue Reading## AI Series: A walk into the theory of learning.

well…making sure that the algorithm does not learn…too much!If an algorithm maps very precisely all the data points of a…

Continue Reading## Unsupervised Learning Project: Creating Customer Segments

Unsupervised Learning Project: Creating Customer SegmentsLearn how to develop and end-to-end Clustering and Dimensionality Reduction Project!Victor RomanBlockedUnblockFollowFollowingApr 28IntroductionThroughout this project, we…

Continue Reading## How to do that animated ‘race’ bar chart

How to do that animated ‘race’ bar chartExplore the all time best teams of English football and learn how to make…

Continue Reading## The Fundamental Problem of Machine Learning, Without Math

Apparently there’s an entire field dedicated to algorithms which create models that extract patterns from data and apply them to…

Continue Reading