OverviewLearn about Information Retrieval (IR), Vector Space Models (VSM), and Mean Average Precision (MAP)Create a project on Information Retrieval using…
Continue Readinginformation
Hands-on NLP Project: A Comprehensive Guide to Information Extraction using Python
Overview Information extraction is a powerful NLP concept that will enable you to parse through any piece of text Learn…
Continue ReadingGame (Theory) for AI? An Illustrated Guide for Everyone
For Nash Equilibrium, we can conclude that it is a “No Regret” solution for any game, but not necessarily the…
Continue ReadingComputed IDs and privacy implications
Thirty years ago, a lot of US states thought it would be a good idea to compute someone’s drivers license…
Continue ReadingChinese character frequency and entropy
Yesterday I wrote a post looking at the frequency of Koine Greek letters and the corresponding entropy. David Littleboy asked…
Continue ReadingInformation Gain and Mutual Information for Machine Learning
Information gain calculates the reduction in entropy or surprise from transforming a dataset in some way. It is commonly used…
Continue ReadingA Gentle Introduction to Information Entropy
Because they are two sides of the same coin. […] Information theory and machine learning still belong together. Brains are…
Continue ReadingAmendment to CCPA regarding personal information
California’s new privacy law takes effect January 1, 2020, less than 100 days from now. The bill was written in…
Continue ReadingHow Search Engines like Google Retrieve Results: Introduction to Information Extraction using Python and spaCy
Pattern: X, including Y doc = nlp(“Eight people, including two children, were injured in the explosion”) for tok in doc:…
Continue ReadingHow fast were dead languages spoken?
A new paper in Science suggests that all human languages carry about the same amount of information per unit time.…
Continue ReadingWhat does CCPA say about de-identified data?
Section 1798. 140 says thatFor purposes of this title: (a) “Aggregate consumer information” means information that relates to a group…
Continue ReadingUnderstanding Differential Privacy
The idea behind PATE is applying a DP aggregation (Report Noisy Max algorithm) on outputs of sensitive models, called “teachers”…
Continue ReadingCross-entropy: From an Information theory point of view
Cross-entropy: From an Information theory point of viewashutosh nayakBlockedUnblockFollowFollowingJun 21Cross-entropy is a widely used loss function in machine learning for classification…
Continue ReadingFootball Wages- 15 year Analysis with Python’s Beautiful Soup
Football Wages- 15 year Analysis with Python’s Beautiful SoupA Multiple Page Web Scraping Tutorial using Beautiful SoupStephen FordhamBlockedUnblockFollowFollowingJun 13IntroductionWeb scraping…
Continue ReadingThe Too-much-Information Paradox
The Too-much-Information ParadoxChema Alvarez EchevarriaBlockedUnblockFollowFollowingJun 8To understand what is going on today in technology, we should have a look back…
Continue ReadingHow AI Technologies Can Put Purpose and Profit into ESG Investments
Many institutional investors today are seeking more than simple profit. They are looking to make a social and environmental difference,…
Continue ReadingA Data Science Approach to Choosing a Physician
In our case, we have hundreds of specialties and hundreds of typos, so…we’ll come back and extract these features if…
Continue ReadingData Demystified — DIKW model
In short, a person who is better at statistics than any software engineer and better at software engineering than any…
Continue ReadingKeys to Successful AI Solutions
Faisal Akkawi, faculty director of Northwestern University’s Master of Science in Information Systems program, highlights the school’s recently announced AI…
Continue ReadingA Very Simple Data Integration Project: Rebrickable LEGO Datasets
A Very Simple Data Integration Project: Rebrickable LEGO DatasetsEmanuele SicurellaBlockedUnblockFollowFollowingMay 20I perfectly remember the first time I got an injection. I…
Continue ReadingThe most influential factor of IMDB movie rating — Part I: Data Scraping
The most influential factor of IMDB movie rating — Part I: Data ScrapingYuri DaiBlockedUnblockFollowFollowingMay 15I have always been an enthusiastic fan of movie,…
Continue ReadingWeb Scraping For Financial Analyst Beautifulsoup, Scrapy, Selenium & Twitter API
Web Scraping For Financial Analyst Beautifulsoup, Scrapy, Selenium & Twitter APISenthil EBlockedUnblockFollowFollowingMay 1IntroductionI was learning about web scraping recently and thought…
Continue ReadingLet’s understand what an API is
Let’s understand what an API isMikelBlockedUnblockFollowFollowingApr 21Photo by Nina Ž. on UnsplashWhat is REST?REST, REpresentational State Transfer, is a type of web development…
Continue ReadingWhen Data is Scarce… Ways to Extract Valuable Insights
Photo credit: PixabayWhen Data is Scarce… Ways to Extract Valuable InsightsDescriptive statistics, Exploratory Data Analysis, and Natural Language Processing (NLP) techniques to…
Continue ReadingTips for Keeping Data HIPAA Compliant
googletag. cmd. push(function() { googletag. display(div-gpt-ad-1439400881943-0); }); HIPAA stands for Health Insurance Portability and Accountability Act, and it was introduced…
Continue Reading