Overview Data science hackathons are a great way to test, improve and build your data science skillset Hear from top…
Continue Readingengineering
A Guide to the Databricks + AWS Cloud Data Lake Dev Day Workshop
The Databricks team has been working hard to recreate content and enhance the experience as we transition all our events…
Continue ReadingFeature Engineering and Selection (Book Review)
Data preparation is the process of transforming raw data into learning algorithms. In some cases, data preparation is a required…
Continue ReadingWant to Ace Data Science Hackathons? This Feature Engineering Guide is for you
Overview Feature engineering is a key aspect in acing data science hackathons Learn how to perform feature engineering here as…
Continue Reading“Above the Trend Line” – Your Industry Rumor Central for 4/22/2020
Above the Trend Line: your industry rumor central is a recurring feature of insideBIGDATA. In this column, we present a…
Continue ReadingData Science with Azure Databricks at Clifford Chance
Guest blog by Mirko Bernardoni (Fiume Ltd) and Lulu Wan (Clifford Chance) With headquarters in London, Clifford Chance is a…
Continue ReadingAI Under the Hood: Kaskada, Inc.
In this regular insideBIGDATA feature we highlight our industry’s movers and shakers, companies that are pushing technology forward, and setting…
Continue ReadingBuilding a Mature Machine Learning Team
By Ji Chao Zhang, Director of Software Engineering at Georgian Partners. While the knowledge and skills of your hires can…
Continue Reading9 Books Every Data Engineering Aspirant Must Read!
Did you know that ‘Data Engineer’ is the fastest-growing role in the industry? Currently, most data science aspirants are still…
Continue Reading7 Resources to Becoming a Data Engineer
Date Engineering is one of the fastest growing and in-demand occupations among Data Science practitioners. The ability to collect, store,…
Continue Reading6 Powerful Feature Engineering Techniques For Time Series Data (using Python)
Overview Feature engineering is a skill every data scientist should know how to perform, especially in the case of time…
Continue ReadingEmre Rencberoglu
Fundamental Techniques of Feature Engineering for Machine LearningAll required methods for…How to Calibrate Undersampled Model ScoresImbalanced data problems in binary prediction models and a…..…
Continue ReadingMachine Learning Pipelines: Feature Engineering Numbers
The answer: log scaling. Log Transformation#Plotting a histogram – log scalesns. set_style('whitegrid') #Picking a background colorfig, ax =plt. subplots()books_file['ratings_count']. hist(ax=ax,…
Continue Reading3 Design Principles for Engineering Data
3 Design Principles for Engineering DataJiaqi LiuBlockedUnblockFollowFollowingJul 8This outlines three design paradigms — Immutable Data, Data Lineage and Data Validation— and the…
Continue ReadingData Engineering — the Cousin of Data Science, is Troublesome
I guess I’ve found the answer. Lissie MeiBlockedUnblockFollowFollowingMay 7We always deem data science as the “sexiest job of the 21st century”.…
Continue ReadingWho Is a Data Engineer & How to Become a Data Engineer?
If you want to be such a hero, it’s never too late to start learning. In this post, I have…
Continue ReadingFeature engineering
Feature engineeringDiogo RibeiroBlockedUnblockFollowFollowingApr 29Feature engineering is the process of transforming raw, unprocessed data into a set of targeted features that…
Continue ReadingWhy you should do Feature Engineering first, Hyperparameter Tuning second as a Data Scientist
Why you should do Feature Engineering first, Hyperparameter Tuning second as a Data ScientistAdmond LeeBlockedUnblockFollowFollowingApr 21In fact, the realization that…
Continue ReadingInsight’s Data Science & Data Engineering programs expand to Los Angeles
Insight’s Data Science & Data Engineering programs expand to Los AngelesGeneviève SmithBlockedUnblockFollowFollowingApr 10Photo by Pedro Marroquin on UnsplashWe are excited to announce…
Continue ReadingFeature Engineering in SQL and Python: A Hybrid Approach
Feature Engineering in SQL and Python: A Hybrid ApproachSet up your workstation, reduce workplace clutter, maintain a clean namespace, and effortlessly…
Continue ReadingWe Found Structure in a Structureless Place: A Smarter Approach for Data Science Solutions
Identifying Data Sources is circled too. Yes, you may have noticed “Identifying Data Sources” is coupled with “Decomposing the Ask”…
Continue ReadingData Engineering Light: Expanding Choice for Production Workloads
Adding a Lower Tier, Lower Price Option Since its inception a guiding principle for Databricks has been the unification of…
Continue ReadingA Programmer’s Guide to Creating an Eclectic Bookshelf
– Data Driven InvestorAbout a month back, while I was sitting at a café and working on developing a website…
Continue ReadingMost impactful AI trends of 2018: The rise of ML Engineering
By Emmanuel Ameisen, Head of AI at Insight Data ScienceThe field of Machine Learning (ML) has been consistently evolving since…
Continue ReadingBuild a Pipeline for Harvesting Medium Top Author Data
Build a Pipeline for Harvesting Medium Top Author DataHow to Use Luigi and Docker to Build a Simple Data Engineering Pipeline…
Continue Reading