Here is a Visualization of Skills Needed to Become a Data Scientist: Where are Data Scientists in the Big Data…
Continue Readingdata
Data Science and Law | What One Lawyer Learned From a 50-Hour Data Science Bootcamp
Based on my criteria, Data Science Dojo’s data science bootcamp fit the bill: it’s a reasonably priced 5-day, 50-hour onsite program that didn’t…
Continue ReadingIntroduction to Blockchain and What it Means to Big Data
Using the Blockchain development technology for storing Big Data can be cost saving for companies..Blockchain has the capacity for storing…
Continue ReadingBig Data Ethics and 10 Controversial Data Science Experiments
Data and Big Data Ethics Data science is changing the game when it comes to manipulating data sets and visualizing…
Continue ReadingTen Myths About Data Science
Myth #5: Data Science Requires a Deep Understanding of Statistics and Statistical Methods While it’s true that Data Science requires…
Continue ReadingData Privacy and Anonymization Techniques
Simple Techniques to Anonymize Data A simple approach to maintaining personal data privacy when using data for predictive modeling or…
Continue ReadingDoes Data Democratization Result in Data Anarchy and Bad Business Decisions?
An Augmented Data Discovery Solution is not an Argument for Data Anarchy Rather, with the tools provided, an organization can…
Continue ReadingData Scientists, Stand out by Sharing Your Notebooks
To test out importing the notebook from Gist, we are going to create a free account with Watson Studio..This will…
Continue Reading1.1 Billion Taxi Rides with MapD & AWS EC2
$ vi create_trips_table.sql CREATE TABLE trips ( trip_id INTEGER, vendor_id VARCHAR(3) ENCODING DICT, pickup_datetime TIMESTAMP, dropoff_datetime TIMESTAMP, store_and_fwd_flag VARCHAR(1) ENCODING…
Continue ReadingData Science Professional Certificate
The courses in the Data Science Professional Certificate include: What is Data Science Tools for Data Science Data Science Methodology…
Continue ReadingA Billion Taxi Rides in Elasticsearch
$ sudo /etc/init.d/elasticsearch restart Importing a Billion Trips into Elasticsearch The machine used in this blog post has two physical…
Continue ReadingThis Week in Data Science (May 30, 2017)
Interesting Data Science Articles and News Managing Spark data handles in R – How to work with data handles and…
Continue ReadingHadoop 3 Single-Node Install Guide
$i fi done unset i fi export HADOOP_HOME=/opt/hadoop export PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:$HADOOP_HOME/bin:$HADOOP_HOME/sbin:/opt/hive/bin:/opt/spark/bin:/opt/presto/bin export HADOOP_CONF_DIR=/opt/hadoop/etc/hadoop export HDFS_NAMENODE_USER=root export HDFS_DATANODE_USER=root export HDFS_SECONDARYNAMENODE_USER=root export SPARK_HOME=/opt/spark…
Continue ReadingA Review of “Designing Data-Intensive Applications”
Chapter 3 discusses storage and retrieval of data and I think this is probably one of the best chapters for…
Continue Reading1.1 Billion Taxi Rides on Amazon Athena
CREATE EXTERNAL TABLE trips_parquet ( trip_id INT, vendor_id STRING, pickup_datetime TIMESTAMP, dropoff_datetime TIMESTAMP, store_and_fwd_flag STRING, rate_code_id SMALLINT, pickup_longitude DOUBLE, pickup_latitude…
Continue Reading1.1 Billion Taxi Rides on kdb+/q & 4 Xeon Phi CPUs
td:{` sv`:trips,(`$string x),`trips`} i:-1; wr:{td[(`$n),(i+:1),y]set update `p#passenger_count from delete year from select from x where year=y} rd:.Q.fc[{flip c!(t;",")0:x}] sr:{`year`passenger_count xasc…
Continue ReadingData Science Survey: The Results Are In!
Data Science Survey Q4: Which Data Science tool are you most interested in?.Finally, we asked about the primary tool or…
Continue ReadingArticle | “How Predictive Analytics is Transforming Risk Management” | Strategic Risk
But how can corporates and risk managers make better use of data in risk management?.Huge volumes of data make it…
Continue ReadingArticle | “Predictive Analytics: How Big Data Will Improve Outcomes and Efficiencies in Diagnosing and Treating Patients”
When this level of personal data is layered on top of healthcare claims, electronic medical records, or other data representative…
Continue ReadingData Visualization: Blending Art and Science to Tell Data Stories
In this special guest feature, Caitlin Willich, Associate Partner, Technology, Media and Entertainment at Clarity Insights, discusses how data visualization…
Continue ReadingDetecting Anomalies in Time Series Data: Deciphering the Noise and Zoning in on the Signals
Anomaly detection for time series data with deep learning – identifying the “unknown unknowns” One of the most effective ways…
Continue ReadingOnline Education is Paving a Smoother Path to Earning Data Science Skills
Turning this avalanche of data into meaningful business insights creates challenges that require data science skills..The need for professionals skilled…
Continue ReadingHow Can Data Science Improve UX Design?
It’s interesting to note that of all things, data science can help guide designers in a more creative direction, tailoring…
Continue ReadingHow Big Data Is Helping Drivers Stay Safer on the Road
Autonomous Vehicles Big data is also being used to make autonomous cars safer, reducing the already small amount of crashes…
Continue ReadingThe Data Scientist Shortage is Huge. Here’s How to Beat It.
What they need is (1) a strategic roadmap toward building data science skills and (2) an effective hiring and resourcing…
Continue Reading