Migrating from Hadoop on-premises to the cloud has been a common theme in recent Databricks blog posts and conference sessions.…
Continue Readinghadoop
How Informatica Data Engineering Goes Hadoop-less with Databricks
Back in May, we announced our partnership with Informatica to build out a rich set of integrations between our two…
Continue ReadingA sneak peak of my transformational journey with Data Science
But, Data Science acted as a Savior for me. And like a true savior, it lending me a hand to…
Continue ReadingIs Hadoop Dead?
First, there are HDFS clusters with 600+ PB of capacity. The in-memory nature of HDFS metadata means you can happily…
Continue Reading7 Reasons Why Java Developers Should Learn Hadoop
If you’re like me, you will choose the attractive girl. You see, life is full of options and making the…
Continue Reading“Architecting Modern Data Platforms” Book Review
In December OReilly published Architecting Modern Data Platforms, a 636-page guide to implementing Hadoop projects in enterprise environments. The book…
Continue ReadingHadoop Up and Running
The output from the above job will look something like the following: creating new scratch bucket mrjob-3568101f09d5f75c using s3://mrjob-3568101f09d5f75c/tmp/ as…
Continue Reading