Ad by Google
But I would say books are always a very good material to learn anything beat Java, Hadoop, Cooking etc.
In terms of technology, nowadays big data technologies are on demand and most search able items on the internet,here I have listed top 10 resources to learn big data technology. In this post, I am going to list top 10 or you can say very popular books to learn Big data. If you want learn a hello world program in Hadoop MapReduce framework, than you may follow this step by step guide to learn Hello world in Hadoop.
Top 10 Big Data Books
1. The Data Lake Development with Big Data: Authors Pradeep Pasupuleti and Beulah Salome Purra.
This book mainly explore architectural approaches to building Data Lakes that ingest, index, manage, and analyse massive amounts of data using Big Data technologies. It will focus on architect data governance, security, data quality, data lineage tracking, meta-data management, and semantic data tagging.
One of my favourite book, I was started with The Definitive Guide published in 2009, I would say Big data developer should must go with this book to start.
This book is ideal for programmers looking to analyse datasets of any size, and for administrators who want to set up and run Hadoop.
This book has step by step explanation of single node cluster set-up to distributed cluster set-up.
Hadoop in Action introduces the subject and teaches you how to write programs in the MapReduce style. It starts with a few easy examples and then moves quickly to show Hadoop use in more complex data analysis tasks. Included are best practices and design patterns of MapReduce programming.
The book not only clarifies most big data tools in the market but also provides best practices for using them. The book provides recipes that are based on the latest versions of Apache Hadoop 2.X, YARN, Hive, Pig, Sqoop, Flume, Apache Spark, Mahout and many more such ecosystem tools.
One of the quick start guide, It focus on the concept and components of Hadoop 2 and covering the Hadoop eco systems
Book explain the concept of big data, what is big data and the value of big data.
Second edition of book updated with Hadoop 2 and explain the way to create YARN application, integrate real-time technologies like Spark and Storm as well as predictive analytic using Mahout.
It explain what is big data, how it will change our lives, and what we can do to protect ourselves from its hazards.
Books defined big data, designing and configuring pseudo distributed cluster system as well as setting up fully distributed clustered environment. Managing DFS and MapReduce cluster.
The go-to guidebook for deploying Big Data solutions with Hadoop Today's enterprise architects need to understand how the Hadoop frameworks and APIs fit together, and how they can be integrated to deliver real-world solutions.
Here is a quick start guide to Hadoop Single node cluster setup and Hello World in MapReduce.