To learn big data technologies, you can join an institute or training centre, watch videos on YouTube, or learn from blogs and tutorials.
But I would say books are always very good material for learning anything, be it Java, Hadoop, or cooking.
Big data technologies are in demand nowadays and are among the most searched topics on the internet. Here I have listed the top 10 resources to learn big data technology. In this post, I am going to list the top 10, or you could say the most popular, books to learn big data. If you want to learn a hello world program in the Hadoop MapReduce framework, then you may follow this step-by-step guide to learn Hello World in Hadoop.

Top 10 Big Data Books


1. Data Lake Development with Big Data: Authors Pradeep Pasupuleti and Beulah Salome Purra
This book mainly explores architectural approaches to building data lakes that ingest, index, manage, and analyse massive amounts of data using big data technologies. It focuses on architecting data governance, security, data quality, data lineage tracking, metadata management, and semantic data tagging.

2. Hadoop: The Definitive Guide, 4th Edition: Author Tom White
One of my favourite books. I started with The Definitive Guide published in 2009, and I would say every big data developer should start with this book.
This book is ideal for programmers looking to analyse datasets of any size, and for administrators who want to set up and run Hadoop.
It has a step-by-step explanation of everything from single-node cluster set-up to distributed cluster set-up.

3. Hadoop In Action: Author Chuck Lam
Hadoop in Action introduces the subject and teaches you how to write programs in the MapReduce style. It starts with a few easy examples and then moves quickly to show Hadoop use in more complex data analysis tasks. Included are best practices and design patterns of MapReduce programming.

4. Hadoop Real-World Solutions Cookbook - Second Edition: Author Tanmay Deshpande
The book not only clarifies most big data tools in the market but also provides best practices for using them. The book provides recipes that are based on the latest versions of Apache Hadoop 2.X, YARN, Hive, Pig, Sqoop, Flume, Apache Spark, Mahout and many more such ecosystem tools.

5. Hadoop 2 Quick-Start Guide: Author Douglas Eadline
One of the quick-start guides. It focuses on the concepts and components of Hadoop 2 and covers the Hadoop ecosystem.

6. Hadoop for Dummies: Author Dirk deRoos
The book explains the concept of big data: what big data is and the value of big data.

7. Hadoop In Practice: Author Alex Holmes
The second edition of the book is updated for Hadoop 2 and explains how to create YARN applications and integrate real-time technologies like Spark and Storm, as well as predictive analytics using Mahout.

8. Big Data: A Revolution That Will Transform How We Live, Work, and Think: Authors Viktor Mayer-Schönberger and Kenneth Cukier
It explains what big data is, how it will change our lives, and what we can do to protect ourselves from its hazards.

9. Hadoop Operations 1st Edition: Author Eric Sammer
The book defines big data and covers designing and configuring a pseudo-distributed cluster, setting up a fully distributed cluster environment, and managing DFS and MapReduce clusters.

10. Professional Hadoop Solutions: Author Boris Lublinsky
The go-to guidebook for deploying big data solutions with Hadoop. Today's enterprise architects need to understand how the Hadoop frameworks and APIs fit together, and how they can be integrated to deliver real-world solutions.






Here is a quick start guide to Hadoop Single node cluster setup and Hello World in MapReduce.
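
If you just want a feel for the MapReduce style before opening any of these books, here is a minimal sketch of the classic word count "hello world" in plain Python. It does not use Hadoop or any Hadoop API; the function names (map_phase, shuffle, reduce_phase, word_count) are my own, chosen only for illustration, and the whole thing runs in a single process instead of on a cluster.

```python
from collections import defaultdict


def map_phase(line):
    """Map step: emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs):
    """Shuffle step: group all emitted values by key (the word)."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(word, counts):
    """Reduce step: sum the counts collected for one word."""
    return word, sum(counts)


def word_count(lines):
    """Run map, shuffle, and reduce over an iterable of text lines."""
    pairs = (pair for line in lines for pair in map_phase(line))
    return dict(reduce_phase(word, counts) for word, counts in shuffle(pairs).items())


if __name__ == "__main__":
    print(word_count(["hello world", "Hello Hadoop"]))
```

In a real Hadoop job the map and reduce steps run on different nodes and the framework does the shuffle for you; this sketch only shows how the three steps fit together.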