- Paperback: 316 pages
- Publisher: Packt Publishing Limited (7 February 2013)
- Language: English
- ISBN-10: 1849519129
- ISBN-13: 978-1849519120
- Product Dimensions: 19 x 1.8 x 23.5 cm
- Average Customer Review: Be the first to review this item
- Amazon Bestsellers Rank: #3,41,729 in Books (See Top 100 in Books)
Hadoop Real-World Solutions Cookbook Paperback – Import, 7 Feb 2013
Customers who viewed this item also viewed
Customers who bought this item also bought
Description for Hadoop Real-World Solutions Cookbook
About the Author
Jonathan R. Owens has a background in Java and C++, and has worked in both the private and public sectors as a software engineer. Most recently, he has been working with Hadoop, and related distributing processing technologies. Currently, Jonathan R. Owens works for comScore, Inc, a widely regarded digital measurement and analytics company. At comScore, he is a member of the core-processing team, which uses Hadoop and other custom distributed systems to aggregate, analyze, and manage over 40+ billion transactions per day. Jon Lentz is a software engineer on the Core Processing team at comScore, an online audience measurement and analytics company. He prefers to do most of his coding in Pig. Before working at comScore he wrote software to optimize supply chains and to allocate fixed income securities. Brian Femiano has a B.S in Computer Science and has been programming professionally for over 6 years, the last 2 of which have been spent building advanced analytics and big data capabilities using Apache Hadoop. He has worked for the commercial sector in the past, but the majority of his experience comes from the government contracting space. He currently works for Potomac Fusion in the DC/Virginia area, where they develop scalable algorithms to study and enhance some of the most advanced and complex datasets in the government space. Within Potomac Fusion, he has taught courses and training sessions to help teach Apache Hadoop and related cloud-scale technology.
Enter your mobile number or email address below and we'll send you a link to download the free Kindle App. Then you can start reading Kindle books on your smartphone, tablet, or computer - no Kindle device required.
To get the free app, enter mobile phone number.
|5 star (0%)|
|4 star (0%)|
|3 star (0%)|
|2 star (0%)|
|1 star (0%)|
Most helpful customer reviews on Amazon.com
First example in chapter one uses a text file from chapter 4 without mentioning where to get the file (I had to search for find it).
8th page of the book mentions some random script in Pig without giving any background on what the Pig is or how/where to execute the script.
This book has really helpful examples over the configuration, map-reducing, scripting, a little opening of machine learning and other relative operations of hadoop(hdfs, pig, hive) with some samples of source codes in the form of Q and A.
This book looks like a good group of googled results, but the number of solutions is not many, a few of solutions, that is the lacking point of this book.
Anyway, the solutions this book handles themselves are well-written.
You not only given lessons solely about Hadoop, but you also get introduced to other packages such as Giraph, Mahout and Accumulo and how to integrate these with Hadoop.
This is definitely a book that will be within easy reach and used heavily for reference for quite some time. If you're working with Hadoop for anything and you want quick answers on how to do things, give this book a close look.
Any one who has little understanding of Hadoop and eco system tools will enjoy the examples , many of these are of real time scenarios (basic to little complex) as the book title suggests.
I still feel some more concrete examples on machine learning and analytics should have made this book more valuable..