Hadoop in Action

2019-03-25 21:50|来源: Chuck Lam

DESCRIPTION

Big data can be difficult to handle using traditional databases.Apache Hadoop is a NoSQL applications framework that runs on distributed clusters. This lets it scale to huge datasets. If you need analytic information from your data, Hadoop’s the way to go.

Hadoop in Action introduces the subject and teaches you how to write programs in the MapReduce style. It starts with a few easy examples and then moves quickly to show Hadoop use in more complex data analysis tasks. Included are best practices and design patterns of MapReduce programming.

This book requires basic Java skills. Knowing basic statistical concepts can help with the more advanced examples.

WHAT’S INSIDE

  • Introduction to MapReduce

  • Examples illustrating ideas in practice

  • Hadoop’s Streaming API

  • Other related tools, like Pig and Hive

About the Author

Chuck Lam is a Senior Engineer at RockYou! He has a PhD in pattern recognition from Stanford University.

WHAT REVIEWERS ARE SAYING

“I really love this book, is made for normal people just trying to get something done. The streaming coverage is perty good, it’s the best book for python type of people I’ve seen.”


Hadoop in Action teaches readers how to use Hadoop and write MapReduce programs. The intended readers are programmers, architects, and project managers who have to process large amounts of data offline. Hadoop in Action will lead the reader from obtaining

相关问答

更多