Apache Spark Machine Learning Blueprints
(2016)
By: Liu, Alex

Nonfiction

eBook

Provider: hoopla

Details

PUBLISHED
[United States] : Packt Publishing, 2016
Made available through hoopla
DESCRIPTION

1 online resource (252 pages)

ISBN/ISSN
9781785887789 MWT17596805, 1785887785 17596805
LANGUAGE
English
NOTES

Develop a range of cutting-edge machine learning projects with Apache Spark using this actionable guide If you are a data scientist, a data analyst, or an R and SPSS user with a good understanding of machine learning concepts, algorithms, and techniques, then this is the book for you. Some basic understanding of Spark and its core elements and application is required. There's a reason why Apache Spark has become one of the most popular tools in Machine Learning - its ability to handle huge datasets at an impressive speed means you can be much more responsive to the data at your disposal. This book shows you Spark at its very best, demonstrating how to connect it with R and unlock maximum value not only from the tool but also from your data. Packed with a range of project "blueprints" that demonstrate some of the most interesting challenges that Spark can help you tackle, you'll find out how to use Spark notebooks and access, clean, and join different datasets before putting your knowledge into practice with some real-world projects, in which you will see how Spark Machine Learning can help you with everything from fraud detection to analyzing customer attrition. You'll also find out how to build a recommendation engine using Spark's parallel computing powers. This book offers a step-by-step approach to setting up Apache Spark, and use other analytical tools with it to process Big Data and build machine learning projects.The initial chapters focus more on the theory aspect of machine learning with Spark, while each of the later chapters focuses on building standalone projects using Spark

Mode of access: World Wide Web

Additional Credits