图书标签: Spark 大数据 分布式 spark O'Reilly 编程 计算机科学 数据平台
发表于2024-07-12
Spark pdf epub mobi txt 电子书 下载 2024
Learn how to use, deploy, and maintain Apache Spark with this comprehensive guide, written by the creators of the open-source cluster-computing framework. With an emphasis on improvements and new features in Spark 2.0, authors Bill Chambers and Matei Zaharia break down Spark topics into distinct sections, each with unique goals.
You’ll explore the basic operations and common functions of Spark’s structured APIs, as well as Structured Streaming, a new high-level API for building end-to-end streaming applications. Developers and system administrators will learn the fundamentals of monitoring, tuning, and debugging Spark, and explore machine learning techniques and scenarios for employing MLlib, Spark’s scalable machine-learning library.
目测是豆瓣上第一个 "读过" 这本书的人吧...
评分是因为我的pdf版本问题吗。很多讲的很表面,如其他短评所说,api 使用手册
评分少有的Spark2.x入门书,但实在太浅,通篇api介绍。。
评分看了除流和机器学习以外的章节,整体很推荐,而且是基于Spark2.2的。既细致的讲述了Scala和Python版的api,又有部署和调优相关的细节。英语不太好,经常Google和有道换着来。Spark的好书很少,这是其中一本。
评分虽然是 Matei Zaharia 写的,但是这书明显过誉啊
评分
评分
评分
评分
Spark pdf epub mobi txt 电子书 下载 2024