圖書標籤: 數據科學 英文版 編程 DataScience
发表于2024-11-26
Big Data SMACK: A Guide to Apache Spark, Mesos, Akka, Cassandra, and Kafka pdf epub mobi txt 電子書 下載 2024
This book is about how to integrate full-stack open source big data architecture and how to choose the correct technology―Scala/Spark, Mesos, Akka, Cassandra, and Kafka―in every layer. Big data architecture is becoming a requirement for many different enterprises. So far, however, the focus has largely been on collecting, aggregating, and crunching large datasets in a timely manner. In many cases now, organizations need more than one paradigm to perform efficient analyses.
Big Data SMACK explains each of the full-stack technologies and, more importantly, how to best integrate them. It provides detailed coverage of the practical benefits of these technologies and incorporates real-world examples in every situation. The book focuses on the problems and scenarios solved by the architecture, as well as the solutions provided by every technology. It covers the six main concepts of big data architecture and how integrate, replace, and reinforce every layer:
The language: ScalaThe engine: Spark (SQL, MLib, Streaming, GraphX)The container: Mesos, DockerThe view: AkkaThe storage: CassandraThe message broker: Kafka
What you’ll learn
How to make big data architecture without using complex Greek letter architectures.How to build a cheap but effective cluster infrastructure.How to make queries, reports, and graphs that business demands.How to manage and exploit unstructured and No-SQL data sources.How use tools to monitor the performance of your architecture.How to integrate all technologies and decide which replace and which reinforce.
Who This Book Is For
This book is for developers, data architects, and data scientists looking for how to integrate the most successful big data open stack architecture and how to choose the correct technology in every layer.
From the Back Cover
Integrate full-stack open-source fast data pipeline architecture and choose the correct technology―Spark, Mesos, Akka, Cassandra, and Kafka (SMACK)―in every layer. Fast data is becoming a requirement for many enterprises. So far, however, the focus has largely been on collecting, aggregating, and crunching large data sets in a timely manner. In many cases organizations need more than one paradigm to perform efficient analyses.Big Data SMACK explains each technology and, more importantly, how to integrate them. It provides detailed coverage of the practical benefits of these technologies and incorporates real-world examples. The book focuses on the problems and scenarios solved by the architecture, as well as the solutions provided by each technology. This book covers the five main concepts of data pipeline architecture and how to integrate, replace, and reinforce every layer:The engine: Apache SparkThe container: Apache MesosThe model: Akka<The storage: Apache CassandraThe broker: Apache Kafka
Read more
About the Author
Raúl Estrada is the co-founder of Treu Technologies, an enterprise for Social Data Marketing and BigData research. Estrada is an Enterprise Architect with more than 15 years of experience in cluster management and Enterprise Software. Prior to founding Treu Technologies, Estrada worked as an Enterprise Architect in Application Servers & evangelist for Oracle Inc. Estrada loves functional languages like Elixir and Scala, and also has a Master degree on Computer Science. Isaac Ruiz is a Java programmer since 2001, and a consultant and architect since 2003. Ruiz had participated in projects of different areas and varied scopes (education, communications, retail, and others). Ruiz specializes in systems integration and has participated in projects mainly related to the financial sector. Ruiz is a supporter of free software. Ruiz like to experiment with new technologies (frameworks, languages, methods).
Read more
主要是對Spark Mesos Akka Cassandra Kafka做一個簡短的簡紹,以及這些組件如何組閤使用。 可以對大數據的架構有個簡單的認識。
評分主要是對Spark Mesos Akka Cassandra Kafka做一個簡短的簡紹,以及這些組件如何組閤使用。 可以對大數據的架構有個簡單的認識。
評分如果想瞭解些入門概念可以看看,否則嚴重不推薦,純粹就是掛瞭smack的噱頭寫的一份上手指南。
評分如果想瞭解些入門概念可以看看,否則嚴重不推薦,純粹就是掛瞭smack的噱頭寫的一份上手指南。
評分如果想瞭解些入門概念可以看看,否則嚴重不推薦,純粹就是掛瞭smack的噱頭寫的一份上手指南。
This book is about how to integrate full-stack open source big data architecture and how to choose the correct technology―Scala/Spark, Mesos, Akka, Cassandra, and Kafka―in every layer. Big data architecture is becoming a requirement for many different ent...
評分This book is about how to integrate full-stack open source big data architecture and how to choose the correct technology―Scala/Spark, Mesos, Akka, Cassandra, and Kafka―in every layer. Big data architecture is becoming a requirement for many different ent...
評分This book is about how to integrate full-stack open source big data architecture and how to choose the correct technology―Scala/Spark, Mesos, Akka, Cassandra, and Kafka―in every layer. Big data architecture is becoming a requirement for many different ent...
評分This book is about how to integrate full-stack open source big data architecture and how to choose the correct technology―Scala/Spark, Mesos, Akka, Cassandra, and Kafka―in every layer. Big data architecture is becoming a requirement for many different ent...
評分This book is about how to integrate full-stack open source big data architecture and how to choose the correct technology―Scala/Spark, Mesos, Akka, Cassandra, and Kafka―in every layer. Big data architecture is becoming a requirement for many different ent...
Big Data SMACK: A Guide to Apache Spark, Mesos, Akka, Cassandra, and Kafka pdf epub mobi txt 電子書 下載 2024