圖書標籤: Hadoop 分布式 並行計算 數據挖掘 大數據 計算機 O'Reilly 編程
发表于2025-02-17
Hadoop pdf epub mobi txt 電子書 下載 2025
Ready to unleash the power of your massive dataset? With the latest edition of this comprehensive resource, you'll learn how to use Apache Hadoop to build and maintain reliable, scalable, distributed systems. It's ideal for programmers looking to analyze datasets of any size, and for administrators who want to set up and run Hadoop clusters. This third edition covers recent changes to Hadoop, including new material on the new MapReduce API, as well as version 2 of the MapReduce runtime (YARN) and its more flexible execution model. You'll also find illuminating case studies that demonstrate how Hadoop is used to solve specific problems. * Store large datasets with the Hadoop Distributed File System (HDFS), then run distributed computations with MapReduce * Use Hadoop's data and I/O building blocks for compression, data integrity, serialization (including Avro), and persistence * Discover common pitfalls and advanced features for writing real-world MapReduce programs * Design, build, and administer a dedicated Hadoop cluster, or run Hadoop in the cloud * Use Pig, a high-level query language for large-scale data processing * Analyze datasets with Hive, Hadoop's data warehousing system * Load data from relational databases into HDFS, using Sqoop * Take advantage of HBase, the database for structured and semi-structured data * Use ZooKeeper, the toolkit for building distributed systems
看瞭前兩部分,講的比較清楚
評分中英結閤療效好,不過還是建議中文為主,英文為輔。
評分很細緻,跟著入門一遍。對這種快速演進的技術,還是老老實實看原版吧..中文版落後太多,等新翻譯齣來黃花菜都涼瞭
評分這段時間學完瞭大數據,看這本書復習一遍還是很棒的
評分Hadoop權威指南英文版,非常給力
详见:http://www.cnblogs.com/aprilrain/archive/2013/03/07/2947664.html
評分中文版412页: 所以理论上,任何东西都可以表示成二进制形式,然后转化成为长整型的字符串或直接对数据结构进行序列化,来作为键值。 原文460页: ..., so theoretically anything can serve as row key, from strings to binary representations of long or even serialized ...
評分-- china-pub 赠书活动 -- http://www.douban.com/group/topic/20965935/ 一直比较忙,整本书还没读完,只是粗略翻了个大概,其中有两三章细读了一遍。先做个大体评价吧,有时间全部细读后再评论。 从书的内容上来讲,大致上与网上该书的内容介绍一致。简单点概括:这本书对...
評分详见:http://www.cnblogs.com/aprilrain/archive/2013/03/07/2947664.html
評分首先,翻译太差,很多句子就是瞎翻,根本不通顺,很多时候你要停下来断句,慢慢去理解。 然后,这本书是很多人去翻译的,很多人连代码都不懂,曾经一段代码看到我蒙圈,去看了一下源代码,好家伙,四行有五个错误。另外,从代码瞎缩进也可以看出这是群没写过代码的人翻的,而且...
Hadoop pdf epub mobi txt 電子書 下載 2025