Hadoop

Hadoop pdf epub mobi txt 电子书 下载 2025

出版者:O'Reilly Media
作者:Tom White
出品人:
页数:688
译者:
出版时间:2012-5-26
价格:USD 49.99
装帧:Paperback
isbn号码:9781449311520
丛书系列:
图书标签:
  • Hadoop
  • 分布式
  • 并行计算
  • 数据挖掘
  • 大数据
  • 计算机
  • O'Reilly
  • 编程
  • 大数据
  • Hadoop
  • 分布式存储
  • 分布式计算
  • MapReduce
  • YARN
  • 数据分析
  • 数据挖掘
  • 云计算
  • 开源技术
想要找书就要到 小美书屋
立刻按 ctrl+D收藏本页
你会得到大惊喜!!

具体描述

Ready to unleash the power of your massive dataset? With the latest edition of this comprehensive resource, you'll learn how to use Apache Hadoop to build and maintain reliable, scalable, distributed systems. It's ideal for programmers looking to analyze datasets of any size, and for administrators who want to set up and run Hadoop clusters. This third edition covers recent changes to Hadoop, including new material on the new MapReduce API, as well as version 2 of the MapReduce runtime (YARN) and its more flexible execution model. You'll also find illuminating case studies that demonstrate how Hadoop is used to solve specific problems. * Store large datasets with the Hadoop Distributed File System (HDFS), then run distributed computations with MapReduce * Use Hadoop's data and I/O building blocks for compression, data integrity, serialization (including Avro), and persistence * Discover common pitfalls and advanced features for writing real-world MapReduce programs * Design, build, and administer a dedicated Hadoop cluster, or run Hadoop in the cloud * Use Pig, a high-level query language for large-scale data processing * Analyze datasets with Hive, Hadoop's data warehousing system * Load data from relational databases into HDFS, using Sqoop * Take advantage of HBase, the database for structured and semi-structured data * Use ZooKeeper, the toolkit for building distributed systems

作者简介

目录信息

读后感

评分

中文版412页: 所以理论上,任何东西都可以表示成二进制形式,然后转化成为长整型的字符串或直接对数据结构进行序列化,来作为键值。 原文460页: ..., so theoretically anything can serve as row key, from strings to binary representations of long or even serialized ...  

评分

看了几章中文版的,各种错误,太低级,实在是看不下去了。 建议还是看原版吧。 译者们的脸皮可真厚,英文译不明白也就罢了,中文都组织的不通顺,好意思吗!! 什么叫 “但是,......,但是”啊,“但是体”啊。  

评分

买了第一版,时间太紧,没来得及看,后来出了个号称修订升级的第二版,毫不犹豫又买了,后来听说第二版比第一版翻译得好,心中窃喜,再后来看了第二版,我震惊了,我TM就是一傻子,放着好好的英文版不看,赶什么时髦买中文版呢。在这个神奇的国度,牛奶里放的是三聚氰胺,火腿...  

评分

评分

用户评价

评分

这书到后面已经神游了,没这环境先不玩

评分

可以当做概览

评分

The system of Big Data, all focuse on the Scality, Fault torlerance, Scheduler, Shuffle.

评分

过了一遍,只知道个大概结构。细节还不是很懂

评分

简单过了一遍

本站所有内容均为互联网搜索引擎提供的公开搜索信息,本站不存储任何数据与内容,任何内容与数据均与本站无关,如有需要请联系相关搜索引擎包括但不限于百度google,bing,sogou

© 2025 book.quotespace.org All Rights Reserved. 小美书屋 版权所有