Book tags: distributed systems, big data, system design, computer science, architecture, data, O'Reilly
Published on 2024-10-03
Designing Data-Intensive Applications pdf epub mobi txt e-book download 2024
Data is at the center of many challenges in system design today. Difficult issues need to be figured out, such as scalability, consistency, reliability, efficiency, and maintainability. In addition, we have an overwhelming variety of tools, including relational databases, NoSQL datastores, stream or batch processors, and message brokers. What are the right choices for your application? How do you make sense of all these buzzwords?
In this practical and comprehensive guide, author Martin Kleppmann helps you navigate this diverse landscape by examining the pros and cons of various technologies for processing and storing data. Software keeps changing, but the fundamental principles remain the same. With this book, software engineers and architects will learn how to apply those ideas in practice, and how to make full use of data in modern applications.
Peer under the hood of the systems you already use, and learn how to use and operate them more effectively
Make informed decisions by identifying the strengths and weaknesses of different tools
Navigate the trade-offs around consistency, scalability, fault tolerance, and complexity
Understand the distributed systems research upon which modern databases are built
Peek behind the scenes of major online services, and learn from their architectures
Martin is a researcher in distributed systems at the University of Cambridge. Previously he was a software engineer and entrepreneur at Internet companies including LinkedIn and Rapportive, where he worked on large-scale data infrastructure. In the process he learned a few things the hard way, and he hopes this book will save you from repeating the same mistakes.
Martin is a regular conference speaker, blogger, and open source contributor. He believes that profound technical ideas should be accessible to everyone, and that deeper understanding will help us develop better software.
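Worth reading a second time. A true big picture of distributed data systems.
I only read the free first four chapters, and it all felt like common knowledge.
I mainly read the first two parts. I think it is the best introductory primer on databases and distributed storage, and the papers cited at the end of each chapter let you dig deeper.
Highly recommended. The writing flows well and explains deep ideas in simple terms, and the various engineering practices of distributed systems are laid out very clearly.
A comprehensive, in-depth introduction to big data systems that covers not only the how but also the why, and builds a panoramic view of the technology.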
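1. Transactions and isolation levels 1.1. Read Committed. Definition: a transaction can see only the changes that other transactions have already committed, and cannot see changes that other transactions have made but not yet committed. Implementation: for any data modified by a transaction, both the new value and the old value are recorded until the transaction commits. Other transactions that read this data get the old value, while the modifying transaction itself gets the new value. Consistency flaw: non-repeatable...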
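The Read Committed mechanism described in the review above can be sketched roughly as follows. This is a minimal in-memory illustration, not how any particular database implements it: the class `ReadCommittedStore`, its method names, and the choice to reject a second writer on the same key are all assumptions made for the example.

```python
class ReadCommittedStore:
    """Toy key-value store illustrating Read Committed (hypothetical names)."""

    def __init__(self):
        self._committed = {}  # key -> last committed (old) value
        self._pending = {}    # key -> (txn_id, uncommitted new value)

    def write(self, txn_id, key, value):
        owner = self._pending.get(key)
        # Assumption: only one transaction may hold an uncommitted write per key.
        if owner is not None and owner[0] != txn_id:
            raise RuntimeError(f"key {key!r} is being written by {owner[0]!r}")
        self._pending[key] = (txn_id, value)

    def read(self, txn_id, key):
        pending = self._pending.get(key)
        # The writing transaction sees its own new value ...
        if pending is not None and pending[0] == txn_id:
            return pending[1]
        # ... every other transaction sees the old, committed value.
        return self._committed.get(key)

    def commit(self, txn_id):
        # Publish this transaction's new values as the committed ones.
        keys = [k for k, (owner, _) in self._pending.items() if owner == txn_id]
        for key in keys:
            _, value = self._pending.pop(key)
            self._committed[key] = value

    def abort(self, txn_id):
        # Discard uncommitted values; the old values remain visible.
        keys = [k for k, (owner, _) in self._pending.items() if owner == txn_id]
        for key in keys:
            del self._pending[key]


store = ReadCommittedStore()
store.write("t0", "x", 1)
store.commit("t0")

store.write("t1", "x", 2)            # t1 has not committed yet
first_read = store.read("t2", "x")   # t2 sees the old value: 1
own_read = store.read("t1", "x")     # t1 sees its own new value: 2
store.commit("t1")
second_read = store.read("t2", "x")  # t2 now sees the committed value: 2
```

In this sketch the same transaction `t2` reads `x` twice and gets two different values, because the second read happens after `t1` commits. Read Committed only rules out seeing uncommitted data; it does not promise that repeated reads within one transaction agree.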
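I had previously read some material on distributed systems, such as 《分布系统原理---刘杰》 (Principles of Distributed Systems, by Liu Jie), along with material on HDFS, MapReduce, Google's three classic papers and the like, and I had also gone through the official documentation of some open source projects (Hadoop, Zookeeper). I always felt that distributed systems were very advanced theory. After reading this book, I found that it really does explain deep ideas in simple terms, and on the principles of distributed systems...
Every data engineer and product manager should read the final section, Doing the right thing, ten times. To convince you that it is worth reading ten times, here are a few excerpts: automated systems can systematically and arbitrarily exclude a person from participating in society without any proof of guilt, and with l...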