It's been said that data is the new "dirt"—the raw material from which and on which you build the structures of the modern world. And like dirt, data can seem like a limitless, undifferentiated mass. The ability to take raw data, access it, filter it, process it, visualize it, understand it, and communicate it to others is possibly the most essential business problem for the coming decades.
"Machine learning," the process of automating tasks once considered the domain of highly-trained analysts and mathematicians, is the key to efficiently extracting useful information from this sea of raw data. By implementing the core algorithms of statistical data processing, data analysis, and data visualization as reusable computer code, you can scale your capacity for data analysis well beyond the capabilities of individual knowledge workers.
Machine Learning in Action is a unique book that blends the foundational theories of machine learning with the practical realities of building tools for everyday data analysis. In it, you'll use the flexible Python programming language to build programs that implement algorithms for data classification, forecasting, recommendations, and higher-level features like summarization and simplification.
As you work through the numerous examples, you'll explore key topics like classification, numeric prediction, and clustering. Along the way, you'll be introduced to important established algorithms, such as Apriori, through which you identify association patterns in large datasets and Adaboost, a meta-algorithm that can increase the efficiency of many machine learning tasks.
Peter Harrington holds Bachelors and Masters Degrees in Electrical Engineering. He worked for Intel Corporation for seven years in California and China. Peter holds five US patents and his work has been published in three academic journals. He is currently the chief scientist for Zillabyte Inc. Peter spends his free time competing in programming competitions, and building 3D printers.
1. 这本书的价值是提供了一系列有趣的「实验作业」和「对应的数据」,以及乱七八糟的 Python 代码,迫使读者在同样数据集上自己写一个更好的。 2. 作者的 Python 代码写得真的真的很渣。 3. 作者的 SVM 写错了,不是 Platt 的原始 SMO 算法,里面的 error cache 形同虚设。 ...
評分我的学习过程如下,供大家参考: 1、有些python的基础编程能力,如果没有,先花半个小时学习下; 2、数学基本统计基础,如果不懂数学原理,可以先不要去理解数学原理; 3、先上手写下代码,沉浸进入,熟悉了代码流程,再回头去看数据原理,就明白了。 5、一句话,先不求甚解,...
評分为什么我会力荐这本书? 也许书中分类器都非常的简单,数学理论都非常的粗浅(为了看明白书中SVM分类器的训练过程,不得不去复习了二次凸优化解法,自己推导被作者略去的中间过程),算法测试也只在轻量级的数据集上完成。 不过,大可不必像其他评论一样对贬低本书。聪明的读...
評分如果你是机器学习的入门者,如果你想快速看到算法的执行效果,那么这本书适合你。 作者把算法的基本原理讲的很清楚,而且代码是完整可执行的。当然,如果你想了解算法背后的数学原理,还需要花时间去复习一下概率论、高等数学和线性代数。 BTW:读者最好有编程经验,有抽象思维。
看這書可以同時入門機器學習,python,mapreduce,作者可以幾個方麵都講清楚,真不容易
评分理論條理清楚、舉重若輕。可惜程序代碼水平稍差。
评分over simplified in maths, you do need refer to other textbooks for get better idea how it works. and too much coding details, I can understand as the author was from CS background, but I think you need read more, beside this is indeed a nice start point.
评分讀瞭LR,ada boost,略讀瞭svm,psvm。數學渣子的福音,碼農最愛的實例。 雖然大傢都說寫的不好,不過入個門還是不錯。
评分看這書可以同時入門機器學習,python,mapreduce,作者可以幾個方麵都講清楚,真不容易
本站所有內容均為互聯網搜索引擎提供的公開搜索信息,本站不存儲任何數據與內容,任何內容與數據均與本站無關,如有需要請聯繫相關搜索引擎包括但不限於百度,google,bing,sogou 等
© 2025 book.quotespace.org All Rights Reserved. 小美書屋 版权所有