Introduction
Rapid advances in data collection and storage technology have enabled or
ganizations to accumulate vast amounts of data. However, extracting useful
information has proven extremely challenging. Often, traditional data analy
sis tools and techniques cannot be used because of the massive size of a data
set. Sometimes, the non-traditional nature of the data means that traditional
approaches cannot be applied even if the data set is relatively small. In other
situations, the questions that need to be answered cannot be addressed using
existing data analysis techniques, and thus, new methods need to be devel
oped.
Data mining is a technology that blends traditional data analysis methods
with sophisticated algorithms for processing large volumes of data. It has also
opened up exciting opportunities for exploring and analyzing new types of
data and for analyzing old types of data in new ways. In this introductory
chapter, we present an overview of data mining and outline the key topics
to be covered in this book. We start with a description of some well-known
applications that require new techniques for data analysis.
Business Point-of-sale data collection (bar code scanners, radio frequency
identification (RFID), and smart card technology) have allowed retailers to
collect up-to-the-minute data about customer purchases at the checkout coun
ters of their stores. Retailers can utilize this information, along with other
business-critical data such as Web logs from e-commerce Web sites and cus
tomer service records from call centers, to help them better understand the
needs of their customers and make more informed business decisions.
Data mining techniques can be used to support a wide range of business
intelligence applications such as customer profiling, targeted marketing, work
flow management, store layout, and fraud detection. It can also help retailers
Pang-Ning Tan现为密歇根州立大学计算机与工程系助理教授,主要教授数据挖掘、数据库系统等课程。此前,他曾是明尼苏达大学美国陆军高性能计算研究中心副研究员(2002-2003)。
Michael Steinbach 明尼苏达大学计算机与工程系研究员,在读博士。
Vipin Kumar明尼苏达大学计算机科学与工程系主任,曾任美国陆军高性能计算研究中心主任。他拥有马里兰大学博士学位,是数据挖掘和高性能计算方面的国际权威,IEEE会士。
我的习惯就是在蹲坑的时候读一些艰涩高深的科学读物,这样有助于我在排泄的时候大脑保持高度的兴奋状态,不至于被熏晕或者不至于被引人入胜的小说情节所陶醉最后导致肛瘘…… 但是,这本书另我惊诧了…… 第一他不艰涩,是我读到过的关于统计、关于数据、关于计算的最科普的读...
评分统计学经典入门书籍,对数据处理、分类、相关分析、聚类等方面做了事无巨细的讲解,兼顾通俗性和理论推导,浏览一遍目录就会发现,这不就是机器学习嘛! 看这书名一开始以为这只是一本讲数据抓取、数据分析的书籍,这比市面上一些夸夸其谈机器学习、人工智能的书要低调很多,而...
评分我是拿这本书当作课程书的,这本书基本上涵盖了数据挖掘的许多经典算法,分类,聚类,关联规则。比较适合对数据挖掘感兴趣的人,这本书看完之后基本上就可以进行对数据的分析,挖掘了。然而这仅仅是一门入门书,对于理论部分并没有做过多的解释。如果想进一步的了解理论知识,...
评分屎一样狗屁不通的翻译。 原文: As a result, Z is as likely to be chosen for splitting as the interacting but useful attributes, X and Y. 译文:因此,Z 可能被选作划分有相互作用但有效的属性 X 和 Y。 还有其他很多地方就不一一列举了,本来作为入门读物,很多东西就...
评分统计学经典入门书籍,对数据处理、分类、相关分析、聚类等方面做了事无巨细的讲解,兼顾通俗性和理论推导,浏览一遍目录就会发现,这不就是机器学习嘛! 看这书名一开始以为这只是一本讲数据抓取、数据分析的书籍,这比市面上一些夸夸其谈机器学习、人工智能的书要低调很多,而...
挺容易的
评分挺容易的
评分挺容易的
评分挺容易的
评分挺容易的
本站所有内容均为互联网搜索引擎提供的公开搜索信息,本站不存储任何数据与内容,任何内容与数据均与本站无关,如有需要请联系相关搜索引擎包括但不限于百度,google,bing,sogou 等
© 2025 book.quotespace.org All Rights Reserved. 小美书屋 版权所有