Python for Data Analysis

Python for Data Analysis pdf epub mobi txt 電子書 下載2025

出版者:O'Reilly Media
作者:Wesly McKinney
出品人:
頁數:450
译者:
出版時間:2013-6-16
價格:0
裝幀:Paperback
isbn號碼:9781549329784
叢書系列:
圖書標籤:
  • Python
  • 數據分析
  • python大數據分析
  • Python基礎教程
  • 計算機
  • 編程
  • python
  • python培訓
  • Python
  • 數據分析
  • 數據科學
  • 編程
  • 機器學習
  • 可視化
  • 數據清洗
  • 統計分析
  • 大數據
  • 開源
想要找書就要到 小美書屋
立刻按 ctrl+D收藏本頁
你會得到大驚喜!!

具體描述

這本書主要是用 pandas 連接 SciPy 和 NumPy,用pandas做數據處理是Pycon2012上一個很熱門的話題。另一個功能強大的東西是Sage,它將很多開源的軟件集成到統一的 Python 接口。

Python for Data Analysis is concerned with the nuts and bolts of manipulating, processing, cleaning, and crunching data in Python. It is also a practical, modern introduction to scientific computing in Python, tailored for data-intensive applications. This is a book about the parts of the Python language and libraries you’ll need to effectively solve a broad set of data analysis problems. This book is not an exposition on analytical methods using Python as the implementation language.

Written by Wes McKinney, the main author of the pandas library, this hands-on book is packed with practical cases studies. It’s ideal for analysts new to Python and for Python programmers new to scientific computing.

Use the IPython interactive shell as your primary development environment

Learn basic and advanced NumPy (Numerical Python) features

Get started with data analysis tools in the pandas library

Use high-performance tools to load, clean, transform, merge, and reshape data

Create scatter plots and static or interactive visualizations with matplotlib

Apply the pandas groupby facility to slice, dice, and summarize datasets

Measure data by points in time, whether it’s specific instances, fixed periods, or intervals

Learn how to solve problems in web analytics, social sciences, finance, and economics, through detailed examples

著者簡介

Wes McKinney 資深數據分析專傢,對各種Python庫(包括NumPy、pandas、matplotlib以及IPython等)等都有深入研究,並在大量的實踐中積纍瞭豐富的經驗。撰寫瞭大量與Python數據分析相關的經典文章,被各大技術社區爭相轉載,是Python和開源技術社區公認的權威人物之一。開發瞭用於數據分析的著名開源Python庫——pandas,廣獲用戶好評。在創建Lambda Foundry(一傢緻力於企業數據分析的公司)之前,他曾是AQR Capital Management的定量分析師。

圖書目錄

Chapter 1 Preliminaries
What Is This Book About?
Why Python for Data Analysis?
Essential Python Libraries
Installation and Setup
Community and Conferences
Navigating This Book
Acknowledgements
Chapter 2 Introductory Examples
1.usa.gov data from bit.ly
MovieLens 1M Data Set
US Baby Names 1880-2010
Conclusions and The Path Ahead
Chapter 3 IPython: An Interactive Computing and Development Environment
IPython Basics
Using the Command History
Interacting with the Operating System
Software Development Tools
IPython HTML Notebook
Tips for Productive Code Development Using IPython
Advanced IPython Features
Credits
Chapter 4 NumPy Basics: Arrays and Vectorized Computation
The NumPy ndarray: A Multidimensional Array Object
Universal Functions: Fast Element-wise Array Functions
Data Processing Using Arrays
File Input and Output with Arrays
Linear Algebra
Random Number Generation
Example: Random Walks
Chapter 5 Getting Started with pandas
Introduction to pandas Data Structures
Essential Functionality
Summarizing and Computing Descriptive Statistics
Handling Missing Data
Hierarchical Indexing
Other pandas Topics
Chapter 6 Data Loading, Storage, and File Formats
Reading and Writing Data in Text Format
Binary Data Formats
Interacting with HTML and Web APIs
Interacting with Databases
Chapter 7 Data Wrangling: Clean, Transform, Merge, Reshape
Combining and Merging Data Sets
Reshaping and Pivoting
Data Transformation
String Manipulation
Example: USDA Food Database
Chapter 8 Plotting and Visualization
A Brief matplotlib API Primer
Plotting Functions in pandas
Plotting Maps: Visualizing Haiti Earthquake Crisis Data
Python Visualization Tool Ecosystem
Chapter 9 Data Aggregation and Group Operations
GroupBy Mechanics
Data Aggregation
Group-wise Operations and Transformations
Pivot Tables and Cross-Tabulation
Example: 2012 Federal Election Commission Database
Chapter 10 Time Series
Date and Time Data Types and Tools
Time Series Basics
Date Ranges, Frequencies, and Shifting
Time Zone Handling
Periods and Period Arithmetic
Resampling and Frequency Conversion
Time Series Plotting
Moving Window Functions
Performance and Memory Usage Notes
Chapter 11 Financial and Economic Data Applications
Data Munging Topics
Group Transforms and Analysis
More Example Applications
Chapter 12 Advanced NumPy
ndarray Object Internals
Advanced Array Manipulation
Broadcasting
Advanced ufunc Usage
Structured and Record Arrays
More About Sorting
NumPy Matrix Class
Advanced Array Input and Output
Performance Tips
Appendix Python Language Essentials
The Python Interpreter
The Basics
Data Structures and Sequences
Functions
Files and the operating system
· · · · · · (收起)

讀後感

評分

評分

作者对于利用Python进行数据分析有着很丰富的经验,因此写出的书也是深入浅出,让人很容易就能看懂,尤其是在我看过Python学习手册后再看,基本都能看懂。 其中译者的翻译非常值得称道,堪称良心之作,非常的用心。 感谢Python社区的无私奉献的程序员们,也感谢我们有这么好的...  

評分

評分

pandas主要基于numpy.ndarray构造了更高级的Series和DataFrame数据结构。这本书主要就是说明基于这两种数据结构的API用法。这些API主要是对原本numpy操作的补充。行列Index在DataFrame的加强对于各种数据逻辑操作帮助比较大。对pyplot的绘图函数也和两种数据结构绑定的很好。越...  

評分

用戶評價

评分

就著coursera的一門課翻完瞭。。。代碼太多,有點無聊,不過還算係統。可以拿來當工具書用——

评分

就著coursera的一門課翻完瞭。。。代碼太多,有點無聊,不過還算係統。可以拿來當工具書用——

评分

又是一個入門讀物, 不過pandas真是太強大瞭 讓人眼前一亮

评分

人生苦短, 我學python。 好書, 入門以後的中階讀物? 最主要的是實踐中很多用得上

评分

介紹性的文檔, 熟悉一下numpy和pandas

本站所有內容均為互聯網搜索引擎提供的公開搜索信息,本站不存儲任何數據與內容,任何內容與數據均與本站無關,如有需要請聯繫相關搜索引擎包括但不限於百度google,bing,sogou

© 2025 book.quotespace.org All Rights Reserved. 小美書屋 版权所有