Python for Data Analysis

Python for Data Analysis pdf epub mobi txt 电子书 下载 2025

出版者:O'Reilly Media
作者:Wesly McKinney
出品人:
页数:450
译者:
出版时间:2013-6-16
价格:0
装帧:Paperback
isbn号码:9781549329784
丛书系列:
图书标签:
  • Python
  • 数据分析
  • python大数据分析
  • Python基础教程
  • 计算机
  • 编程
  • python
  • python培训
  • Python
  • 数据分析
  • 数据科学
  • 编程
  • 机器学习
  • 可视化
  • 数据清洗
  • 统计分析
  • 大数据
  • 开源
想要找书就要到 小美书屋
立刻按 ctrl+D收藏本页
你会得到大惊喜!!

具体描述

这本书主要是用 pandas 连接 SciPy 和 NumPy,用pandas做数据处理是Pycon2012上一个很热门的话题。另一个功能强大的东西是Sage,它将很多开源的软件集成到统一的 Python 接口。

Python for Data Analysis is concerned with the nuts and bolts of manipulating, processing, cleaning, and crunching data in Python. It is also a practical, modern introduction to scientific computing in Python, tailored for data-intensive applications. This is a book about the parts of the Python language and libraries you’ll need to effectively solve a broad set of data analysis problems. This book is not an exposition on analytical methods using Python as the implementation language.

Written by Wes McKinney, the main author of the pandas library, this hands-on book is packed with practical cases studies. It’s ideal for analysts new to Python and for Python programmers new to scientific computing.

Use the IPython interactive shell as your primary development environment

Learn basic and advanced NumPy (Numerical Python) features

Get started with data analysis tools in the pandas library

Use high-performance tools to load, clean, transform, merge, and reshape data

Create scatter plots and static or interactive visualizations with matplotlib

Apply the pandas groupby facility to slice, dice, and summarize datasets

Measure data by points in time, whether it’s specific instances, fixed periods, or intervals

Learn how to solve problems in web analytics, social sciences, finance, and economics, through detailed examples

作者简介

Wes McKinney 资深数据分析专家,对各种Python库(包括NumPy、pandas、matplotlib以及IPython等)等都有深入研究,并在大量的实践中积累了丰富的经验。撰写了大量与Python数据分析相关的经典文章,被各大技术社区争相转载,是Python和开源技术社区公认的权威人物之一。开发了用于数据分析的著名开源Python库——pandas,广获用户好评。在创建Lambda Foundry(一家致力于企业数据分析的公司)之前,他曾是AQR Capital Management的定量分析师。

目录信息

Chapter 1 Preliminaries
What Is This Book About?
Why Python for Data Analysis?
Essential Python Libraries
Installation and Setup
Community and Conferences
Navigating This Book
Acknowledgements
Chapter 2 Introductory Examples
1.usa.gov data from bit.ly
MovieLens 1M Data Set
US Baby Names 1880-2010
Conclusions and The Path Ahead
Chapter 3 IPython: An Interactive Computing and Development Environment
IPython Basics
Using the Command History
Interacting with the Operating System
Software Development Tools
IPython HTML Notebook
Tips for Productive Code Development Using IPython
Advanced IPython Features
Credits
Chapter 4 NumPy Basics: Arrays and Vectorized Computation
The NumPy ndarray: A Multidimensional Array Object
Universal Functions: Fast Element-wise Array Functions
Data Processing Using Arrays
File Input and Output with Arrays
Linear Algebra
Random Number Generation
Example: Random Walks
Chapter 5 Getting Started with pandas
Introduction to pandas Data Structures
Essential Functionality
Summarizing and Computing Descriptive Statistics
Handling Missing Data
Hierarchical Indexing
Other pandas Topics
Chapter 6 Data Loading, Storage, and File Formats
Reading and Writing Data in Text Format
Binary Data Formats
Interacting with HTML and Web APIs
Interacting with Databases
Chapter 7 Data Wrangling: Clean, Transform, Merge, Reshape
Combining and Merging Data Sets
Reshaping and Pivoting
Data Transformation
String Manipulation
Example: USDA Food Database
Chapter 8 Plotting and Visualization
A Brief matplotlib API Primer
Plotting Functions in pandas
Plotting Maps: Visualizing Haiti Earthquake Crisis Data
Python Visualization Tool Ecosystem
Chapter 9 Data Aggregation and Group Operations
GroupBy Mechanics
Data Aggregation
Group-wise Operations and Transformations
Pivot Tables and Cross-Tabulation
Example: 2012 Federal Election Commission Database
Chapter 10 Time Series
Date and Time Data Types and Tools
Time Series Basics
Date Ranges, Frequencies, and Shifting
Time Zone Handling
Periods and Period Arithmetic
Resampling and Frequency Conversion
Time Series Plotting
Moving Window Functions
Performance and Memory Usage Notes
Chapter 11 Financial and Economic Data Applications
Data Munging Topics
Group Transforms and Analysis
More Example Applications
Chapter 12 Advanced NumPy
ndarray Object Internals
Advanced Array Manipulation
Broadcasting
Advanced ufunc Usage
Structured and Record Arrays
More About Sorting
NumPy Matrix Class
Advanced Array Input and Output
Performance Tips
Appendix Python Language Essentials
The Python Interpreter
The Basics
Data Structures and Sequences
Functions
Files and the operating system
· · · · · · (收起)

读后感

评分

评分

1.调试轮子的时候发现,pandas由于基于numpy,有多个float数据类型(float16/float32/float64),多个float类型会被pandas认为是不同类导致无法进行后续处理,用apply方法把数据全转成字符串再转回来就行。 但是如果直接用type查看类型返回的class都是float。 不定期更新…

评分

评分

评分

这本书的作者就是pandas的开发者,全书以numpy为基础、按照数据分析的工作流程,详细介绍了如何使用pandas进行数据分析。每一章节最后一部分一般是一个数据分析的project,并且书中每一个小功能几乎都附上了相应代码说明,是一部名副其实的python数据分析cookbook。 另外,这本...  

用户评价

评分

迅速过了一遍,蛮适合推荐给不会编程的数据处理人员

评分

Pandas

评分

迅速过了一遍,蛮适合推荐给不会编程的数据处理人员

评分

用Python 3.6的我哭晕在角落..

评分

介绍性的文档, 熟悉一下numpy和pandas

本站所有内容均为互联网搜索引擎提供的公开搜索信息,本站不存储任何数据与内容,任何内容与数据均与本站无关,如有需要请联系相关搜索引擎包括但不限于百度google,bing,sogou

© 2025 book.quotespace.org All Rights Reserved. 小美书屋 版权所有