【Python数据分析-1】：Pandas的常用操作-Pandas的主要数据结构Series（一维数据）和DataFrame（二维数据）

2021/11/20 11:11:43

编程Tag： data index python PD dataframe Pandas Series dtype

本文主要是介绍【Python数据分析-1】：Pandas的常用操作-Pandas的主要数据结构Series（一维数据）和DataFrame（二维数据），对大家解决编程问题具有一定的参考价值，需要的程序猿们随着小编来一起学习吧！

大家好，这里是为代码封神的封神榜（有点吹牛皮了，哈哈）。还是新人初来乍到，希望大家多多指教。

本系列呢，是为大家带来的是Python数据分析，希望大家能够喜欢，多多支持。

安装Pandas

从PyPi安装：

pip install Pandas
复制代码

Pandas适用场景

Pandas 适用于处理与Excel表类似的二维表格数据

堆数	名称	描述
1	Series	代表钱的数据类型相同的数组
2	DataFrame	带标签的大小可变的不同数据类型的表格

Pandas的数据结构

Pandas 的主要数据结构是 Series（一维数据，类似Python中的数组）与 DataFrame（二维数据，类似excel表）。

维数	名称	描述
1	Series	带标签的数据类型相同的数组
2	DataFrame	带标签的大小可变的不同数据类型的表格

Series

Series是带标签的一维数组。标签统称为索引。

创建Series的方法

pd.Series(data, index=index)
复制代码

data可以是字典、数组、字面量。

>>>import pandas as pd
# data为字典
>>>data = {"name": "ridingroad", "age": 21}
>>>pd.Series(data)
name    ridingroad
age             21
dtype: object
# 字典为数组
>>>data = [1,2,3,4]
>>>pd.Series(data)
0    1
1    2
2    3
3    4
dtype: int64
# data为字面量
>>>data = 5
>>>pd.Series(data)
0    5
dtype: int64
# data为字面量，Series按索引(轴标签重复该字面量)
>>>pd.Series(data, index=['a', 'b', 'c'])
a    5
b    5
c    5
dtype: int64
复制代码

Series的操作

>>>data = pd.Series([1, 2, 3, 4], index=['a', 'b', 'c', 'd'])
>>>data
a    1
b    2
c    3
d    4
dtype: int64
# 按索引取值
>>>data[0]
1
>>>data[2]
3
# 按标签取值
>>>data['a']
1
>>>'a' in data
True
# Series.get(key) 当key不存在，返回None
>>>data.get('5')
# 支持大多数Numpy的数组的方法
>>>data + data
a    2
b    4
c    6
d    8
dtype: int64
# 支持大多数Numpy的数组的方法
>>>data[data > data.median()]
c    3
d    4
dtype: int64
复制代码

DataFrame

在生成DataFrame的时候可以指定列标签(columns)和行索引(idnex)。用字典生成DataFrame：

>>>pd.DataFrame({"0": [1, 2, 3, 4], "1": [5, 6, 7, 8]})
   0  1
0  1  5
1  2  6
2  3  7
3  4  8
# 指定行索引
>>>pd.DataFrame({"0": [1, 2, 3, 4], "1": [5, 6, 7, 8]}, index=['first', 'second', 'third', 'fourth'])
        0  1
first   1  5
second  2  6
third   3  7
fourth  4  8
复制代码

DataFrame.index和DataFrame.columns用来分别访问DataFrame的行和列的标签。

>>>data = pd.DataFrame({"0": [1, 2, 3, 4], "1": [5, 6, 7, 8]}, index=['first', 'second', 'third', 'fourth'])
>>>data.index
Index(['first', 'second', 'third', 'fourth'], dtype='object')
>>>data.columns
Index(['0', '1'], dtype='object')

这篇关于【Python数据分析-1】：Pandas的常用操作-Pandas的主要数据结构Series（一维数据）和DataFrame（二维数据）的文章就介绍到这儿，希望我们推荐的文章对大家有所帮助，也希望大家多多支持为之网！