量化交易系列【2】Pandas数据筛选及处理相关操作
量化交易系列【2】Pandas数据筛选及处理相关操作
- 筛选数据
- 处理缺失值
- 删除空值dropna
- 补全缺失值fillna
- 排序函数
- 合并操作
- 去除重复数据
示例中使用的‘000001.XSHE.csv’文件,已上传至csdn资源中,可直接下载
date |
open |
close |
high |
low |
volume |
money |
|
0 |
2015/1/5 |
9.98 |
10.00 |
10.17 |
9.74 |
458099037 |
4.565388e+09 |
1 |
2015/1/6 |
9.90 |
9.85 |
10.23 |
9.71 |
346952496 |
3.453446e+09 |
2 |
2015/1/7 |
9.72 |
9.67 |
9.88 |
9.55 |
272274401 |
2.634796e+09 |
3 |
2015/1/8 |
9.68 |
9.34 |
9.72 |
9.30 |
225445502 |
2.128003e+09 |
4 |
2015/1/9 |
9.30 |
9.42 |
9.91 |
9.19 |
401736419 |
3.835378e+09 |
筛选数据
date |
open |
close |
high |
low |
volume |
money |
code |
|
0 |
2015/1/5 |
9.98 |
10.00 |
10.17 |
9.74 |
458099037 |
4.565388e+09 |
000001.XSHE |
1 |
2015/1/6 |
9.90 |
9.85 |
10.23 |
9.71 |
346952496 |
3.453446e+09 |
000001.XSHE |
2 |
2015/1/7 |
9.72 |
9.67 |
9.88 |
9.55 |
272274401 |
2.634796e+09 |
000001.XSHE |
3 |
2015/1/8 |
9.68 |
9.34 |
9.72 |
9.30 |
225445502 |
2.128003e+09 |
000001.XSHE |
4 |
2015/1/9 |
9.30 |
9.42 |
9.91 |
9.19 |
401736419 |
3.835378e+09 |
000001.XSHE |
date |
open |
close |
high |
low |
volume |
money |
code |
|
0 |
2015-01-05 |
9.98 |
10.00 |
10.17 |
9.74 |
458099037 |
4.565388e+09 |
000001.XSHE |
1 |
2015-01-06 |
9.90 |
9.85 |
10.23 |
9.71 |
346952496 |
3.453446e+09 |
000001.XSHE |
2 |
2015-01-07 |
9.72 |
9.67 |
9.88 |
9.55 |
272274401 |
2.634796e+09 |
000001.XSHE |
处理缺失值
删除空值dropna
补全缺失值fillna
排序函数
合并操作
date |
open |
close |
high |
low |
volume |
money |
|
0 |
2015/1/5 |
9.98 |
10.00 |
10.17 |
9.74 |
458099037 |
4.565388e+09 |
1 |
2015/1/6 |
9.90 |
9.85 |
10.23 |
9.71 |
346952496 |
3.453446e+09 |
date |
open |
close |
high |
low |
volume |
money |
|
3 |
2015/1/8 |
9.68 |
9.34 |
9.72 |
9.30 |
225445502 |
2.128003e+09 |
4 |
2015/1/9 |
9.30 |
9.42 |
9.91 |
9.19 |
401736419 |
3.835378e+09 |
date |
open |
close |
high |
low |
volume |
money |
|
0 |
2015/1/5 |
9.98 |
10.00 |
10.17 |
9.74 |
458099037 |
4.565388e+09 |
1 |
2015/1/6 |
9.90 |
9.85 |
10.23 |
9.71 |
346952496 |
3.453446e+09 |
2 |
2015/1/8 |
9.68 |
9.34 |
9.72 |
9.30 |
225445502 |
2.128003e+09 |
3 |
2015/1/9 |
9.30 |
9.42 |
9.91 |
9.19 |
401736419 |
3.835378e+09 |
去除重复数据
date |
open |
close |
high |
low |
volume |
money |
|
0 |
2015/1/5 |
9.98 |
10.00 |
10.17 |
9.74 |
458099037 |
4.565388e+09 |
1 |
2015/1/6 |
9.90 |
9.85 |
10.23 |
9.71 |
346952496 |
3.453446e+09 |
2 |
2015/1/8 |
9.68 |
9.34 |
9.72 |
9.30 |
225445502 |
2.128003e+09 |
3 |
2015/1/9 |
9.30 |
9.42 |
9.91 |
9.19 |
401736419 |
3.835378e+09 |
3 |
2015/1/8 |
9.68 |
9.34 |
9.72 |
9.30 |
225445502 |
2.128003e+09 |
4 |
2015/1/9 |
9.30 |
9.42 |
9.91 |
9.19 |
401736419 |
3.835378e+09 |
date |
open |
close |
high |
low |
volume |
money |
|
0 |
2015/1/5 |
9.98 |
10.00 |
10.17 |
9.74 |
458099037 |
4.565388e+09 |
1 |
2015/1/6 |
9.90 |
9.85 |
10.23 |
9.71 |
346952496 |
3.453446e+09 |
2 |
2015/1/8 |
9.68 |
9.34 |
9.72 |
9.30 |
225445502 |
2.128003e+09 |
3 |
2015/1/9 |
9.30 |
9.42 |
9.91 |
9.19 |
401736419 |
3.835378e+09 |