Some questions about pandas apply performance - V2EX

This topic was created 1322 days ago; the information mentioned may have changed or developed since.

I have a DataFrame in which several columns each need to be processed separately.

Option 1:

    df['foo1'] = df['foo1'].apply(lambda x: func1(x))
    df['foo2'] = df['foo2'].apply(lambda x: func2(x))
    df['foo3'] = df['foo3'].apply(lambda x: func3(x))
    ...

Option 2:

    def pipeline(ser):
        ser = (ser.pipe(func1)
                  .pipe(func2)
                  .pipe(func3))
        return ser  # without a return, apply would receive None for every row

    df = df.apply(lambda x: pipeline(x), axis=1)

My intuition was that option 2 should be faster than option 1, but when I actually ran both, option 2 took longer...

Did I misunderstand something? Is processing column by column faster, even though it still uses apply?
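For anyone who wants to reproduce the comparison, here is a minimal timing sketch. The post does not show func1 to func3, so the three functions below are hypothetical stand-ins, and the DataFrame size and contents are made up. The row-wise variant is written so that it yields the same result as the column-wise one; absolute timings will differ between machines and pandas versions.

```python
import time

import numpy as np
import pandas as pd


# Hypothetical stand-ins for func1..func3 (not shown in the original post).
def func1(x):
    return x + 1


def func2(x):
    return x * 2


def func3(x):
    return x - 3


rng = np.random.default_rng(0)
df = pd.DataFrame(rng.random((5_000, 3)), columns=["foo1", "foo2", "foo3"])


def by_column(frame):
    # Option 1: one Series.apply call per column.
    out = frame.copy()
    out["foo1"] = out["foo1"].apply(func1)
    out["foo2"] = out["foo2"].apply(func2)
    out["foo3"] = out["foo3"].apply(func3)
    return out


def process_row(row):
    # Called once per row; `row` is a Series built from that row's values.
    return pd.Series(
        {
            "foo1": func1(row["foo1"]),
            "foo2": func2(row["foo2"]),
            "foo3": func3(row["foo3"]),
        }
    )


def by_row(frame):
    # Option 2 style: a single DataFrame.apply call over rows (axis=1).
    return frame.apply(process_row, axis=1)


t0 = time.perf_counter()
res_col = by_column(df)
t1 = time.perf_counter()
res_row = by_row(df)
t2 = time.perf_counter()

print(f"column-wise: {t1 - t0:.3f}s, row-wise: {t2 - t1:.3f}s")
```

Fewer apply calls in the source code does not mean fewer Python-level calls at runtime: with axis=1 the function runs once per row, and each row has to be materialized as a Series first.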

  
  
4 replies    2022-09-28 17:01:58 +08:00 
           1
 
 cyberpoint   
   Sep 28, 2022 
 I don't understand.
  
 
 
           2
 
 Renormalization   
   Sep 28, 2022    2 
 Column-wise processing is definitely the fastest in pandas. The book Python for Finance puts it this way: "Working with the columns (Series objects) directly is the fastest approach". It also says: "The slowest option is to use the apply() method row-by-row; this is like looping on the Python level over all rows".
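 The distinction in those two quotes can be sketched with three ways of computing the same values. The column names and the arithmetic here are invented for illustration and are not taken from the book or from the original post.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame(
    {
        "a": np.arange(5, dtype="float64"),
        "b": np.arange(5, dtype="float64") * 10,
    }
)

# 1) Work on the columns (Series objects) directly: one vectorized expression.
direct = df["a"] * 2 + df["b"]

# 2) Row-by-row apply: the lambda is called once per row at the Python level.
row_wise = df.apply(lambda row: row["a"] * 2 + row["b"], axis=1)

# 3) An explicit Python loop over rows, which is roughly what (2) amounts to.
looped = pd.Series(
    [r.a * 2 + r.b for r in df.itertuples(index=False)],
    index=df.index,
)

print(direct.tolist())
```

 All three produce the same numbers; only the first one avoids a Python-level call per row.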
  
 
 
           3
 
 ipwx   
   Sep 28, 2022    2 
 Pandas stores its data by column.
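 One visible consequence of column-oriented storage, shown as a small sketch with made-up data: each column keeps its own dtype, while a row has to be assembled from all columns and cast to a single common dtype.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame(
    {
        "n": np.array([1, 2, 3], dtype="int64"),
        "x": np.array([0.5, 1.5, 2.5], dtype="float64"),
    }
)

# Each column keeps its own dtype.
col_dtypes = df.dtypes.astype(str).to_dict()

# Selecting a column returns a Series with that column's dtype.
col = df["n"]

# Selecting a row builds a new Series from every column,
# so the integer value is upcast to the common dtype (float64 here).
row = df.iloc[0]

print(col_dtypes)
print(col.dtype, row.dtype)
```

 Row-wise apply pays this row-assembly cost once for every row, on top of the per-row function call.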
  
 
 
           4
 
 LuJason   OP
   Sep 28, 2022    1 
 @Renormalization Yes, I have confirmed that by experiment. At first I thought the version with fewer apply calls would be faster.

    
   