PyGotham 2014 Introduction to Profiling

PyGotham 2014
Introduction to
Profiling
Perrin Harkins

“We should forget about small efﬁciencies, say
about 97% of the time: premature optimization is
the root of all evil. Yet we should not pass up our
opportunities in that critical 3%. A good
programmer will not be lulled into complacency
by such reasoning, he will be wise to look carefully
at the critical code; but only after that code has
been identiﬁed.”
–Donald Knuth

“Bottlenecks occur in surprising places, so don't
try to second guess and put in a speed hack until
you have proven that's where the bottleneck is.”
–Rob Pike

What will a profiler tell us?
❖ Function execution time!
❖ Memory usage, etc. are possible, but for another day!
❖ More about line proﬁling later!
❖ Real (wall clock) time!
❖ Inclusive vs exclusive time!
❖ Number of calls, primitive and recursive

cProfile
❖ Generates proﬁle data that can be read in shell or GUI
tools!
❖ 30% or more speed penalty

cProfile
From command line:!
$ python -m cProfile -o myscript.prof myscript.py

cProfile
Or, in your program:!
import cProfile
cProfile.run('slow_function', 'myscript.prof')

cProfile
Or, even more ﬂexible:!
pr = cProfile.Profile()
pr.enable()
… thing you want to proﬁle …!
pr.disable()

pstats
import pstats
profile = pstats.Stats('myscript.prof')
profile.add('myscript.prof2')
profile.strip_dirs()
profile.sort_stats('cumulative')
profile.print_stats(20)

12192418 function calls (11990470 primitive calls) in 84.268 seconds
!
Ordered by: cumulative time
List reduced from 1211 to 20 due to restriction <20>
!
ncalls tottime percall cumtime percall filename:lineno(function)
1 0.000 0.000 84.402 84.402 <string>:1(<module>)
1 0.021 0.021 84.402 84.402 act_bench.py:243(_do_act)
500 0.096 0.000 84.381 0.169 __init__.py:170(act)
500 0.007 0.000 35.874 0.072 petition_actions.py:460(save)
500 0.066 0.000 33.431 0.067 action_processor.py:1303(save)
500 0.160 0.000 22.684 0.045 users.py:1002(save)
10501 0.175 0.000 21.963 0.002 query.py:852(_fetch_all)
14001 0.286 0.000 21.472 0.002 compiler.py:758(execute_sql)
6501 0.047 0.000 14.200 0.002 query.py:76(__len__)

profile.print_callees('full_clean', 10)
!
List reduced from 1211 to 2 due to restriction <'full_clean'>
!
Function called...
ncalls tottime cumtime
forms.py:260(full_clean) -> 500 0.177 2.855 forms.py:
277(_clean_fields)
500 0.003 0.030 forms.py:298(_clean_form)
500 0.031 2.784 models.py:
393(_post_clean)
base.py:918(full_clean) -> 500 0.001 0.001 base.py:738(clean)
500 0.096 2.399 base.py:952(clean_fields)

profile.print_callers('full_clean')
!
List reduced from 1211 to 2 due to restriction <'full_clean'>
!
Function was called by...
ncalls tottime cumtime
forms.py:260(full_clean) <- 500 0.009 5.678 forms.py:117(errors)
base.py:918(full_clean) <- 500 0.005 2.405 models.py:
393(_post_clean)

KCacheGrind
!
❖ GUI for viewing proﬁle data!
❖ Run your proﬁle output through pyprof2calltree!
❖ On a Mac, qcachegrind is easier to install

RunSnakeRun
❖ Squaremap of call tree!
❖ Maybe useful for spotting large exclusive time functions

Using your results
❖ Bottom up approach!
❖ Start with a large exclusive time sub!
❖ Climb up call graph to ﬁnd something you can affect!
❖ "We're spending a lot of time in deepcopy(). What's
calling that so much?"!
❖ Might miss higher-level ﬁxes

Using your results
❖ Top down approach!
❖ Start with a large inclusive time sub!
❖ Walk down call graph to ﬁnd something you can
affect!
❖ "We're spending a lot of time in this validate() method.
What's it doing that takes so long?"!
❖ Look for structural changes

Line profiling
❖ line_proﬁler does exist!
❖ Results are not very actionable!
❖ If you get this far, you probably should stop (or refactor
your methods!)

Good profiling technique
❖ Create a repeatable benchmark test!
❖ Allows you to measure progress!
❖ Iterations/second!
❖ Time for n iterations

What usually helps
❖ Removing unnecessary work!
❖ “We load that conﬁg data every time, even when we don’t
use it.”!
❖ Using a more efﬁcient algorithm

What usually helps
❖ Batching I/O (disk or net) operations!
❖ Database stuff!
❖ SQL tuning!
❖ Indexes!
❖ Transactions

What usually helps
❖ Caching!
❖ Easy to add, hard to live with!
❖ Code complexity!
❖ Invalidation calls!
❖ Dependency tracking!
❖ Business customers care about data freshness

PyGotham 2014 Introduction to Profiling

Recomendados

Recomendados

Mais conteúdo relacionado

Mais procurados

Mais procurados (20)

Semelhante a PyGotham 2014 Introduction to Profiling

Semelhante a PyGotham 2014 Introduction to Profiling (20)

Mais de Perrin Harkins

Mais de Perrin Harkins (10)

Último

Último (20)

PyGotham 2014 Introduction to Profiling