Processes can differ massively in the rates at which they consume the computational resources of space and time, as the previous examples illustrate. However, exactly determining just how much space or time will be used when calling a function is a very difficult task that depends upon many factors. A useful way to analyze a process is to categorize it along with a group of processes that all have similar requirements. A useful categorization is the order of growth of a process, which expresses in simple terms how the resource requirements of a process grow as a function of the input.

As an introduction to orders of growth, we will analyze the function count_factors below, which counts the number of integers that evenly divide an input n. The function attempts to divide n by every integer less than or equal to its square root. The implementation takes advantage of the fact that if k divides n and k<n , then there is another factor j=n/k such that j>n.

1from math import sqrt
2def count_factors(n):
3    sqrt_n = sqrt(n)
4    k, factors = 1, 0
5    while k < sqrt_n:
6        if n % k == 0:
7            factors += 2
8        k += 1
9    if k * k == n:
10        factors += 1
11    return factors
12
13result = count_factors(576)
End
line that has just executed

next line to execute

Global
sqrt
 
count_factors
 
result21
f1: count_factors [parent=Global]
n576
sqrt_n24.0
k24
factors21
Return
value
21
func sqrt(...) [parent=Global]
func count_factors(n) [parent=Global]

How much time is required to evaluate count_factors? The exact answer will vary on different machines, but we can make some useful general observations about the amount of computation involved. The total number of times this process executes the body of the while statement is the greatest integer less than n. The statements before and after this while statement are executed exactly once. So, the total number of statements executed is wn+v, where w is the number of statements in the while body and v is the number of statements outside of the while statement. Although it isn't exact, this formula generally characterizes how much time will be required to evaluate count_factors as a function of the input n.

A more exact description is difficult to obtain. The constants w and v are not constant at all, because the assignment statements to factors are sometimes executed but sometimes not. An order of growth analysis allows us to gloss over such details and instead focus on the general shape of growth. In particular, the order of growth for count_factors expresses in precise terms that the amount of time required to compute count_factors(n) scales at the rate n, within a margin of some constant factors.

Theta Notation. Let n be a parameter that measures the size of the input to some process, and let R(n) be the amount of some resource that the process requires for an input of size n. In our previous examples we took n to be the number for which a given function is to be computed, but there are other possibilities. For instance, if our goal is to compute an approximation to the square root of a number, we might take n to be the number of digits of accuracy required.

R(n) might measure the amount of memory used, the number of elementary machine steps performed, and so on. In computers that do only a fixed number of steps at a time, the time required to evaluate an expression will be proportional to the number of elementary steps performed in the process of evaluation.

We say that R(n) has order of growth Θ(f(n)), written R(n)=Θ(f(n)) (pronounced "theta of f(n)"), if there are positive constants k1 and k2 independent of n such that

k1f(n)R(n)k2f(n)

for any value of n larger than some minimum m. In other words, for large n, the value R(n) is always sandwiched between two values that both scale with f(n):

  • A lower bound k1f(n) and
  • An upper bound k2f(n)

We can apply this definition to show that the number of steps required to evaluate count_factors(n) grows as Θ(n) by inspecting the function body.

First, we choose k1=1 and m=0, so that the lower bound states that count_factors(n) requires at least 1n steps for any n>0. There are at least 4 lines executed outside of the while statement, each of which takes at least 1 step to execute. There are at least two lines executed within the while body, along with the while header itself. All of these require at least one step. The while body is evaluated at least n1 times. Composing these lower bounds, we see that the process requires at least 4+3(n1) steps, which is always larger than k1n.

Second, we can verify the upper bound. We assume that any single line in the body of count_factors requires at most p steps. This assumption isn't true for every line of Python, but does hold in this case. Then, evaluating count_factors(n) can require at most p(5+4n), because there are 5 lines outside of the while statement and 4 within (including the header). This upper bound holds even if every if header evaluates to true. Finally, if we choose k2=5p, then the steps required is always smaller than k2n. Our argument is complete.