What is Big O notation?

Big O notation describes the upper bound on how an algorithm's time or space requirements grow relative to the input size, ignoring constant factors and lower-order terms.

What is the difference between O(n) and O(log n)?

O(n) grows linearly with the input, while O(log n) grows much slower—doubling the input adds only one more step. Binary search is O(log n); a single loop over n items is O(n).

Why does Big O ignore constants?

Because it describes growth as input size approaches infinity, where constant factors and lower-order terms become negligible compared to the dominant term.

What's a good Big O to aim for in interviews?

It depends on the problem, but generally aim for the lowest complexity achievable—often O(n) or O(n log n). Interviewers usually want you to improve a brute-force solution toward the optimal one.

Big O Notation: Time & Space Complexity Explained

In this free email course, I'll teach you the right way of thinking to break down tricky algorithmic coding interview questions.

7 days. One short-but-helpful email a day. Unsubscribe whenever.

No CS degree necessary.

The idea behind big O notation

Big O notation is the language we use for talking about how long an algorithm takes to run. It's how we compare the efficiency of different approaches to a problem.

It's like math except it's an awesome, not-boring kind of math where you get to wave your hands through the details and just focus on what's basically happening.

With big O notation we express the runtime in terms of—brace yourself—how quickly it grows relative to the input, as the input gets arbitrarily large.

Let's break that down:

how quickly the runtime grows—It's hard to pin down the exact runtime of an algorithm. It depends on the speed of the processor, what else the computer is running, etc. So instead of talking about the runtime directly, we use big O notation to talk about how quickly the runtime grows.
relative to the input—If we were measuring our runtime directly, we could express our speed in seconds. Since we're measuring how quickly our runtime grows, we need to express our speed in terms of...something else. With Big O notation, we use the size of the input, which we call "n." So we can say things like the runtime grows "on the order of the size of the input" () or "on the order of the square of the size of the input" ().
as the input gets arbitrarily large—Our algorithm may have steps that seem expensive when n is small but are eclipsed eventually by other steps as n gets huge. For big O analysis, we care most about the stuff that grows fastest as the input grows, because everything else is quickly eclipsed as n gets very large. (If you know what an asymptote is, you might see why "big O analysis" is sometimes called "asymptotic analysis.")

If this seems abstract so far, that's because it is. Let's look at some examples.

Some examples

void printFirstItem(const vector<int>& items) { cout << items[0] << endl; }

This function runs in time (or "constant time") relative to its input. The input vector could be 1 item or 1,000 items, but this function would still just require one "step."

void printAllItems(const vector<int>& items) { for (int item : items) { cout << item << endl; } }

This function runs in time (or "linear time"), where n is the number of items in the vector. If the vector has 10 items, we have to print 10 times. If it has 1,000 items, we have to print 1,000 times.

void printAllPossibleOrderedPairs(const vector<int>& items) { for (int firstItem : items) { for (int secondItem : items) { cout << firstItem << ", " << secondItem << endl; } } }

Here we're nesting two loops. If our vector has n items, our outer loop runs n times and our inner loop runs n times for each iteration of the outer loop, giving us n^2 total prints. Thus this function runs in time (or "quadratic time"). If the vector has 10 items, we have to print 100 times. If it has 1,000 items, we have to print 1,000,000 times.

N could be the actual input, or the size of the input

Both of these functions have runtime, even though one takes an integer as its input and the other takes a vector:

void sayHiNTimes(size_t n) { for (size_t i = 0; i < n; ++i) { cout << "hi" << endl; } } void printAllItems(const vector<int>& items) { for (int item : items) { cout << item << endl; } }

So sometimes n is an actual number that's an input to our function, and other times n is the number of items in an input vector (or an input map, or an input object, etc.).

Drop the constants

This is why big O notation rules. When you're calculating the big O complexity of something, you just throw out the constants. So like:

void printAllItemsTwice(const vector<int>& items) { for (int item : items) { cout << item << endl; } // once more, with feeling for (int item : items) { cout << item << endl; } }

This is , which we just call .

void printFirstItemThenFirstHalfThenSayHi100Times(const vector<int>& items) { cout << items[0] << endl; size_t middleIndex = items.size() / 2; size_t index = 0; while (index < middleIndex) { cout << items[index] << endl; ++index; } for (size_t i = 0; i < 100; ++i) { cout << "hi" << endl; } }

This is , which we just call .

Why can we get away with this? Remember, for big O notation we're looking at what happens as n gets arbitrarily large. As n gets really big, adding 100 or dividing by 2 has a decreasingly significant effect.

Keep up the momentum! Sign up to get a data structures and algorithms practice question sent to you every week.

Cool, watch your inbox!

Drop the less significant terms

For example:

void printAllNumbersThenAllPairSums(const vector<int>& numbers) { cout << "these are the numbers:" << endl; for (int number : numbers) { cout << number << endl; } cout << "and these are their sums:" << endl; for (int firstNumber : numbers) { for (int secondNumber : numbers) { cout << (firstNumber + secondNumber) << endl; } } }

Here our runtime is , which we just call . Even if it was , it would still be .

Similarly:

Again, we can get away with this because the less significant terms quickly become, well, less significant as n gets big.

We're usually talking about the "worst case"

Often this "worst case" stipulation is implied. But sometimes you can impress your interviewer by saying it explicitly.

Sometimes the worst case runtime is significantly worse than the best case runtime:

bool contains(const vector<int>& haystack, int needle) { // does the haystack contain the needle? for (int n : haystack) { if (n == needle) { return true; } } return false; }

Here we might have 100 items in our haystack, but the first item might be the needle, in which case we would return in just 1 iteration of our loop.

In general we'd say this is runtime and the "worst case" part would be implied. But to be more specific we could say this is worst case and best case runtime. For some algorithms we can also make rigorous statements about the "average case" runtime.

Space complexity: the final frontier

Sometimes we want to optimize for using less memory instead of (or in addition to) using less time. Talking about memory cost (or "space complexity") is very similar to talking about time cost. We simply look at the total size (relative to the size of the input) of any new variables we're allocating.

This function takes space (we use a fixed number of variables):

void sayHiNTimes(size_t n) { for (size_t i = 0; i < n; ++i) { cout << "hi" << endl; } }

This function takes space (the size of hiVector scales with the size of the input):

vector<string> vectorOfHiNTimes(size_t n) { vector<string> hiVector; for (size_t i = 0; i < n; ++i) { hiVector.push_back("hi"); } return hiVector; }

Usually when we talk about space complexity, we're talking about additional space, so we don't include space taken up by the inputs. For example, this function takes constant space even though the input has n items:

int getLargestItem(const vector<int>& items) { int largest = numeric_limits<int>::min(); for (int item : items) { if (item > largest) { largest = item; } } return largest; }

Sometimes there's a tradeoff between saving time and saving space, so you have to decide which one you're optimizing for.

Big O analysis is awesome except when it's not

You should make a habit of thinking about the time and space complexity of algorithms as you design them. Before long this'll become second nature, allowing you to see optimizations and potential performance issues right away.

Asymptotic analysis is a powerful tool, but wield it wisely.

Big O ignores constants, but sometimes the constants matter. If we have a script that takes 5 hours to run, an optimization that divides the runtime by 5 might not affect big O, but it still saves you 4 hours of waiting.

Beware of premature optimization. Sometimes optimizing time or space negatively impacts readability or coding time. For a young startup it might be more important to write code that's easy to ship quickly or easy to understand later, even if this means it's less time and space efficient than it could be.

But that doesn't mean startups don't care about big O analysis. A great engineer (startup or otherwise) knows how to strike the right balance between runtime, space, implementation time, maintainability, and readability.

You should develop the skill to see time and space optimizations, as well as the wisdom to judge if those optimizations are worthwhile.

Big O Notation

Using not-boring math to measure code's efficiency

Get the 7-day crash course!

Get the coding interview crash course