DNR's blog

By DNR, 7 months ago, In English

The following code takes $$$O(n \cdot x)$$$ memory (NOT $$$O(x)$$$!!!):

vector<vector<int>> a(n);
for(int i = 0; i < n; i ++)
{
    a[i].assign(x, 0);
    a[i].clear();
}

You must call a[i].shrink_to_fit() after a[i].clear() to actually free the reserved memory, and cap memory usage to $$$O(x)$$$. I unfortunately didn't know this and was consequently traumatized by seemingly inexplicable MLEs in today's d2D.

Now that I think about it, all of those tens of times where I called .clear() on smaller vectors after merging them into larger vectors (small-to-large merging) have been pointless.

  • +281

»
7 months ago, -67

https://en.cppreference.com/w/cpp/container/vector/clear.html
Meme

Jokes aside, it's a good announcement for those who don't know.

»
7 months ago, +36

What about vector<int>().swap(a[i])?

  • »
    »
    7 months ago, 0

    I just read about shrink_to_fit(); my sources say it's a non-binding request, meaning it may or may not actually release the memory.

    IMHO, I feel safer using this idiom than shrink_to_fit()

  • »
    »
    7 months ago, 0

    Alternative options (careful with the last one: self-move-assignment leaves a[i] in a valid but unspecified state at best):

    a[i] = vector<int>();
    
    vector<int> kill;
    a[i] = std::move(kill);
    
    a[i] = std::move(a[i]);
    
»
7 months ago, +1

This also happens if you use a[i].resize(0).

»
7 months ago, +4

is the same true for sets and maps?

  • »
    »
    7 months ago, +9

    No, .clear() does free memory for std::set and std::map: they are node-based, so clearing deallocates every node.

»
7 months ago, +6

I prefer to let RAII do the work.

»
7 months ago, +8

Note: this is a symptom of constant-inefficient code. If your code's efficient but still running into a memory bottleneck, it's usually obvious how that bottleneck happens. Assume that shuffling data around in STL data structures can have unexpected effects on memory and you'll be fine.

Example of code using std::vector that uses known amounts of memory:

// asymptotically 4N^2 bytes
std::vector<std::vector<int>> A(N, std::vector<int>(N));
std::copy(begin(input), end(input), begin(A[0]));
for(int i = 0; i < N-1; i++) calculate(A[i], A[i+1]);

// 8N bytes
std::vector<int> result(N), tmp(N);
std::copy(begin(input), end(input), begin(tmp));
for(int i = 0; i < N-1; i++) {
    calculate(tmp, result);
    std::copy(begin(result), end(result), begin(tmp));
}

The pattern of "reserve resources you'll be working with, then call functions that work in-place or receive their input and output as arguments" is common when memory is a concern since it makes reasoning about consumed memory much easier.

  • »
    »
    7 months ago, 0

    I don't think there's any trivial way to do such "in-place" things in my particularly cursed use case (at least without adding a log factor). I was doing something along the following lines:

    vector<vector<int>> a(n);
    
    for(int i = 0; i < n; i ++) 
        a[0].push_back(i);
    
    for(int i = 0; i < n; i ++)
    {
        for(auto x : a[i])
        {
            int nxt = f(i, x);    //i < f(i, x) 
            if(nxt < n)
                a[nxt].push_back(x);    //performed at most O(n^(3/2)) times  
        }
        a[i].clear();
    }
    

    The total "algorithmic" size of all the vectors at any time here is clearly $$$\leq n$$$ but no memory is freed, so the total memory consumed ends up being $$$O(n \sqrt{n})$$$.

    • »
      »
      »
      7 months ago, 0

      Ok, you're making many push_backs to construct many relatively small vectors, which is going to result in many ($$$O(n \log n)$$$-ish) allocations. Those are slow, perhaps even slow enough to be a real bottleneck. This is what I mean by inefficiency.

      If you store $$$x$$$ at $$$a[f(x)]$$$, then $$$i \lt f(x)$$$ can't always hold when processing $$$a[i=f(x)]$$$. It'd result in TLE or MLE.

      Assuming you're instead storing some g(x) at a[f(x)], where $$$g(x) \gt f(x)$$$, you could observe that you're dealing with chains $$$x \rightarrow f(x), g(x), \ldots$$$ up to $$$n$$$, one from each $$$x = i \in [0, n)$$$. Process them directly and independently; there's no need to store anything. Since your goal isn't to populate a but to do something else, you could perhaps do it independently for each chain.

      Even if you wanted to store stuff, you can precalculate the sizes of a[i] using those chains and perhaps do something with them. If the memory limit weren't a concern, flattening a jagged 2D array with known sizes would erase the time bottleneck of allocations.

      If your real code is more complex than this example, you could end up in a doomed situation no matter what, but that's the problem with complex code where you're shuffling around a lot of mutually dependent data. It's hard to code without bugs, hard to debug, and has all kinds of nasty runtime inefficiencies. Sometimes it's best to stop and try another way... and sometimes to just grit your teeth and look for side effects.

      • »
        »
        »
        »
        7 months ago, 0

        push_back is amortized $$$O(1)$$$ though. Should we really be concerned about memory allocation speed? I can't imagine it leading to a TLE in real scenarios.

        • »
          »
          »
          »
          »
          7 months ago, +1

          It absolutely can. $$$O(10^{10})$$$ is $$$O(1)$$$ too and you have to consider that constant factor in real scenarios. I've been burned multiple times by push_backing to create many small vectors.

          • »
            »
            »
            »
            »
            »
            7 months ago, 0

            Hmm, that might be the case in high rated problems where you have to do too many operations per test case. Personally, I have never faced this issue.

            Still, I try to optimize memory usage with tricks like:

            1. swap(dp, ndp) when the current row's states depend only on the previous row's states. You should use swap ($$$O(1)$$$) instead of std::copy ($$$O(n)$$$) in your first example, as it's less error-prone.

            2. Using a length-k vector indexed with i % k when the current state depends only on the last k states.

            3. vector<array<T, K>> (single allocation) instead of vector<vector<T>> (one allocation per inner vector), if K is a constant or its upper bound is small.

            Processing each chain independently is a lot cooler though.

            • »
              »
              »
              »
              »
              »
              »
              7 months ago, 0

              I wouldn't know, I solve mid-rated problems. Of course it won't happen in problems that have straightforward clean solutions.

              You should use swap instead of std::copy in your first example as it's less error-prone.

              It's also less obvious what it's supposed to do, but it moves the underlying buffer instead of copying the data through a temporary, so that's nice.

»
7 months ago, +3

Usually you don't need to think much about the memory usage of vectors in CP: the time spent initializing memory is proportional to the memory allocated, so when you exceed the memory limit you are probably exceeding the time limit as well.

»
7 months ago, 0

For everything, or just for vectors? I didn't know about this.