[C++ Question] Reading Compile Time Constants

#	User	Rating
1	tourist	3985
2	jiangly	3741
3	jqdai0815	3682
4	Benq	3529
5	orzdevinwang	3526
6	ksun48	3489
7	Radewoosh	3483
8	Kevin114514	3442
9	ecnerwala	3392
9	Um_nik	3392

#	User	Contrib.
1	cry	169
2	maomao90	162
2	Um_nik	162
2	atcoder_official	162
5	djm03178	158
6	-is-this-fft-	157
7	adamant	155
8	awoo	154
8	Dominater069	154
10	nor	150

Hey!

Consider the following solutions for the CSES problem Coin Combination I (Time Limit is 1s):

Solution with `constexpr` or `const` mod

#include <bits/stdc++.h>
using namespace std;
constexpr int mod = 1000000007;
 
int dp[1000001];
vector<int> v;
 
int main() {
        cin.tie(0)->sync_with_stdio(0);

	int n, x; cin >> n >> x; v.resize(n);
	for (auto &x : v) { cin >> x; }
	dp[0] = 1; for (int i = 1; i <= x; i++)
		for (int j = 0; j < n; j++) if (v[j] <= i)
			dp[i] = (dp[i] + dp[i - v[j]]) % mod;
 
	cout << dp[x] << '\n';
}

Solution with non `const` mod

#include <bits/stdc++.h>
using namespace std;
int mod = 1000000007;
 
int dp[1000001];
vector<int> v;
 
int main() {
        cin.tie(0)->sync_with_stdio(0);

	int n, x; cin >> n >> x; v.resize(n);
	for (auto &x : v) { cin >> x; }
	dp[0] = 1; for (int i = 1; i <= x; i++)
		for (int j = 0; j < n; j++) if (v[j] <= i)
			dp[i] = (dp[i] + dp[i - v[j]]) % mod;
 
	cout << dp[x] << '\n';
}

Note that using const or constexpr in the first solution is the same since we are initializing a global variable with a prvalue literal.

The first solution passes with Runtime ~0.58s in the worst test case, whereas the second solution gives a TLE outcome.

It seems that reading from compile time constants may be twice as fast as reading from arbitrary variables. Is there a particular reason for this?

EDIT: Big thanks to AkibAzmain and MattTheNub, I got the following answers:

Compile time constants are inlined by the compiler which saves multiple machine register reading operations.
When $$$mod$$$ is constant the modulo operator uses a technique called Barret Reduction to speed up computations.

Comments (6)

Write comment?

AkibAzmain

19 months ago, # |

← Rev. 2 →

-6

It's because the compiler inlines the constants. It means that the constants are not stored in a variable at all when the program is running, instead all references to the constant is replaced directly by it's value. Since the CPU doesn't need to access the memory to get the value, it's faster.

Moreover, since the compiler knows that it's constant, so it can apply extra optimizations that doesn't work if you use a variable.

→ Reply

0-jij-0

19 months ago, # ^ |

Make sense. I'm still surprised though that this creates such an overhead... Thank you!

I think there's something more involved here, like optimizations, CPU cache, etc, etc.

fmoeran

I think the reason it’s had such a large impact in this solution is because of the 3 lines or so of the main algorithm, the only line with anything more complex than just one addition or comparison is the line where you’re calling mod.

The machine code this creates is a bunch of calls where the value stored in mod has to be retrieved and put in a register then replaced with the next value. Which adds a good few instructions inside the loop.

Also constexpr allows lines to be precomputed in compile time. Whilst it obviously can’t just precompute every possible answer before it’s even given an input, it does allow the compiler to bake some information that could make it faster in run time.

MattTheNub

+27

Computing values modulo a constant is typically faster since the compiler can perform a Barrett reduction to compute the result using multiplication and bit shifts, which is faster than computing it with integer division. If the modulus is not constant, the compiler has to use regular, slow integer division.

Very interesting. Thank you for sharing! :D

0-jij-0's blog