UPD: I've reduced the code size.
I've recently found that the following code produces wrong output.
#include <bitset>
#include <iostream>

const int N = 105;
std::bitset<N> ok[N][N];
int n = 5;

int main() {
  ok[2][2].set(2);
  for (int i = n; i; i--)
    for (int j = i; j <= n; j++) {
      ok[i][j] = ok[i][j] | ok[i + 1][j] | ok[i][j - 1];
    }
  std::cout << ok[2][5][2] << '\n';
  return 0;
}
Compiled with -O3 -mtune=skylake -march=skylake, the code outputs 0.
However, if you simulate the code by hand you will see that the correct answer should be 1: the bit set in ok[2][2] propagates along row 2 through the ok[i][j] |= ok[i][j - 1] step, so ok[2][5] must also have bit 2 set (see the sketch below).
Note that the compiler seems to generate incorrect SSE instructions.
Again, I believe this code is UB-free and does not rely on anything implementation-defined.
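Here is a minimal sketch of that simulation, replaying the same recurrence with a plain bool array instead of std::bitset so the vectorized bitset code path is not involved; at any optimization level it prints 1:

#include <iostream>

const int N = 105;
bool ok[N][N][N]; // ok[i][j][b] mirrors bit b of the original ok[i][j]
int n = 5;

int main() {
  ok[2][2][2] = true; // same as ok[2][2].set(2)
  for (int i = n; i; i--)
    for (int j = i; j <= n; j++)
      for (int b = 0; b < N; b++)
        ok[i][j][b] = ok[i][j][b] | ok[i + 1][j][b] | ok[i][j - 1][b];
  std::cout << ok[2][5][2] << '\n'; // prints 1: the bit propagates from (2,2) to (2,5)
  return 0;
}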

Further reduced:
I've submitted a bug. https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116768
It seems that the trunk branch has been updated, and the bug is fixed. Thank you!
I bet AI couldn't do that
It's good with -O2. I am always scared to use -O3, and now I have an additional reason to keep being scared. While searching I found something related to -O3 and AVX that has apparently gone unaddressed for a long time (>10 years): https://gcc.gnu.org/bugzilla/show_bug.cgi?id=49001. This may not be the same issue though, as I tried to adjust alignas and that didn't help.
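What I mean by adjusting alignas is roughly the following sketch (the 64-byte value is just an example I picked); as said above, it did not change the wrong output:

#include <bitset>
#include <iostream>

const int N = 105;
// Force the array onto a wider boundary (64 is only an example value),
// in case the miscompile depended on the array's alignment.
alignas(64) std::bitset<N> ok[N][N];
int n = 5;

int main() {
  ok[2][2].set(2);
  for (int i = n; i; i--)
    for (int j = i; j <= n; j++)
      ok[i][j] = ok[i][j] | ok[i + 1][j] | ok[i][j - 1];
  std::cout << ok[2][5][2] << '\n'; // still 0 under the affected -O3 build
  return 0;
}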