Round 2 of the 2019 Facebook Hacker Cup is less than 48 hours away!
The round will begin on July 13th, 2019 at 10am PDT and will last for 3 hours. You can check the start time in your local timezone here.
The contest will be available here shortly before the round begins.
You're eligible to compete in Round 2 if you scored at least 30 points in Round 1.
The top 200 contestants will advance to Round 3, which will take place on July 27th, and the top 500 contestants will win Hacker Cup t-shirts! We'll begin shipping out t-shirts by September, at which point the winners will receive emails with tracking information. Please note that all submission judgments will only be revealed after the round ends, and that time penalty will be used to break score ties. More details about rules and other information can be found here.
Good luck!
The corresponding Facebook post can be found here.
Update: The round has ended, and solutions have been posted here. Thanks for participating!
Where is the final contest held?
nevermind, found it on the page:
"We're also excited to announce that this year's Hacker Cup Finals will be held on October 24-26 at Facebook Dublin!"
Just to bump this back to recent actions: the contest will start in 1.5 hours!
Can we solve D using the idea that the input is random, relying on the fact that the longest decreasing subsequence of a random sequence will be small?
I have a solution which solves any instance in $$$O(N \log N)$$$. But don't listen to me, I got WA.
Mine uses a segment tree: for each "R" position, find the interval it covers.
Test 7
n = 199128
Test 97
n = 799999
Test 133
n = 799306
All other tests have a decreasing sequence of length $$$n \le 10$$$. So, not all the tests are random :)
The sequence can be made strictly increasing or strictly decreasing by setting $$$A = 0$$$, $$$B = 1$$$, and $$$C = 1$$$ or $$$C = D - 1$$$ respectively.
I don't think an email was sent out? I found out about the contest an hour after it started, and now I didn't make the top 200 (215th) due to the time penalty :(
Same. I could have easily solved A+B and gotten into the top 500 for a T-shirt, but I solved B way too late and had to attempt C, because A alone wouldn't have been enough anymore with the time penalty.
I checked my inbox for an email to see whether the contest was this weekend. When it turned out to be empty, I just went to sleep a little early. And I woke up to this.
Gosh, why do you put constraints like $$$T \le 350$$$, $$$n \le 4000$$$, $$$m \le 10000$$$ with no constraint on their sum, and then in the actual test give input data where the sum of the $$$m$$$'s is about 1% of what it could be? The constraints were so high that I was fairly dubious whether $$$O(tnm \cdot \alpha(n))$$$ would pass, but I decided to go with it because many people had submitted it, it seemed really hard to get something faster than $$$O(tnm)$$$, and my laptop is pretty fast. And then you give input data on which my program runs in 0.5s, when I was afraid it would time out within the 6 minutes?
Could you not take the worst ideas from Jagiellonian University where they sometimes give "t is the number of testcases, $$$t \le 2 \cdot 10^9$$$" and want contestants to rely on guessed assumptions about what the input size really is?
In Facebook contests, the number of maximum-size tests in the test set is usually no bigger than 10-20.
You can do $$$O(tn^2)$$$. I expected them to maybe make it hard for $$$O(tnm)$$$, but that was not the scenario.
Do a 2D dp to find whether $$$i$$$ and $$$j$$$ are connected; the dp runs in $$$O(m+n^2)$$$.
You only need the longest palindrome for each center, so you can build the graph in $$$O(m+n^{2})$$$; I don't know why you'd need DSU when you can do DFS. $$$O(tn^{2})$$$ looked like a pretty safe bet.
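Since a couple of people asked how the graph-building step works, here is a minimal sketch of the "longest palindrome per center" idea combined with a DSU. It assumes each of the $$$m$$$ constraints is given as a 0-indexed pair $$$(l, r)$$$ meaning that $$$s[l..r]$$$ must be a palindrome; the actual input format of B may differ, so treat this as an illustration rather than a drop-in solution.

```cpp
#include <bits/stdc++.h>
using namespace std;

// Minimal DSU with path compression.
struct DSU {
    vector<int> p;
    DSU(int n) : p(n) { iota(p.begin(), p.end(), 0); }
    int find(int x) { return p[x] == x ? x : p[x] = find(p[x]); }
    void unite(int a, int b) { p[find(a)] = find(b); }
};

int main() {
    int n, m;
    cin >> n >> m;
    // For each center (encoded as l + r) keep only the widest constraint;
    // narrower palindromes with the same center are implied by the widest one.
    vector<int> widest(2 * n, -1);                    // widest[l + r] = max(r - l)
    for (int i = 0; i < m; i++) {
        int l, r;                                     // assumed: s[l..r] must be a palindrome
        cin >> l >> r;
        widest[l + r] = max(widest[l + r], r - l);
    }
    DSU dsu(n);
    for (int c = 0; c < 2 * n; c++)                   // at most ~n/2 unions per center
        for (int len = widest[c]; len > 0; len -= 2)
            dsu.unite((c - len) / 2, (c + len) / 2);  // mirrored positions must be equal
    // The DSU components are the sets of positions forced to carry the same bit;
    // their sizes are the weights that go into the knapsack discussed below.
}
```

Keeping only the widest constraint per center bounds the total number of union operations by $$$O(n^2)$$$, which is where the $$$O(m+n^2)$$$ above comes from.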
No reason for DSU other than that I prefer coding it over DFS ¯\_(ツ)_/¯. I had that observation about the longest palindrome as well, but it only decreases m from 10000 to 8000, so that's not that big of a gain :P. But on the other hand, if m=8000 then many intervals are short and we actually need to traverse just half of each interval... Yeah, constant factor analysis xd...
DFS on 8000000 edges for 350 times doesn't look like a safe bet for me. (I'm quite bad at constant tuning)
I only wrote $$$O(tn^2)$$$ because I can't do knapsack in $$$O(n^2/64)$$$ when I should track the actual answer.
What is so special about DFS? The number of recursive calls is only $$$O(n)$$$; everything else is just array traversal, even in cache-friendly order.
You need each test to run in no more than 0.5s. Looks fine to me.
Doesn't 350 * 4000 * 10000 = 14,000,000,000 usually work out when the memory usage is around 40,000,000 ints (ten megabytes?)? I kind of assumed that it should finish in about 10 seconds or so... but maybe I assumed wrong?
40 million ints is 160MB, btw.
I thought that idea was from Yangon Regional Contest, in which they put $$$n \leq 2^{31}$$$ in problem C ($$$n$$$ is the number of vertices in the graph)
Union find with path compression runs in $$$O(V + E \log_{2 + \frac{E}{V}}(V))$$$ (here $$$V = n$$$ and $$$E \leq \frac{n m}{2}$$$), which is $$$O(V + E)$$$ on dense graphs (more precisely, if $$$E \in O(V ^{1 + \varepsilon})$$$ for some $$$\varepsilon > 0$$$). Hence in this task, the worst case runtime with union find is also $$$O(t (n^{1 + \varepsilon} + n m) )$$$. And since this only uses $$$O(n+m)$$$ memory, stuff fits into cache, so I expected it to run quite fast.
I would agree that having more precise bounds on the input size would be nice (e.g. $$$n \leq 100$$$ in all except at most $$$50$$$ cases, similar to what they do in Code Jam).
If $$$E=V$$$ it will be $$$O(V + E \log_3{(V)})$$$, how is it $$$O(V + E)$$$?
"on dense graphs"
LoneFox, is it possible to address this for future contests? It should be very easy to add this to future problems, and it would improve "quality of life" a lot.
So, solutions of the two problems I solved:
A — All of the points should have same (x+y)%2 if k = 2, else answer is NO
B — Used DSU to find the sizes of the sets which have to be the same, then used knapsack and backtracked to find the solution. This should have timed out ideally, but passed in like 2s because FBHC.
Rank — 450, happy for the T-shirt.
I am usually stupid, so I couldn't solve A in 20 minutes and just skipped it.
B — just DSU and knapsack, like everyone else.
C — just some standard dp problem: dp[0/1][i][j] — can we, at the i-th letter, have j blocks, with the current block starting with 0/1? Got $$$O(s \cdot h^2)$$$ complexity, which can easily be reduced to $$$O(s \cdot h \log h)$$$, but why write such a thing.
Easy 240 and T-shirt.
How to solve C in $$$O(s \cdot h \log h)$$$? Do you use some extra greedy strategy in this solution compared to $$$O(s \cdot h^2)$$$?
yes. Second get just greedy union.
How to solve "Seafood" ?
For each pair of clams, if $$$p_i < p_j$$$ and $$$h_i < h_j$$$, then you can delete clam $$$i$$$.
After that, the remaining clams, sorted by position, have non-increasing heights.
So let's say $$$dp_x$$$ is the smallest cost of deleting all clams at positions $$$\leq x$$$, using only rocks at positions $$$\leq x$$$, and ending at position $$$x$$$.
To calculate $$$dp_i$$$, let's find all clams such that there are no rocks of bigger height on the segment from that clam to $$$i$$$. From there, you need to fix the maximum clam that you will delete when you go to the left and then back to the right. You should go to the left up to the first rock that is bigger than this clam.
You can optimize this $$$dp$$$ using stacks.
My greedy solution passed system test. We need a few observations for this:
In the optimal solution we would always go to the rightmost clam.
After we reach the rightmost clam, it makes sense to visit only one rock, one that is hard enough to open all still-unopened clams.
So the solution is to fix the hardest clam (suppose its value is $$$X$$$) that we may leave unopened until we reach the rightmost clam. Now we need to deal with all clams whose hardness exceeds $$$X$$$. This can be done greedily by going to the closest rock for each such clam. The only thing is that we need to account for possible overlaps when opening multiple clams with the same rock.
If we iterate over $$$X$$$ in decreasing order, we can calculate the answer for each case with total complexity $$$O(N \log N)$$$.
I created a 2D array of size n*n locally in problem B, tested on the sample input, and then downloaded the input file. Then my code wouldn't run for more than 9 cases. What to do? Clock ticking! I couldn't figure out that a 4000*4000 array can't be declared locally and has to be global. So I just submitted anything, knowing that it was going to fail eventually. Also, they don't allow resubmission, so I sat there waiting for the contest to get over to check my solution (which turned out to be correct). I absolutely hate this 6-minute thing!
Use this comment as a (Remove-the-timer) button.
Why so much hate? Seems like everyone loves the timer.
C can actually be solved in $$$O(hs)$$$ per testcase. I presume the majority of people did $$$O(h^2s)$$$, but it becomes a much more interesting question to make it faster.
Can you elaborate a bit?
Actually B can be solved with a much better complexity as well, as pointed out to me by Radewoosh and mnbvmar. $$$O(n \log^2 n)$$$, I think.
How to solve it faster than $$$O(n^2)$$$ after building the graph? This problem seems to be equivalent to knapsack problem.
FFT + Divide & Conquer
How to solve knapsack with FFT + D&C?
Divide all elements into two parts, solve them recursively.
After that, multiply the polynomials from the left and the right halves to get the answer for all elements.
And presumably the divide-and-conquer also allows the partition to be reconstructed efficiently. Once you know the sum you want, you can scan the two input polynomials to determine a feasible sum from each half, and then recursively reconstruct those two sums. Nice!
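For concreteness, here is a hedged sketch of that divide & conquer together with the reconstruction just described. The polynomial "multiplication" is left as a naive OR-convolution, so the sketch itself is quadratic; swapping in FFT/NTT (and using the fact that in B the weights are component sizes summing to at most $$$n$$$) is what gives the $$$O(n \log^2 n)$$$ mentioned above.

```cpp
#include <bits/stdc++.h>
using namespace std;

vector<int> w;  // item weights (in B, the component sizes, which sum to at most n)

// 0/1 vector over sums: which totals are achievable using items in [lo, hi).
vector<char> achievable(int lo, int hi) {
    if (hi - lo == 1) {
        vector<char> res(w[lo] + 1, 0);
        res[0] = res[w[lo]] = 1;
        return res;
    }
    int mid = (lo + hi) / 2;
    vector<char> L = achievable(lo, mid), R = achievable(mid, hi);
    vector<char> res(L.size() + R.size() - 1, 0);
    for (int a = 0; a < (int)L.size(); a++)    // naive OR-convolution; replace with FFT/NTT
        if (L[a])
            for (int b = 0; b < (int)R.size(); b++)
                if (R[b]) res[a + b] = 1;
    return res;
}

// Recover one subset of items in [lo, hi) whose weights sum to `target`.
// Recomputing achievable() here keeps the sketch short; caching the per-node
// vectors from the forward pass avoids the repeated work.
void reconstruct(int lo, int hi, int target, vector<int>& out) {
    if (hi - lo == 1) {
        if (target > 0) out.push_back(lo);     // target is either 0 or w[lo]
        return;
    }
    int mid = (lo + hi) / 2;
    vector<char> L = achievable(lo, mid), R = achievable(mid, hi);
    for (int a = 0; a < (int)L.size(); a++) {
        int b = target - a;
        if (L[a] && b >= 0 && b < (int)R.size() && R[b]) {
            reconstruct(lo, mid, a, out);      // split the target between the two halves
            reconstruct(mid, hi, b, out);
            return;
        }
    }
}

int main() {
    int n;
    cin >> n;
    w.resize(n);
    for (auto& x : w) cin >> x;
    vector<char> all = achievable(0, n);
    int total = accumulate(w.begin(), w.end(), 0);
    int target = 0;                            // e.g. the largest achievable sum <= total/2
    for (int s = 0; s <= total / 2; s++)
        if (all[s]) target = s;
    vector<int> chosen;
    reconstruct(0, n, target, chosen);         // indices of one subset summing to `target`
}
```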
Yeah, that's knapsack, but you can significantly optimize this knapsack in at least three ways :). The first one is exactly as 300iq said. However, there is an alternative approach where you observe that the sum of the sizes of the items is at most n, hence there are at most $$$O(\sqrt{n})$$$ different sizes, and you can use that to do it in $$$O(n^{3/2})$$$, which shouldn't be slower. And you can divide the brute force's complexity by 64 using bitsets as well (and you can even restore the result).
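A feasibility-only sketch of that second idea (bounded knapsack over the at most $$$O(\sqrt{n})$$$ distinct sizes, $$$O(n)$$$ work per distinct size), with reconstruction left out for brevity; it could be added by remembering how each sum was first reached, as discussed further down in the thread:

```cpp
#include <bits/stdc++.h>
using namespace std;

// The component sizes sum to at most n, so there are at most O(sqrt(n))
// distinct values; handling each distinct value in O(total) gives O(n^{3/2}).
int main() {
    int n;
    cin >> n;
    vector<int> w(n);
    for (auto& x : w) cin >> x;
    int total = accumulate(w.begin(), w.end(), 0);

    map<int, int> cnt;                               // distinct size -> multiplicity
    for (int x : w) cnt[x]++;

    vector<char> can(total + 1, 0);
    can[0] = 1;
    for (auto [s, c] : cnt) {
        // used[j] = copies of size s spent so far (in this round) to reach sum j
        vector<int> used(total + 1, 0);
        for (int j = s; j <= total; j++) {
            if (can[j]) continue;                    // already reachable without another copy
            if (can[j - s] && used[j - s] < c) {     // spend one more copy of s
                can[j] = 1;
                used[j] = used[j - s] + 1;
            }
        }
    }
    // can[t] now tells whether some subset of components has total size t.
}
```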
How do you restore the result if you use bitsets?
Like in the solution with FFT, find which sum you need to achieve on the left and which on the right, and proceed recursively, bounding the size of the bitset.
I see, so you'd need to use divide-and-conquer rather than just adding one item at a time; and presumably also use a dynamically-sized bitset rather than std::bitset.
If your items have weights $$$w_i$$$, let $$$DP[i]$$$ be the bitset of weights you can achieve with the first $$$i$$$ items, and compute $$$DP[i+1] = DP[i] \mid (DP[i] \ll w_i)$$$. Then you can reconstruct by walking back through the $$$DP$$$ table:
Start at $$$DP[n][j]$$$ where $$$j$$$ is the optimal possible weight. To reconstruct from $$$DP[i+1][j]$$$, simply check whether $$$DP[i][j]$$$ is set. If yes, go to $$$DP[i][j]$$$; otherwise go to $$$DP[i][j - w_i]$$$ and add the $$$i$$$-th item to the answer. Repeat this until you reach $$$DP[0][0]$$$.
This does require storing the whole $$$DP$$$ table. If memory is an issue, storing only $$$DP[\sqrt{n}], DP[2\sqrt{n}], \dots$$$ and recomputing the last $$$\sqrt{n}$$$ rows also works.
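A minimal std::bitset sketch of exactly this walk-back, assuming the total weight is at most 4000 (the $$$n \le 4000$$$ bound in B) and picking, for illustration, the largest achievable weight not exceeding half of the total:

```cpp
#include <bits/stdc++.h>
using namespace std;
const int MAXW = 4000;  // assumption: total weight <= 4000 (the n <= 4000 bound in B)

int main() {
    int n;
    cin >> n;
    vector<int> w(n);
    for (auto& x : w) cin >> x;
    // dp[i] = bitset of total weights achievable using the first i items
    vector<bitset<MAXW + 1>> dp(n + 1);
    dp[0][0] = 1;
    for (int i = 0; i < n; i++)
        dp[i + 1] = dp[i] | (dp[i] << w[i]);
    // Pick some achievable target weight j, e.g. the largest one not above half.
    int total = accumulate(w.begin(), w.end(), 0);
    int j = total / 2;
    while (!dp[n][j]) j--;                       // dp[n][0] is always set, so this stops
    // Walk back through the table: if dp[i-1][j] is already set, item i-1 is not
    // needed; otherwise take it and jump to weight j - w[i-1].
    vector<int> chosen;
    for (int i = n; i > 0; i--)
        if (!dp[i - 1][j]) {
            chosen.push_back(i - 1);
            j -= w[i - 1];
        }
    // `chosen` now holds the (0-indexed) items whose weights sum to the target.
}
```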
Getting $$$O\big(\frac{n^{3/2}}{64}\big)$$$ with bitset should also be possible.
I think we can do even better: let $$$last[i]$$$ be the weight you last added when you achieved total weight $$$i$$$. Initially, all values are uninitialized, and each value will be set exactly once (the first time we reach weight $$$i$$$).
Now, as you previously mentioned, we have a transition of the form $$$DP[i+1] := DP[i]\, \mathrm{or}\, (DP[i] \ll w_i)$$$. Notice however that $$$DP[i+1] \supseteq DP[i]$$$. We can therefore iterate over the set bits in $$$DP[i + 1] \setminus DP[i]$$$ to find out which new total weights become achievable with $$$w_i$$$. For these weights, set the corresponding elements of the $$$last[\star]$$$ array.
Notice that such an array allows us to recover the result easily. Moreover, we don't need to remember past rows of $$$DP[\star]$$$, so we only need $$$O(n)$$$ memory. Obviously, we get $$$O(n^2/64)$$$ time complexity, but $$$O(n\sqrt{n}/64)$$$ should be possible with this approach as well.
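A minimal std::bitset sketch of the $$$last[\star]$$$ idea under the same assumption that the total weight is at most 4000; _Find_first/_Find_next are libstdc++ extensions, and a plain scan over the difference bitset works too:

```cpp
#include <bits/stdc++.h>
using namespace std;
const int MAXW = 4000;  // assumption: total weight <= 4000, as in B

int main() {
    int n;
    cin >> n;
    vector<int> w(n);
    for (auto& x : w) cin >> x;
    bitset<MAXW + 1> dp;                         // single row, O(n) memory overall
    vector<int> last(MAXW + 1, -1);              // last[s] = item that first reached sum s
    dp[0] = 1;
    for (int i = 0; i < n; i++) {
        bitset<MAXW + 1> ndp = dp | (dp << w[i]);
        bitset<MAXW + 1> fresh = ndp ^ dp;       // sums reached for the first time now
        for (size_t s = fresh._Find_first(); s <= MAXW; s = fresh._Find_next(s))
            last[s] = i;
        dp = ndp;
    }
    // Pick an achievable target, e.g. the largest sum not above half of the total.
    int total = accumulate(w.begin(), w.end(), 0);
    int target = total / 2;
    while (!dp[target]) target--;
    // Each step uses an item with a strictly smaller index, so nothing is reused.
    vector<int> chosen;
    while (target > 0) {
        int i = last[target];
        chosen.push_back(i);
        target -= w[i];
    }
}
```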
Nice. I knew that it could be done with $$$O(n^2)$$$ memory, but hadn't thought of this approach for $$$O(n)$$$ memory.
Nah, all that you guys said is overkill (EDIT: ah, just noticed that mnbvmar already said that). When adding the $$$i$$$-th item you do $$$newdp = olddp \mid (olddp \ll w)$$$, and you just compute $$$olddp \oplus newdp$$$ and iterate over its bits to know which bits came into existence thanks to the $$$i$$$-th item. Everything is easily doable with std::bitset.
(I hope this is not widely known and will be useful to many)
I was trying the bitset approach to the knapsack subproblem of 102275B - Bitstrings as a Service discussed above, but I ran into the fact that shift operations are not available on BitSet in the Java/Scala standard libraries (unlike std::bitset in C++).
I was about to write my own implementation of these operations, but then it occurred to me that I could use BigInteger instead of BitSet. It has all the necessary bit manipulation methods, including shifts. Here's my implementation in Scala (BigInt is just a wrapper around java.math.BigInteger with symbolic method names, so the same functionality is available in Java as well): 57590997
How does Facebook know where to send the t-shirt? Did we fill out address before all the rounds started or something?
You did indeed, at https://www.facebook.com/hackercup/register, but we'll also be emailing all t-shirt winners to confirm delivery information anyway.
I wouldn't be surprised if they can send it without asking my address, judging by the amount of data they collect.
The round is now available for training in the Gym: 2019 Facebook Hacker Cup, Round 2.
Congrats on qualifying!
Is Knapsack required for B, or does some greedy assignment also work?
Greedy is not optimal. For instance, with component sizes like 3, 3, 2, 2, 2, greedy in descending order (always adding to the currently smaller side) ends up with a 7/5 split, while 3+3 = 2+2+2 = 6 splits evenly.
Edit: there are other rather good heuristics, like this differencing approach, but all have cases where they can fail.
In that case, the test data is actually pretty weak. The naive greedy assignment that you describe above passes on all test cases.
Here is someone's submission from the contest, resubmitted on CF gym. http://mirror.codeforces.com/gym/102275/submission/56996579
Yeah, the test cases are pretty weak. I would guess randomly generated.
I got Top 500! NICE!! My first t-shirt ever!
mcdic orz
I didn't like A because it was optimal to guess and assume that the first problem is very easy.
And compressed input is a very bad idea in problems like D; FHC organizers should keep this in mind. I found a stack of clams with decreasing hardness (say there are $$$s$$$ of them) and implemented an $$$O(s^2)$$$ dp: for each state, I made $$$O(s)$$$ transitions to states on the right. Then I just changed that to making transitions only to the next $$$2000$$$ and the last $$$2000$$$ states, which is a standard hack against weak tests. I see now that it would have been enough to consider the next $$$16$$$ and the last $$$1$$$ state each time. Maybe it isn't a coincidence that $$$16$$$ is around $$$\log n$$$, btw.
A: Luckily there was a sample with the hunters in the corners and the prey in the center. It cleared all doubts.
It is impossible to see submission times in the standings. Only their sum (penalty) is shown. Please fix it.
Even if A is easy, I was still surprised that people could solve it in 3-4 minutes (figuring out the solution + writing the code + submitting). That's amazing :'(.
LoneFox My email id is not linked to my Facebook account, so I did not receive the mail regarding the t-shirt for coming in the top 500 in Round 2. Can you help me with this?
Also, when I now try to add my email id, I realise that it was added to some childhood account. I have deleted the childhood account, but I am still not able to add that email id to my fb account. Can you help or guide me with this as well?
Please email hackercup@fb.com, and we'll be able to help you out!