Analysis of adversarial binary search game

possiblywrong.wordpress.com

61 points by RafelMri 10 months ago

patrakov 10 months ago

The exact analysis for n=13 (instead of n=100) is presented--or rather, just the answer from the exhaustive search. The answer for n=100 is still unknown.

krackers 10 months ago

Also https://news.ycombinator.com/item?id=41463330

quuxplusone 10 months ago

The punch line of this post is that if Ballmer defines the payouts so that the interviewee receives floor(lg N) dollars for hitting in one guess, then Ballmer's expected payout decreases as N increases — with a sharp step upward each time N reaches a power of two.

This is... exactly how the floor function is defined to behave!

The graphs would be much more enlightening if the artificial step function were removed. Instead of subtracting from floor(lg N), subtract from plain old lg N — or even don't subtract from anything at all, and just graph the number of guesses required.

possiblywrong 10 months ago

Author of the linked article here; you're right that just the expected number of guesses is roughly monotonic. However, that expected number of guesses (along with Ballmer's expected payout) increases, not decreases, as N increases, with a sharp step downward, not upward, each time N reaches a power of two.
The intent here is to reasonably compare hypothetical job interview questions for different N. If N=100, then Ballmer offers 6 dollars in exchange for one dollar per guess, and the linked Gukov article shows that Ballmer's expected return is negative.
What if he instead challenged an interview candidate with N=127? What about N=255, or N=511, etc.? He presumably wouldn't continue to offer only 6 dollars to play in these cases, but also presumably a similarly nice round number of dollars... and now the question/conjecture is interesting: from the updated graph, it looks like Ballmer's expected return decreases as N=2^k-1 increases, but does it always remain positive? Or is there some sufficiently large key space beyond which even an optimal guesser can't ever win on average?
To me, this question seems easier to ask when the expected return is presented in this way. That is, does each sawtooth interval always span zero expected return?

yorwba 10 months ago

See also https://bowaggoner.com/blahg/2024/09-06-adversarial-binary-s... using the Multiplicative Weight Update algorithm ( https://www.jeremykun.com/2017/02/27/the-reasonable-effectiv... a recurring feature on HN) to compute an approximate Nash equilibrium up to n = 100.

joshka 10 months ago

I don't see why this can't be solved (approximated) recursively.

If I know the EV of 1 and 2 choices, then the 3 number situation is the same as whatever weight I assign to choosing the first number multipled by Ballmer's strategy multiplied by the EV of the 1 case when I pick 2, and the 2 case if I pick 1 or 3. Run this as a Counter Factual Regret minimization problem for each value from 3 through 100 with some sort of error threshold and you have a good enough answer.

I'm not 100% sure of that though. Perhaps there's a dependency in the two steps that confounds this approach?

jstanley 10 months ago

It might be fun to play an iterated version of this game.

Also I'd be keen to see a web ui where you can play against an ai. Not quite sure how the ai would work though. You don't necessarily want the ai to play optimally (if the optimal strategy were even known) because the fun is in working out how to exploit the other player's flaws.

nullc 10 months ago

A variation on http://www.rpscontest.com/
hoseja 10 months ago

What would the fun be? Comparing the quality of each player's RNG?
- Joker_vD 10 months ago
  
  Well, some people think that mental poker [0] is fun, you know.
  [0] https://en.wikipedia.org/wiki/Mental_poker

andrewla 10 months ago

My feeling without being rigorous is that something is wrong with this solution.

Notably the size 13 case enumerated in the post, choosing 1 as the Chooser will always take 3 guesses (the most possible). Even in an iteration of a mixed strategy the Guesser will never guess 1 in fewer than 3. This is also true for 13.

rachofsunshine 10 months ago

Max guesses for n=13 is floor(log2(13)) + 1 = 4, not 3.
And in a Chooser-always-chooses-1-or-13 strategy, the Guesser has a trivial strategy ("guess 1, then guess 13") that takes much less than 3 guesses on average.
- andrewla 10 months ago
  
  You are correct -- this was my misreading of the preorder traversal.

n4r9 10 months ago

I wonder how many people failed a Ballmer interview by pointing out that his solution isn't quite right.