I'm not sure there's a way to frame it as maximization of some value. Here's basically the axiom of the decision theory: if strategy A results in higher wealth than strategy B with probability 1 (en.wikipedia.org/wiki/Almost_su), then I prefer strategy A to strategy B.
Conversation
It turns out that under certain assumptions (most unrealistically: infinite time), having the above heuristic means that at each step, I need to maximize the rate of growth of my wealth.
2
1
But that log appears in the OUTPUT of the theory, not as an input assumption (the way it does in Bernoulli's theory of log utility of wealth).
1
1
Yes - even if you aren't *deliberately* trying to maximize the expectation of anything, it may be possible to construct a random variable X whose EV you are equivalently but *inadvertently* maximizing
(I think may be doing this fairly explicitly now)
3
6
And it may be reasonable to call this random variable X your 'utility'
1
2
I think "utility" has misleading connotations because historically it has been used to refer to subjective preferences and circumstances (like what I would be able to spend that wealth on)
2
3
In other words, log(wealth) is the ONLY utility function that is consistent with the prefers-an-almost-surely-dominant-strategy heuristic (in the model, in the long run)
2
1
to be explicit: are you saying that you are always trying to maximize ev(log(wealth)), or only in the case of infinitely repeating log-identical bets where you can only choose a single % of your wealth to bet each time?
2
1
If the outcomes are i.i.d., maximizing ev(log(wealth))) yields an optimal fixed fraction strategy.
If they are not i.i.d., with no restrictions on the distribution of the process, maximizing the conditionally expected log given current information is asymptotically optimal.
2
2
what do you mean by "optimal"?
I think (?) the following strategy will beat your proposed strategy more than 50% of the time:
a) bet 1.1x Kelly until you're in the 75th percentile outcome that Kelly would be at or the 25th percentile
b) afterwards, do Kelly
Optimal in that it induces terminal wealth W* such that E(W/W*) <=1 for all other strategies W (Cover's "Elements of Info Theory" 16.33).
Then by Markov's inequality, P(W>= tW*) <= 1/t, t>=1: an opponent can't outperform W* by a factor t with p > 1/t.
3
2
If you suggest some gamble dynamics we can simulate this and show that the Kelly strategy dominates yours in the limit, as expected.
1



