Ah I understand now!! How about output? Is it generally to try to model a continuous variable (futureRet), or to a logit regression to model odds of a category (TopDecileRet)? If Y is cont, confused on how to trade a model, do you adjust your exposure as expctd y changs?
-
-
Replying to @M1tchRosenthal
I exclusively try to predict continuous variables, generally either (excess) return over some period or (excess) return scaled by a volatility forecast
2 replies 0 retweets 5 likes -
Replying to @macrocephalopod @M1tchRosenthal
Following this, another example is categorical to predict as a way to get around non-linear dynamics. E.g. if you have a factor implicated in extreme conditions, it may show up as a skew in returns (24hRet < 0) but not show up properly in linear regression.
1 reply 0 retweets 3 likes -
So in this case simply checking for the categorical (a pretty naive parameter threshold of 0, not overfitting) will give you quicker insight into the value of the relationship
2 replies 0 retweets 2 likes -
I just try to make money every month call me old fashioned
3 replies 0 retweets 13 likes -
Replying to @goodalexander @nope_its_lily and
The real answer is that a binary variable is likely a shitty continuous variable you tried to flatten to make your backtest look better. Example: rising google trends binary works better than continuous bc it smooths news spikes. There are better ways to normalize (web vs search)
3 replies 0 retweets 7 likes -
Gotcha, but how do you set your exposure, do you use scaling or draw lines in the sand? Also arent there some signals that mean something diff if you scale it. For example, I believe a breakout from a range (True/False) has diff meaning than dist from range top
1 reply 0 retweets 1 like -
It’s reasonable to assume your size scales with your signal strength and it’s good to see signal strength correlated w pnl over time. For range breakout - (distance from high)/(realized vol) is normalized
2 replies 0 retweets 1 like -
I see, when done this way your variable will mostly hover around a low value and then spike when a breakout happens. However, the issue is you're assuming a bigger breakout will be more profitable. In thry, isnt it possible massive breakouts are too unstable (more likely2revers
3 replies 0 retweets 0 likes -
You can always apply a function to the underlying signal, eg tanh(x) if you want to cap large signal values, or x*e^(-x^2) to push large signal values back toward zero (many CTAs do something like this)
1 reply 0 retweets 3 likes
Be aware that you are giving yourself more opportunities to overfit by doing this though.
-
-
It helps to have a good justification for your threshold to begin with. 0 is a fairly obvious one.
0 replies 0 retweets 2 likesThanks. Twitter will use this to make your timeline better. UndoUndo
-
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.