Q about modeling. How do you decide between using something as a continuous variable or a dummy variable (either 0 or 1)? Let's say I'm modeling futureReturns. A potential feature is currentPriceDistance from50dayTrend. I could use a dummy: 1 if distance > thresh, else 0. @therobotjames
-
-
Assuming you are trying for a predictive regression, that is. Compressing to 0/1 is throwing away information, plus it gives you many more opportunities to overfit as you will be tempted to choose the threshold to get a good result.
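A rough sketch of both points, on made-up data rather than real prices. The names `distance`, `future_return`, `noise_return`, the 0.5 slope, and the threshold grid are all assumptions chosen for illustration. The first part compares how well the continuous feature and its 0/1 version track a target that really does depend on the distance; the second scans thresholds against a target that is pure noise, which is the "choose the threshold to get a good result" temptation in miniature.

```python
import random

def pearson(a, b):
    """Plain Pearson correlation, written out to avoid extra dependencies."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / (var_a * var_b) ** 0.5

def dummy(values, thresh):
    """Compress a continuous feature to 1.0 above the threshold, else 0.0."""
    return [1.0 if v > thresh else 0.0 for v in values]

rng = random.Random(0)
n = 10_000

# Synthetic stand-ins (assumed): a distance-from-trend feature and a return
# that depends on it linearly, plus noise.
distance = [rng.gauss(0, 1) for _ in range(n)]
future_return = [0.5 * d + rng.gauss(0, 1) for d in distance]

# Part 1: information loss from compressing to 0/1 at a fixed threshold.
r_continuous = pearson(distance, future_return)
r_dummy = pearson(dummy(distance, 0.0), future_return)

# Part 2: threshold shopping against a target with no relationship at all.
noise_return = [rng.gauss(0, 1) for _ in range(n)]
candidates = [t / 10 for t in range(-20, 21)]
r_by_thresh = {t: pearson(dummy(distance, t), noise_return) for t in candidates}
best_thresh = max(candidates, key=lambda t: abs(r_by_thresh[t]))

print(f"continuous r = {r_continuous:.3f}, dummy r = {r_dummy:.3f}")
print(f"noise target: fixed thresh r = {r_by_thresh[0.0]:.3f}, "
      f"best-of-{len(candidates)} r = {r_by_thresh[best_thresh]:.3f}")
```

The best-of-many correlation on the noise target can never be weaker than the one at a threshold fixed in advance, which is why a threshold picked by looking at results needs out-of-sample checking.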
-
The only reasons for using dummy variables are (a) you have a categorical (unordered) variable that you need to encode, or (b) the feature is essentially two-class anyway (e.g. very bimodal distribution) and you don’t think the variations around the peaks add any information.
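For case (a), a minimal sketch of encoding an unordered variable as dummies. The `sectors` labels and the `one_hot` helper are hypothetical, made up for illustration and not taken from any library.

```python
def one_hot(labels):
    """Return the sorted category list and one 0/1 row per input label."""
    categories = sorted(set(labels))
    rows = [[1 if label == c else 0 for c in categories] for label in labels]
    return categories, rows

# Hypothetical unordered feature: there is no sensible numeric ordering of
# sectors, so each one gets its own 0/1 column.
sectors = ["tech", "energy", "tech", "utilities"]
categories, encoded = one_hot(sectors)

for label, row in zip(sectors, encoded):
    print(label, row)
```

In a regression with an intercept, one of the columns is usually dropped so the dummies are not perfectly collinear with the constant term.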