Machine learning is too often perceived as a kind of oracle-like power capable of predicting the unknowable and working out of distribution.
-
-
But the “proper setup” would be using a less useful target variable (i.e., article is predicted to be evergreen rather than article actually withstood the test of time). So there’s a tradeoff between these two possible setups, right?
-
There's no tradeoff here. You have to understand that such classifiers work not because they have a crystal ball, but because they latch onto superficial markers that are statistically correlated with evergreeness or not (e.g. lots of dates = not evergreen).
- Show replies
New conversation -
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.