And why would you restrict yourself to the original encoding space and to the Euclidean distance? These are completely arbitrary choices (and completely biased). There's an infinity of possibly encodings of the data you could have chosen, and an infinity of distance functions.https://twitter.com/k_saifullaah/status/1409972975835566081 …
-
-
The correct encoding and distance to use is of course the ones that are natural to the data -- that is to say, the latent manifold of the data. Which *is* an interpolative space for many problems (hence why deep learning models are able to generalize).
Show this thread -
Saying "I expect the problem to be linearly interpolative in the original encoding space" is equivalent to saying "linear regression with *no* feature engineering is enough to solve any prediction problem." Which we have known to be nonsensical since long before computers existed
Show this thread
End of conversation
New conversation -
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.