Context- regarding the new Bit-M models. I'm personally excited because I know the Facebook WSL models make a big difference practically. There's an easy to miss variation that looks more practical: 2x wide 152 layer. Competitive benchmarks, way less memory needed.https://twitter.com/wightmanr/status/1263615215108870151 …
-
-
I’m going to look at their code tomorrow and see how it goes. It could be interesting to try. The only concern I have is, I use a vgg network trained on danbooru for the feature loss in deoldify and I wonder how it’ll interact or if I’ll even need it.
Thanks. Twitter will use this to make your timeline better. UndoUndo
-
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.