A portfolio project for undergrads? There's a 300-line http://keras.io example that does exactly this (with better performance). Anyone with some Python experience could read it and modify it. Machine learning just keeps getting easier and more accessible. https://twitter.com/NirantK/status/1443567619718582274 …
-
But also, BERT, GPT-3, etc. require hundreds of engineers, massive server farms, and millions of dollars. These models are not something a single person can build from scratch. It's like going from the Wright brothers cobbling together a glider to building an A380.
-
But just scaling the data and network size is not a research contribution (although studying scaling laws certainly is).
-
That's a strong statement. There are many ways of scaling data and network size, and plenty of counterarguments for why doing so can yield research contributions. Not to mention that solving engineering challenges can also lead to research contributions.
-
People training larger models on larger datasets are a very small subset of all researchers. Even if my university is lucky to have access to a decent number of GPUs, I cannot go in that direction without much more compute power.