can't simply "let it do all the job". Maybe that's something for the future. The whole issue here is either anyone (not only Github) should be allowed to train a neural network with public available data, even if that data is licensed. (+)
-
-
Antwort an @_odelucca @NoraDotCodes und
Since you're not """"""using""""""" (with a lot of quotes) it directly, it is debatable if you're violating the license. You're using it to train your neural network, but what your neural network is outputting is the result of every data that it was trained with. (+)
2 Antworten 0 Retweets 0 Gefällt mir -
Antwort an @_odelucca @NoraDotCodes und
Also, the majority of the data used to train Copilot (which came from GPT-3) is not code. The biggest part of it was the GPT-3 part that was trained with public internet content (which could also be licensed). So, you can't also state that "everything" (+)
1 Antwort 0 Retweets 0 Gefällt mir -
Antwort an @_odelucca @NoraDotCodes und
that Github used to train Copilot was potentially licensed since the majority of the training was done in the transformer (GPT-3). So, the "code" part is just the tip of the iceberg. Just to specialize the GPT-3 algorithm to write code (+)
1 Antwort 1 Retweet 1 Gefällt mir -
Antwort an @_odelucca @NoraDotCodes und
Anyway, we need to do a LOT of debate on this subject. Not only for copilot but for ANY licensed content. For example: - If we train an AI to write songs, what should we do? Since we used licensed music to train it (+)
1 Antwort 0 Retweets 0 Gefällt mir -
Antwort an @_odelucca @NoraDotCodes und
- If we train an AI to write movie scripts, what should we do? Since we used licensed movie scripts to train it - If we train an AI to take pictures, what should we do? Since we used professional pictures to train it (+)
1 Antwort 0 Retweets 0 Gefällt mir -
Antwort an @_odelucca @NoraDotCodes und
This is a WHOLE new area. We can't simplify it as "OMG! COPILOT IS WRITING THE SAME LINE AS WE CAN SEE HERE IN THIS SOFTWARE" Common, it is way more complicated than that!
1 Antwort 0 Retweets 0 Gefällt mir -
Actually, we can, and we should simplify it, because no matter how complex the AI system is, it boils down to the same end result. It is designed to provide useful recommendations, not to avoid copyright infringement, that's the problem.
1 Antwort 0 Retweets 0 Gefällt mir -
You're mixing concepts, maybe because (as you've said) you're not an expert in machine learning. Recommendations are different from predictions. What Copilot (and also GPT-3) does is predicting your expectations based on a previously stated context. (+)
1 Antwort 0 Retweets 0 Gefällt mir -
Antwort an @_odelucca @awakecoding und
As I've said, if we go to this road, we can cancel any machine learning initiative, since every ML initiative uses publicly available data, not considering if they are licensed or not.
2 Antworten 0 Retweets 1 Gefällt mir
Good.
-
-
Just a final thought. I don't have the answer for it! So, that's why I've said that this needs a LOT of debate. Since both positions could be correct. And I don't have a final opinion yet. I truly don't know if using public available licensed data is morally correct.
2 Antworten 0 Retweets 0 Gefällt mir -
Considering that translating a book in another language does not free you of the original copyright, I fail to see how GitHub Copilot can "AI translate" source code and strip the copyright. It is not a question of moral obligation, it is a question of current copyright law.
0 Antworten 0 Retweets 2 Gefällt mir
Ende der Unterhaltung
Neue Unterhaltung -
Das Laden scheint etwas zu dauern.
Twitter ist möglicherweise überlastet oder hat einen vorübergehenden Schluckauf. Probiere es erneut oder besuche Twitter Status für weitere Informationen.