.@OpenAI controversy, no one (including me) talked about #ethics of using @reddit for training. It is toxic to #women #minorities. I was harassed and threatened on @mxlearn when I spoke up about #NeurIPS @Miles_Brundage @jackclarkSF can u comment on this choice? @samcharringtonhttps://twitter.com/__yoson__/status/1101593216992849920 …
-
-
Replying to @AnimaAnandkumar @OpenAI and
we didn't use reddit data, we used outbound links from reddit ranked at two stars or higher by users. Part of the reason for thinking about different release of this model is precisely because of issues like this (which occur in larger models also). We'll be sharing more here!
3 replies 1 retweet 19 likes -
Replying to @jackclarkSF @OpenAI and
Thanks. I still find it problematic.
@reddit threads have problematic links to all sorts of sites. Right thing to do would have been to choose appropriate data source to begin with. Certainly not have a big press coverage with highly biased model#responsibleAI6 replies 1 retweet 19 likes -
Replying to @AnimaAnandkumar @OpenAI and
I think the q of appropriate data sources for large-scale, generally released models is a really great topic for discussion. It seems like there's some value to the inherent heterogeneity of sources that get linked to from various social forums. What might be a good source?
2 replies 0 retweets 4 likes -
Replying to @jackclarkSF @OpenAI and
I agree, there isn't a great answer to that, given how biased internet is. Techniques like debiasing word embeddings can help remove bias to some extent. When
@OpenAI model was covered by journalists, bias was never highlighted. https://arxiv.org/abs/1607.065201 reply 1 retweet 5 likes -
Replying to @AnimaAnandkumar @jackclarkSF and
Our models need lots and lots of data so we'll just have to take whatever biased garbage is out there seems to me to have things the wrong way around though.
#nlproc#ethnlp1 reply 3 retweets 13 likes -
Replying to @emilymbender @AnimaAnandkumar and
Some of what motivated the partial release is the observation we saw this behavior in response to some prompts and the internet is a shared resource so it seems likely others will contemplate. I'm so glad this discussion is happening! We'll be sharing more here also
1 reply 0 retweets 3 likes
I agree, these discussions are great and we need to discuss about risks and ethics in #AI I hope in future, media distortion and #fakenews about #AI can be avoided. It would be far more productive to focus on these deep issues first and then think about a big press coverage
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.