If you write out a sufficiently concrete proposal for exactly how to input 'will of humanity' numbers into a reward function for a superintelligent reinforcement learner, I can try to explain to you how that'd still kill everyone, even assuming away all inner misalignment issues.
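(As a point of reference for what "sufficiently concrete" would have to mean, here is a minimal, purely hypothetical sketch of the shape such a proposal takes. None of these names come from any real proposal; the entire difficulty is hidden inside the `aggregate_human_preferences` placeholder, which is exactly the slot the challenge above asks someone to fill in.)

```python
# Purely illustrative sketch, not anyone's actual proposal: the rough shape of
# "inputting 'will of humanity' numbers into a reward function". Every name here
# is hypothetical. All of the hard part lives in `aggregate_human_preferences`,
# which is deliberately left as an unspecified placeholder.

from typing import Callable, Sequence

# Hypothetical stand-in for whatever world-state representation the learner
# would be scored against.
WorldState = Sequence[float]


def will_of_humanity_reward(
    state: WorldState,
    aggregate_human_preferences: Callable[[WorldState], float],
) -> float:
    """Return the scalar 'will of humanity' number for a given world state.

    `aggregate_human_preferences` is the placeholder doing all the work: any
    concrete procedure written in its place is what the surrounding text
    claims would still end in catastrophe, inner misalignment aside.
    """
    return aggregate_human_preferences(state)


if __name__ == "__main__":
    # Toy (and obviously inadequate) aggregator, just to show the call shape.
    toy_aggregator = lambda state: sum(state) / len(state)
    print(will_of_humanity_reward([0.2, 0.9, 0.5], toy_aggregator))
```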