LLMs become more covertly racist with human intervention

ylai@lemmy.ml · 2 years ago

LLMs become more covertly racist with human intervention

AbouBenAdhem@lemmy.world · 2 years ago

“Feedback training teaches models to consider their racism,” says Valentin Hofmann, a researcher at the Allen Institute for AI and a coauthor on the paper. “But dialect prejudice opens a deeper level.”

Hmm… I think dialect bias is a distinct problem, which may need a separate approach that doesn’t just lump it together with racism and try to eliminate both using the same means.

Renegade@infosec.pub · 2 years ago

Nothing in the article corroborated the claim in the title that human intervention made things worse, just that the problem goes deeper.

dumpsterlid@lemmy.world · 2 years ago

Ah so the ven diagram for cops and LLMs includes more than just “bullshitters” but also now includes “hopelessly rascist”