More bad news for Google’s beleaguered spinoff Jigsaw, whose flagship project is “Perspective,” a machine-learning system designed to catch and interdict harassment, hate speech and other undesirable online speech.
From the start, Perspective has been plagued by problems, but the latest one is a doozy: University of Washington experts have found that Perspective misclassifies inoffensive writing as hate speech far more frequently when the author is Black.
Specifically, candidate texts written in African American English (AAE) are 1.5x more likely to be rated as offensive than texts written in “white-aligned English.”
The authors do a pretty good job of pinpointing the cause: the people who hand-labeled the training data for the algorithm were themselves biased, and systematically mislabeled AAE writing as offensive. And since machine learning models are no better than their training data (though they are often worse!), the bias in the data propagated through the model.
In other words, Garbage In, Garbage Out remains the iron law of computing and has not been repealed by the deployment of machine learning systems.
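For a sense of what a figure like "1.5x more likely" measures, here is a minimal sketch of the arithmetic: the share of one group's texts that were labelled offensive, divided by the same share for another group. The function name, the group names and the toy labels are all made up for illustration; this is not the researchers' code or data.

```python
from collections import defaultdict

def offensive_rate_ratio(records, group_a, group_b):
    """Return (offensive rate of group_a) / (offensive rate of group_b).

    `records` is an iterable of (dialect, labelled_offensive) pairs.
    Hypothetical helper for illustration only; it assumes both groups
    appear in the data and that group_b has at least one flagged text.
    """
    flagged = defaultdict(int)
    total = defaultdict(int)
    for dialect, labelled_offensive in records:
        total[dialect] += 1
        if labelled_offensive:
            flagged[dialect] += 1
    rate_a = flagged[group_a] / total[group_a]
    rate_b = flagged[group_b] / total[group_b]
    return rate_a / rate_b

# Made-up toy labels, NOT data from the study:
# 3 of 10 "aae" texts flagged, 2 of 10 "white_aligned" texts flagged.
toy_records = (
    [("aae", True)] * 3 + [("aae", False)] * 7
    + [("white_aligned", True)] * 2 + [("white_aligned", False)] * 8
)

ratio = offensive_rate_ratio(toy_records, "aae", "white_aligned")
print(f"flag-rate ratio: {ratio:.2f}")
```

If the labels come from biased annotators, a model trained to reproduce them will tend to reproduce a ratio like this in its own predictions, which is the Garbage In, Garbage Out point above.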
We analyze racial bias in widely-used corpora of annotated toxic language, establishing correlations between annotations of offensiveness and the African American English (AAE) dialect. We show that models trained on these corpora propagate these biases, as AAE tweets are twice as likely to be labelled offensive compared to others. Finally, we introduce dialect and race priming, two ways to reduce annotator bias by highlighting the dialect of a tweet in the data annotation, and show that it significantly decreases the likelihood of AAE tweets being labelled as offensive. We find strong evidence that extra attention should be paid to the confounding effects of dialect so as to avoid unintended racial biases in hate speech detection.
The Risk of Racial Bias in Hate Speech Detection [Maarten Sap, Dallas Card, Saadia Gabriel, Yejin Choi and Noah A. Smith/University of Washington]
(via Naked Capitalism)