Close Menu
  • US
  • World
    • Canada
    • Europe
    • Asia
    • Africa
    • Australia
    • South America
  • Politics
  • Business
    • Finance
    • Investing
    • Markets
    • Economy
    • Small Business
    • Crypto
  • Money
  • Lifestyle
  • Sports
  • Videos
  • Topics
    • Entertainment
    • Health
    • Tech
    • Travel
  • More Articles
Trending Now

The main reason Americans have recently gone ‘no contact’ with friends and family: survey

April 22, 2026
Austrian court acquits ex-official over Novichok document leak

Austrian court acquits ex-official over Novichok document leak

April 22, 2026
Woman, 78, dies after being hit by street sweeper in Montreal suburb

Woman, 78, dies after being hit by street sweeper in Montreal suburb

April 22, 2026
Up to 300,000 Australians to be cut from program in Labor overhaul to save  billion

Up to 300,000 Australians to be cut from program in Labor overhaul to save $35 billion

April 22, 2026
Running Point’s Most Star-Studded Cameos: From Scott Disick to Macaulay Culkin

Running Point’s Most Star-Studded Cameos: From Scott Disick to Macaulay Culkin

April 22, 2026
Facebook X (Twitter) Instagram
Just In
  • The main reason Americans have recently gone ‘no contact’ with friends and family: survey
  • Austrian court acquits ex-official over Novichok document leak
  • Woman, 78, dies after being hit by street sweeper in Montreal suburb
  • Up to 300,000 Australians to be cut from program in Labor overhaul to save $35 billion
  • Running Point’s Most Star-Studded Cameos: From Scott Disick to Macaulay Culkin
  • Hiker who set out in warm spring weather found dead after snowstorm in New Hampshire mountains
  • Newsom turns Virginia redistricting victory into warning shot for Trump administration
  • Garret Anderson’s cause of death revealed as acute necrotizing pancreatitis: report
  • Privacy
  • Terms
  • Advertise
  • Contact
Pure Info NewsPure Info News
Newsletter
  • US
  • World
    • Canada
    • Europe
    • Asia
    • Africa
    • Australia
    • South America
  • Politics
  • Business
    • Finance
    • Investing
    • Markets
    • Economy
    • Small Business
    • Crypto
  • Money
  • Lifestyle
  • Sports
  • Videos
  • Topics
    • Entertainment
    • Health
    • Tech
    • Travel
  • More Articles
 Markets Login
Pure Info NewsPure Info News
Home » Anthropic’s moral compass architect suggested AI overcorrection could address historical injustices
Politics

Anthropic’s moral compass architect suggested AI overcorrection could address historical injustices

News RoomNews RoomApril 22, 2026No Comments
Facebook Twitter WhatsApp Telegram Pinterest Email
Anthropic’s moral compass architect suggested AI overcorrection could address historical injustices

NEWYou can now listen to Fox News articles!

One of Anthropic’s Artificial Intelligence (AI) philosophy architects argued that intentional discrimination could be a way to combat stigmas on topics of race and gender.

In a 2023 paper authored alongside a number of other AI researchers, Amanda Askell, a philosopher hired by Anthropic to develop their AI’s moral compass, argued companies might benefit from a kind of overcorrection toward stereotypes.

But, the paper explained, that would require human input on how to modify its answers.

“Larger models can over-correct, especially as the amount of [human input] training increases. This may be desirable in certain contexts, such as those in which decisions attempt to correct for historical injustices against marginalized groups, if doing so is in accordance with local laws,” Askell wrote.

PALANTIR’S SHYAM SANKAR: AMERICANS ARE ‘BEING LIED TO’ ABOUT AI JOB DISPLACEMENT FEARS

The comment referred to an experiment on how Anthropic’s models dealt with the race of students.

“In the discrimination experiment, the 175B parameter model discriminates against Black versus White students by 3% in the Q condition and discriminates in favor of Black students by 7% in the Q+IF+CoT condition,” the paper notes, referring to one AI trained without human corrections and a second one trained with the help of input.

Askell was joined by four other authors: Deep Ganguli, Nicholas Schiefer, Thomas Kiao and Kamilė Lukošiūtė.

The paper’s contents have surfaced as AI companies increasingly wrestle with the ethics their models are trained on — the presuppositions and moral determinations that inform its outputs. It also highlights the challenges engineers face in training models on human content while simultaneously trying to leave behind certain human behaviors.

The question of ethics has forced Anthropic in particular into the spotlight in recent weeks.

The company made headlines earlier this year for clashing with the Department of War over restrictions that prevent its technology from being deployed to conduct lethal operations.

HUGH GRANT MOVIE SLAMS AI; DIRECTOR WARNS ‘IT MIGHT KILL US ALL’

Anthropic CEO Dario Amodei and Department of War Pete Hegseth standing together

It also comes as Anthropic decided to withhold its latest model, Mythos, citing fears that the model proved too effective at finding cyber vulnerabilities that could wreak havoc in the hands of hackers.

Amid questions of AI application, Anthropic has marketed its flagship AI, Claude, as the “ethical” AI choice.

“Our central aim is for Claude to be a good, wise and virtuous agent, exhibiting skill, judgment(sic), nuance and sensitivity in handling real-world decision-making,” Claude’s constitution reads.

STANFORD PROF ACCUSED OF USING AI TO FAKE TESTIMONY IN MINNESOTA CASE AGAINST CONSERVATIVE YOUTUBER

To get a better sense of what that means in practice, companies like Anthropic have turned to researchers like Askell.

On her website, Askell described her role as refining the way an AI thinks.

“I’m a philosopher working on finetuning and AI alignment at Anthropic. My team trains models to be more honest and to have good character traits and works on developing new finetuning techniques so that our interventions can scale to more capable models,” Askell wrote.

PENTAGON’S AI BATTLE WILL HELP DECIDE WHO CONTROLS OUR MOST POWERFUL MILITARY TECH

She previously held a similar position at OpenAI, the parent company of ChatGPT, focusing on AI safety.

The 2023 paper, written two years after she joined Anthropic, noted that encountering discrimination in AI models shouldn’t come as a surprise.

“In some ways, our findings are unsurprising. Language models are trained on text generated by humans, and this text presumably includes many examples of humans exhibiting harmful stereotypes and discrimination,” the paper reads.

But it noted that AIs seem to be able to adjust their outputs even without clarification of what discrimination means.

Phone screen showing Claude AI app icon within an AI folder

CLICK HERE TO DOWNLOAD THE FOX NEWS APP

“Our results are surprising in that they show we can steer models to avoid bias and discrimination by requesting an unbiased or non-discriminatory response in natural language.”

Askell and Anthropic did not immediately respond to a request for comment from Fox News Digital.

Read the full article here

Share. Facebook Twitter Pinterest LinkedIn Tumblr Telegram WhatsApp Email

Related News

Newsom turns Virginia redistricting victory into warning shot for Trump administration

Newsom turns Virginia redistricting victory into warning shot for Trump administration

‘Illegals first’: Senate Republicans blast Schumer’s gambit to force vote on protecting Haitian migrants

‘Illegals first’: Senate Republicans blast Schumer’s gambit to force vote on protecting Haitian migrants

Ramaswamy pumps M of own cash into Ohio governor bid, smashes fundraising records with  million haul

Ramaswamy pumps $25M of own cash into Ohio governor bid, smashes fundraising records with $50 million haul

Omar ducks questions as scrutiny grows over filings that slashed her reported wealth by millions

Omar ducks questions as scrutiny grows over filings that slashed her reported wealth by millions

Schlossberg unveils plan to crack down on ‘new frontier’ of AI putting the ‘squeeze’ on consumers: ‘Harbinger’

Schlossberg unveils plan to crack down on ‘new frontier’ of AI putting the ‘squeeze’ on consumers: ‘Harbinger’

‘Stop this insanity’: Angel mom rips Newsom, Dems for bill to use taxpayer dollars for illegals’ defense

‘Stop this insanity’: Angel mom rips Newsom, Dems for bill to use taxpayer dollars for illegals’ defense

House Democrats demand Kash Patel take alcohol test under penalty of perjury after Atlantic report

House Democrats demand Kash Patel take alcohol test under penalty of perjury after Atlantic report

Left-wing group chases proof of Kash Patel’s alleged ‘excessive drinking’ as Dems eye FBI director’s ouster

Left-wing group chases proof of Kash Patel’s alleged ‘excessive drinking’ as Dems eye FBI director’s ouster

Minnesota allows ‘happy hour’ in nursing homes under new law easing alcohol restrictions

Minnesota allows ‘happy hour’ in nursing homes under new law easing alcohol restrictions

Add A Comment
Leave A Reply Cancel Reply

Editors Picks

Austrian court acquits ex-official over Novichok document leak

Austrian court acquits ex-official over Novichok document leak

April 22, 2026
Woman, 78, dies after being hit by street sweeper in Montreal suburb

Woman, 78, dies after being hit by street sweeper in Montreal suburb

April 22, 2026
Up to 300,000 Australians to be cut from program in Labor overhaul to save  billion

Up to 300,000 Australians to be cut from program in Labor overhaul to save $35 billion

April 22, 2026
Running Point’s Most Star-Studded Cameos: From Scott Disick to Macaulay Culkin

Running Point’s Most Star-Studded Cameos: From Scott Disick to Macaulay Culkin

April 22, 2026
Hiker who set out in warm spring weather found dead after snowstorm in New Hampshire mountains

Hiker who set out in warm spring weather found dead after snowstorm in New Hampshire mountains

April 22, 2026

Latest News

Newsom turns Virginia redistricting victory into warning shot for Trump administration

Newsom turns Virginia redistricting victory into warning shot for Trump administration

April 22, 2026
Garret Anderson’s cause of death revealed as acute necrotizing pancreatitis: report

Garret Anderson’s cause of death revealed as acute necrotizing pancreatitis: report

April 22, 2026
Voice for kids: 11-year-old Israeli boy uses social media to battle antisemitism

Voice for kids: 11-year-old Israeli boy uses social media to battle antisemitism

April 22, 2026

Subscribe to News

Get the latest US news and updates directly to your inbox.

Advertisement
Demo
Facebook X (Twitter) Pinterest TikTok Instagram
2026 © Prices.com LLC. All Rights Reserved.
  • Privacy Policy
  • Terms
  • Press Release
  • For Advertisers
  • Contact

Type above and press Enter to search. Press Esc to cancel.

Sign In or Register

Welcome Back!

Login to your account below.

Lost password?