No Result
View All Result
  • Login
Friday, June 19, 2026
theadvisertimes.com
  • Home
  • Business
  • Financial Planning
  • Personal Finance
  • Investing
  • Money
  • Economy
  • Markets
  • Stocks
  • Trading
  • Home
  • Business
  • Financial Planning
  • Personal Finance
  • Investing
  • Money
  • Economy
  • Markets
  • Stocks
  • Trading
No Result
View All Result
theadvisertimes.com
No Result
View All Result
Home Markets

Chart of the Week: AI Is a Black Box

by theadvisertimes.com
12 hours ago
in Markets
Reading Time: 4 mins read
A A
0
Chart of the Week: AI Is a Black Box
Share on FacebookShare on TwitterShare on LInkedIn


A strange thing happened last week.

Anthropic was forced to take its newest AI models offline only days after releasing them.

The company’s new Fable 5 and Mythos 5 systems were designed to be some of the most powerful AI models ever released. But shortly after launch, researchers discovered ways to get around some of the models’ built-in safety measures.

Government officials soon got involved as fears spread that these systems could become powerful cybersecurity weapons in the wrong hands.

Maybe those concerns were justified, and maybe they weren’t.

But to me, they raise an obvious question that not enough people are asking.

How would anyone know?

What’s Inside the Box?

Modern AI systems aren’t like traditional software.

Engineers don’t sit down and write lines of code telling them exactly how to reason through a problem.

Instead, researchers train these systems and then observe their behavior.

The result is what many researchers call a black box.

We can see what goes in, and we can see what comes out.

But what happens in between is often much harder to explain.

That’s why companies like Anthropic spend so much time studying AI interpretability, or the science of understanding how these systems arrive at their conclusions.

And that brings us to this week’s chart.

Because a group of researchers recently performed a strange experiment.

They secretly modified an AI model’s internal state. Then they asked whether the model could detect that something had changed.

Image: Uzay Macar and Li Yang

This chart might look complicated, but the basic idea is simple.

Researchers injected information directly into an AI model’s internal processing, then tested whether it could tell the difference between those injections and its normal thought process.

The chart compares three versions of the same model.

The first is the Base model, the raw AI system before it receives additional training.

The second is the Instruct model, which was trained to behave more like the helpful AI assistants most people interact with today.

The third is an Abliterated version of the model, where some of the refusal and safety behaviors were removed.

The blue line shows how often the model correctly detected a real change, while the orange line shows how often it falsely claimed that something changed when nothing had actually happened.

And the results are surprising.

The Base model performed poorly. When researchers secretly altered its internal processing, it often couldn’t tell the difference between a real change and a false alarm.

But the Instruct model performed much better.

Somewhere during the additional training process, the model appears to have developed an ability to recognize when something unusual had happened inside its own processing.

And in several cases, the Abliterated model performed even better still.

In other words, removing some of the AI’s safety and refusal behaviors actually improved the model’s ability to detect what was going on inside it.

That doesn’t mean the model became conscious or self-aware.

You can compare it to a computer server that detects when someone has tampered with its memory. The server isn’t aware of anything, but it can still recognize when something unusual has happened.

Researchers believe something similar happened here.

More importantly, they think capabilities like this could eventually help us better understand what’s happening inside advanced AI systems.

After all, these models have access to information that remains largely hidden from the people studying them.

Which means one way researchers could eventually learn more about advanced AI systems is by asking the systems themselves.

That might seem counterintuitive.

But it would give researchers something they’ve never really had before.

A window into what’s happening inside the model itself.

Here’s My Take

The primary goal of the AI industry has been to build more capable models.

But another challenge is gaining urgency.

Understanding them.

The controversy surrounding Anthropic’s latest models shows why we need to get a handle on this issue sooner than later.

Because it’s one thing to build a powerful AI system. It’s something else entirely to create a new form of intelligence yet only partially understand how it works.

So here’s my question to you:

If future AI systems become too complex for humans to fully understand on their own, would you trust AI to help explain what’s happening inside other AI models?

Or does that sound like asking the fox to guard the henhouse?

I’d love to hear what you think.

Let me know at [email protected].

We won’t reveal your full name in the event we publish a response, so feel free to share your honest opinion.

Regards,

Ian King's SignatureIan KingChief Strategist, Banyan Hill Publishing



Source link

Tags: BlackBoxchartweek
ShareTweetShare
Previous Post

Illinois’ new crypto tax puts users under a burden stocks do not face

Next Post

How to prepare for 4 big risks facing any retirement plan

Related Posts

Wabtec (WAB) Has an Aftermarket and Rail-Modernization Platform Story Bigger Than a Freight Cycle Trade

Wabtec (WAB) Has an Aftermarket and Rail-Modernization Platform Story Bigger Than a Freight Cycle Trade

by theadvisertimes.com
June 18, 2026
0

Wabtec (WAB) is often grouped with rail-cycle names and treated as a way to trade freight volumes or new locomotive...

The average SpaceX buyer post-IPO is almost under water after two-day slide

The average SpaceX buyer post-IPO is almost under water after two-day slide

by theadvisertimes.com
June 18, 2026
0

SpaceX celebrates their IPO at the Nasdaq on June 12th, 2026.Adam Jeffery | CNBCThe average investor who bought SpaceX shares...

The DTI Trap: Why Traditional Financing Stops Working After Your Second Rental (And What to Do Instead)

The DTI Trap: Why Traditional Financing Stops Working After Your Second Rental (And What to Do Instead)

by theadvisertimes.com
June 18, 2026
0

In This Article This article is presented by LendingOne. You have two rentals. Both are cash-flowing and performing exactly the...

Allegiant Air Cut 61 Routes, Including Three in Las Vegas

Allegiant Air Cut 61 Routes, Including Three in Las Vegas

by theadvisertimes.com
June 18, 2026
0

Allegiant Air, which has origins in Las Vegas, has dropped three routes to the Southern Nevada city as part of...

Can You Still Succeed With Weekend Trades?

Can You Still Succeed With Weekend Trades?

by theadvisertimes.com
June 18, 2026
0

Do you think that you have to be in front of your computer nonstop to succeed as a trader? What...

Smith & Wesson Brands Q4 2026 EPS Tops Expectations by 56.5%, Revenue Up 27%

Smith & Wesson Brands Q4 2026 EPS Tops Expectations by 56.5%, Revenue Up 27%

by theadvisertimes.com
June 18, 2026
0

AlphaStreet Newsdesk powered by AlphaStreet Intelligence SWBI|EPS $0.36 vs $0.23 est (+56.5%)|Rev $178.4M vs $155.3M est (+14.9%)|Net Income $16.2M Stock...

Next Post
How to prepare for 4 big risks facing any retirement plan

How to prepare for 4 big risks facing any retirement plan

I let Chat GPT plan my workdays down to the minute for a week — the shock wasn’t my output, it was realizing how much of my old schedule had been performance

I let Chat GPT plan my workdays down to the minute for a week — the shock wasn’t my output, it was realizing how much of my old schedule had been performance

  • Trending
  • Comments
  • Latest
FIS, InvestCloud aim to help advisors connect with younger clients

FIS, InvestCloud aim to help advisors connect with younger clients

May 20, 2026
6 Hotels Where Chase’s Points Boost Yields 2.5x

6 Hotels Where Chase’s Points Boost Yields 2.5x

May 22, 2026
Buy a 0K/Year Income Stream? This Is How to Do It

Buy a $500K/Year Income Stream? This Is How to Do It

May 22, 2026
Understanding risk remains a major investor blind spot: TIAA Institute

Understanding risk remains a major investor blind spot: TIAA Institute

June 5, 2026
Anthropic’s confidential S-1 signals summer AI IPO race could heat up fast

Anthropic’s confidential S-1 signals summer AI IPO race could heat up fast

June 2, 2026
Memorial Day 2026: Take Advantage of Food Freebies, Deals

Memorial Day 2026: Take Advantage of Food Freebies, Deals

May 23, 2026
Grocery chain pays massive fine, accused of inflated price reporting

Grocery chain pays massive fine, accused of inflated price reporting

0
When Algorithms And LLMs Become Sellers, Your Commerce Strategy Must Change

When Algorithms And LLMs Become Sellers, Your Commerce Strategy Must Change

0
How to prepare for 4 big risks facing any retirement plan

How to prepare for 4 big risks facing any retirement plan

0
ETMarkets PMS Talk | Dinshaw Irani of Helios India stays away from IT, doubles down on domestic consumption amid AI disruption

ETMarkets PMS Talk | Dinshaw Irani of Helios India stays away from IT, doubles down on domestic consumption amid AI disruption

0
Market Talk – June 18, 2026

Market Talk – June 18, 2026

0
Chart of the Week: AI Is a Black Box

Chart of the Week: AI Is a Black Box

0
ETMarkets PMS Talk | Dinshaw Irani of Helios India stays away from IT, doubles down on domestic consumption amid AI disruption

ETMarkets PMS Talk | Dinshaw Irani of Helios India stays away from IT, doubles down on domestic consumption amid AI disruption

June 18, 2026
CFTC Settlement Bans Celsius Founder Mashinsky From Trading

CFTC Settlement Bans Celsius Founder Mashinsky From Trading

June 18, 2026
Trump claims Iran deal is ‘unconditional surrender’: Axios

Trump claims Iran deal is ‘unconditional surrender’: Axios

June 18, 2026
Inside Trump’s Anthropic crackdown | Fortune

Inside Trump’s Anthropic crackdown | Fortune

June 18, 2026
How Jim Rowe Filled a Shopping Desert—With Costco Returns

How Jim Rowe Filled a Shopping Desert—With Costco Returns

June 18, 2026
5 Pennsylvania Rebate Rules Seniors Should Check Before the Property Tax/Rent Deadline

5 Pennsylvania Rebate Rules Seniors Should Check Before the Property Tax/Rent Deadline

June 18, 2026
theadvisertimes.com

Get the latest news and follow the coverage of Business & Financial News, Stock Market Updates, Analysis, and more from the trusted sources.

CATEGORIES

  • Business
  • Cryptocurrency
  • Economy
  • Financial Planning
  • Investing
  • Market Analysis
  • Markets
  • Money
  • Personal Finance
  • Startups
  • Stock Market
  • Trading

LATEST UPDATES

  • ETMarkets PMS Talk | Dinshaw Irani of Helios India stays away from IT, doubles down on domestic consumption amid AI disruption
  • CFTC Settlement Bans Celsius Founder Mashinsky From Trading
  • Trump claims Iran deal is ‘unconditional surrender’: Axios
  • Our Great Privacy Policy
  • Terms of Use, Legal Notices & Disclosures
  • About Us
  • Contact Us

© Copyright 2024 All Rights Reserved
See articles for original source and related links to external sites.

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
No Result
View All Result
  • Home
  • Business
  • Financial Planning
  • Personal Finance
  • Investing
  • Money
  • Economy
  • Markets
  • Stocks
  • Trading

© Copyright 2024 All Rights Reserved
See articles for original source and related links to external sites.