Anthropic's new expertise test

Brian Evergreen

Author of Autonomous Transformation (Wiley) | Former Microsoft, Accenture | Senior Advisor, Researcher, and Keynote Speaker

Published Oct 23, 2024

Thanks for reading my monthly newsletter for leading in the era of AI. If you want to read more from me, I share ideas, frameworks, and more twice weekly on my Substack here.

Anthropic's new 'Computer use' capability

Yesterday, Anthropic's announced their new 'Computer use' capability, and the internet is buzzing with hype about it. In case you missed it, here's one of the demo videos:

How this is the perfect test for expertise:

To any expert in AI, automation, and software engineering, this capability is an inefficient, error-prone, less secure way to use AI for automation.

The demo for copy-pasting data (a second video you can watch here) is less efficient and less reliable than SharePoint workflows we experts built back in 2014.

To anyone who is not an expert in AI, automation, and software engineering, this is the first time they're seeing AI + automation visualized in a way they can instantly understand, so they believe this is game-changing.

Automation normally requires layers of abstraction to understand, which most people don't have the patience or bandwidth to slow down and think through (and just as often, it's difficult for automation and AI experts to explain these concepts well).

Why it's a brilliant move by Anthropic

It leverages mob mentality. There's a prevailing narrative to "Keep up with AI or be left behind"—so by demoing this capability broadly, anyone who displays skepticism will be labelled a luddite or "not with it."
It scales Reinforcement Learning from Human Feedback (RLHF). When users interact with this capability, they are supervising the model's work and providing real-time feedback, training the model to improve over time. This means customers will be paying Anthropic to train its model on millions of tasks simultaneously.

Six questions that hint at why I am bearish on this as the form factor for AI + automation

What will the compute cost be if millions of automated workflows are created on the fly by non-experts?
What happens if the model hallucinates on a workflow involving something important?
How does this scale across an enterprise?
How will IT ensure safe use?
Why wouldn't we want to use natural language to infer intent and then leverage APIs to achieve more reliable, secure outcomes?

Why I'm so excited about this update

No other update from the AI pure plays has provided this subtle of an opportunity to see if the people you're following online or paying to advise you actually know what they're talking about. In other words, are they your AI Indiana Jones ready to lead you into the jungle or just someone in costume?

If you want to assess their credibility, just ask them their take on the new Anthropic Computer use capability.

If they say it's game-changing, you've seen what you need to see.

Reach out if you need recommendations for real experts.

Thanks for reading,

Brian

Whenever you're ready, here are 3 ways I can help you:

1. Keynote Speaking: I've briefed dozens of Fortune 500 C-suite executive teams, spoken to live, in-person audiences of more than 10,000 attendees, am a guest lecturer at Kellogg School of Management, and led panels of distinguished guests ranging from academia to public sector leaders to Fortune 500 C-suite executives.

2. Future Solving Advising: Join hundreds FORTUNE 500 C-level executives and startup founders who have leveraged my advisement on AI, the future of technology, and how to position yourself for the future in the era of AI.

3. Future Solving Workshops: Join 25+ of the FORTUNE 500 and NASA, who have positioned themselves for the future in the era of AI by leveraging new frameworks from the Future Solving Method I introduced in my book, Autonomous Transformation, to set a vision and strategy and spark action.

Future Solving

3,027 followers

+ Subscribe

Ricardo Sastre Martín

2mo

It's a great way to discard most of the 'experts' in this network as yesterday linkedin was flooded with posts saying that this was the biggest step in the history of AI ever...

2 Reactions

John Kraski

Author, The Future of Community (Wiley) I Building community to help brands grow their businesses I Former Chief Financial Officer I Only person on LinkedIn with an almond croissant named after them

2mo

Haha Brian Evergreen! Love this!

1 Reaction

Andreas Welsch

AI Advisor | Author: “AI Leadership Handbook” | Host: “What’s the BUZZ?” | Keynote Speaker

2mo

Only time (and user adoption) will tell where things are headed. I believe that agents will eventually hit several roadblocks when they move from research frameworks into enterprise environments. A lack of APIs will require agents to interact with legacy applications rather quickly, especially in multi-vendor scenarios. I believe the idea of combining agents with UI-level interaction is a necessity—the question is whether the agent needs to have this capability or if organizations are better served using RPA for the proven, routine tasks in a process and augment it with AI...

Anthropic's new expertise test

Brian Evergreen

Author of Autonomous Transformation (Wiley) | Former Microsoft, Accenture | Senior Advisor, Researcher, and Keynote Speaker

Anthropic's new 'Computer use' capability

How this is the perfect test for expertise:

Why it's a brilliant move by Anthropic

Six questions that hint at why I am bearish on this as the form factor for AI + automation

Recommended by LinkedIn

Why I'm so excited about this update

Future Solving

3,027 followers

More articles by this author

Insights from the community

Others also viewed

Dataiku Customers Take the Lead

AI without filter - Use Copilot for M365 to prepare your organization for AI

Strategies for CIOs to Scale and Transform GenAI Pilots into Business Value

Automation Tomorrow #90

Digital Transformation Projects in Companies: Navigating the Complexities of Automation, Replacement, and Augmentation

Tools, Tips and Tech Trends To Keep You Ahead the Curve

Introducing the Semantic Kernel Process Library: A New Era of AI Workflow Orchestration

Revolutionize Your Workflow with Your Own Version of Copilot for Insights

The Sci-Fi Framework: A Guide to Prompting LLMs

AiThority Daily Newsletter

Explore topics

Anthropic's new 'Computer use' capability

How this is the perfect test for expertise:

Why it's a brilliant move by Anthropic

Six questions that hint at why I am bearish on this as the form factor for AI + automation

Recommended by LinkedIn

Why I'm so excited about this update

Future Solving

3,027 followers

My honest take on Agentforce

Dec 5, 2024

How to not leave people behind in the era of AI

Sep 25, 2024

AI has a trust problem.

May 21, 2024

How to become an AI Influencer

Apr 5, 2024

Strategy in the era of AI

Mar 1, 2024

5 Predictions for 2024

Dec 29, 2023

Does AI pose an existential threat to humans?

Nov 12, 2023

What is the job of a business leader during a crisis?

Oct 13, 2023

Who will break the ice?

Oct 6, 2023

Tool Worship in the Age of AI (Part II)

Sep 29, 2023

Insights from the community

Others also viewed

Dataiku Customers Take the Lead

AI without filter - Use Copilot for M365 to prepare your organization for AI

Strategies for CIOs to Scale and Transform GenAI Pilots into Business Value

Automation Tomorrow #90

Digital Transformation Projects in Companies: Navigating the Complexities of Automation, Replacement, and Augmentation

Tools, Tips and Tech Trends To Keep You Ahead the Curve

Introducing the Semantic Kernel Process Library: A New Era of AI Workflow Orchestration

Revolutionize Your Workflow with Your Own Version of Copilot for Insights

The Sci-Fi Framework: A Guide to Prompting LLMs

AiThority Daily Newsletter

Explore topics