US moves to evaluate AI models from Google, Microsoft, xAI

Washington: New artificial intelligence tools, particularly advanced AI models, developed by major tech firms including Google, Microsoft, and xAI, will now be tested by the US Department of Commerce before being released to the public.

The companies have voluntarily agreed to submit each AI model for evaluation through the Department’s Center for AI Standards and Innovation (CAISI). This initiative expands on earlier agreements reached during the Joe Biden administration with firms such as OpenAI and Anthropic.

Under the new framework, every AI model will be assessed for performance, safety, and potential risks. Chris Fall, Director of CAISI, said that the expanded partnerships will help scale efforts to evaluate each AI model in the public interest at a critical stage of technological development.

Microsoft — Rep. Image: Salvatore De Lellis | Pexels | Cropped by BH

AI models under review

The programme will focus on testing, collaborative research, and the development of best practices for deploying commercial AI models. Among the systems under review are;

Google’s Gemini, an advanced AI model developed by its DeepMind division, which is widely integrated across products and increasingly used in US defence operations.
Microsoft’s Copilot is another widely used AI model.
xAI’s Grok, also an AI mode, has faced public scrutiny over issues such as generating inappropriate altered images.

CAISI confirmed that it has already conducted around 40 evaluations of various AI models, including several state-of-the-art AI models that have not yet been released to the public. However, the agency did not specify which AI models were withheld.

In a corporate statement, Microsoft noted that it already conducts internal testing of each AI model, but emphasised that evaluating national security and large-scale safety risks for any AI model requires close collaboration with government authorities.

xAI’s Grok sparks outrage_Image FROM_XAI — Image Credits: X@xAI | Cropped by BH

The move to expand testing across companies signals a shift from the previously hands-off regulatory stance under Donald Trump. His administration had introduced an ‘AI Action Plan’ aimed at reducing regulatory barriers and accelerating the development of each AI model to maintain US leadership in the sector.

However, with the US military increasing its reliance on advanced AI models, and recent claims by Anthropic that it developed an AI model called Mythos considered too powerful for public release, the administration’s position appears to be evolving.

Senior officials from the Trump administration recently met with Anthropic CEO Dario Amodei to discuss concerns surrounding the deployment of highly advanced AI models. This comes as Anthropic faces a legal dispute with the US Department of Defense over its refusal to remove safety guardrails from its systems for government use.

Editor's Pick

Scientists discover DNA test for 18 early cancer detection

China’s exports plunge amidst global demand decline & rising interest rates

Weight-loss medication can reduce heart failure symptoms; study

US moves to evaluate AI models from Google, Microsoft, xAI

US FDA blocks key studies supporting vaccine safety claims

Apple to pay $250mn to settle lawsuit over iPhone AI claims

Singapore schools permit caning under new anti-bullying rules

Tesla pushes for EU-wide approval of FSD in Brussels meeting

Spirit Airlines begins shutdown after rescue deal collapse

US gas hits $4.30 as Trump says prices will soon drop

Premier Inn announces 3,800 job cuts due to rising costs

Rare comet lights up southern skies over New Zealand

Emojis in Every Message: Habit or new digital language?

Assisi displays remains of St. Francis to mark 800 years since his death

Lion DNA used to convict poachers in wildlife crime case

Beyond the Collars: International Workers’ Day redefines respect

UN raises alarm over AI-backed abuse of women online

The power of voice assistants in modern smart homes

Power Naps: Tiny breaks that make big difference

We Have

US FDA blocks key studies supporting vaccine safety claims

Apple to pay $250mn to settle lawsuit over iPhone AI claims

Singapore schools permit caning under new anti-bullying rules

Sanju Samson leads CSK to crucial IPL win over Delhi

Editor's Pick

US moves to evaluate AI models from Google, Microsoft, xAI

AI models under review

ALSO READ | Tesla pushes for EU-wide approval of FSD in Brussels meeting

Newly Updated