Every ChatGPT query, every AI agent action, every generated video is based on inference. Training a model is a one-time ...
Nvidia remains dominant in chips for training large AI models, while inference has become a new front in the competition.
Microsoft has announced the launch of its latest chip, the Maia 200, which the company describes as a silicon workhorse ...
OpenAI isn’t happy with Nvidia’s AI chips anymore, especially when it comes to how fast they can answer users. The company ...
Microsoft’s new Maia 200 inference accelerator enters this overheated market as a chip that aims to cut the price ...
Positron AI, the leader in energy-efficient AI inference hardware, today announced an oversubscribed $230 million Series B financing at a post-money valuation exceeding $1 billion.
AI inference applies a trained model to new data so it can make deductions and decisions. Effective AI inference results in quicker and more accurate model responses. Evaluating AI inference focuses on speed, ...
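Since speed is the headline metric here, a minimal sketch of how inference latency is commonly measured may help; the PyTorch model, input sizes, and run counts below are illustrative placeholders, not figures from any of the reports above.

```python
# Minimal sketch: timing repeated forward passes of a (placeholder) trained model.
import time
import torch

# Stand-in for a trained model; used only to illustrate the timing pattern.
model = torch.nn.Sequential(
    torch.nn.Linear(512, 512),
    torch.nn.ReLU(),
    torch.nn.Linear(512, 10),
)
model.eval()  # inference mode: no dropout, no batch-norm updates

batch = torch.randn(32, 512)  # a batch of 32 example inputs

# Warm up so one-time setup costs don't skew the measurement.
with torch.no_grad():
    for _ in range(5):
        model(batch)

# Time repeated forward passes and report average latency per batch.
runs = 100
with torch.no_grad():
    start = time.perf_counter()
    for _ in range(runs):
        model(batch)
    elapsed = time.perf_counter() - start

print(f"avg latency: {elapsed / runs * 1000:.2f} ms per batch of 32")
```

The same pattern extends to throughput (items per second) and accuracy checks, which together are the usual axes for comparing inference hardware.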
OpenAI is reportedly looking beyond Nvidia for artificial intelligence chips, signalling a potential shift in its hardware ...
Dublin, Aug. 05, 2025 (GLOBE NEWSWIRE) -- The "AI inference - Company Evaluation Report, 2025" report has been added to ResearchAndMarkets.com's offering. The AI Inference Market Companies Quadrant is ...
Google has launched SQL-native managed inference for 180,000+ Hugging Face models in BigQuery. The preview release collapses the ML lifecycle into a unified SQL interface, eliminating the need for ...
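For readers unfamiliar with the in-warehouse inference pattern, here is a hedged sketch of issuing such a query from Python. The project, dataset, model, and column names are hypothetical, and the exact CREATE MODEL options for the new Hugging Face preview are not reproduced; only the long-standing BigQuery ML `ML.PREDICT` call is shown.

```python
# Hedged sketch: run inference inside BigQuery with plain SQL, called from Python.
# All identifiers below (project, dataset, model, columns) are placeholders.
from google.cloud import bigquery

client = bigquery.Client()  # uses application-default credentials

# ML.PREDICT executes the model inside the warehouse, so no data export
# or separate serving endpoint is needed.
sql = """
SELECT *
FROM ML.PREDICT(
  MODEL `my_project.my_dataset.hf_sentiment_model`,   -- hypothetical model name
  (SELECT review_text AS text
   FROM `my_project.my_dataset.reviews`
   LIMIT 10)
)
"""

for row in client.query(sql).result():
    print(dict(row.items()))
```

The appeal of the SQL-native approach is precisely this: the model call sits in the same query pipeline as the data it scores.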