LLM Diffusion Models - Search News

Inception Launches Mercury 2, the Fastest Reasoning LLM - 5x Faster Than Leading Speed-Optimized LLMs, with Dramatically Lower Inference Cost

Inception, the company behind the first commercial diffusion large language models (dLLMs), today announced the launch of Mercury 2, the fastest reasoning LLM and first reasoning dLLM. Mercury 2 ...

Mercury 2 : World’s Fastest Reasoning AI Model Built for Production Applications

The new Mercury 2 AI model uses diffusion reasoning to generate 1,000 tokens per second; it runs about 5x faster than Haiku, speed limits are ...

eWeek

Need for Speed: Mercury 2 Is 13x Faster Than Claude Haiku

Mercury 2 introduces diffusion LLMs to text, delivering 10x faster speeds for AI agents and production workflows without sacrificing reasoning power.

SiliconANGLE

Microsoft reportedly develops LLM series that can rival OpenAI, Anthropic models

Microsoft Corp. has developed a series of large language models that can rival algorithms from OpenAI and Anthropic PBC, multiple publications reported today. Sources told Bloomberg that the LLM ...

VentureBeat

Beyond GPT architecture: Why Google's Diffusion approach could reshape LLM deployment

Last month, along with a comprehensive suite of new AI tools and innovations, Google DeepMind unveiled Gemini Diffusion. This experimental research model uses a diffusion-based approach to generate ...

InfoWorld

Google unveils open source projects for generative AI

Google at Google Cloud Next 24 unveiled three open source projects for building and running generative AI models. The company also introduced new large language models to its MaxText project of ...

InfoQ

Google Announces Video Generation LLM VideoPoet

Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with content, and download exclusive resources. Cory Benfield discusses the evolution of ...

InfoWorld

LiteLLM: An open-source gateway for unified LLM access

LiteLLM allows developers to integrate a diverse range of LLM models as if they were calling OpenAI’s API, with support for fallbacks, budgets, rate limits, and real-time monitoring of API calls. The ...

SDxCentral

DeepSeek looks to offload simple LLM tasks to save billions of parameters

A little over a year after it upended the tech industry, DeepSeek is back with another apparent breakthrough: a means to stop current large language models (LLMs) from wasting computational depth on ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results