AURA AI Launch Monitor
Live Stream Active

Signal Feed

Review the latest validated platform movements, model updates, and infrastructure shifts from across the AI landscape.

F

Fireworks AI

22:36 UTC • 29 Mar

Fireworks AI: Hosted Llama api schema changed

Introducing Llama 3.1 inference endpoints in partnership with Meta

Impact Intensity
risky
Details
x

xAI

22:36 UTC • 29 Mar

xAI: Grok 4 model launched

Grok 4 is the most intelligent model in the world. It includes native tool use and real-time search integration, and is available now to SuperGrok and Premium+ subscribers, as well as through the xAI API. We are also introducing a new SuperGrok Heavy tier with access to Grok 4 Heavy - the most powerful version of Grok 4.

Impact Intensity
positive
Details
x

xAI

22:36 UTC • 29 Mar

xAI: Grok 4 model launched

Grok 4.1 is now available to all users on grok.com, 𝕏, and the iOS and Android apps. It is rolling out immediately in Auto mode and can be selected explicitly as “Grok 4.1” in the model picker.

Impact Intensity
positive
Details
G

Groq

22:36 UTC • 29 Mar

Groq: Compound context window changed

Compound Beta ( compound-beta ): 350 tokens per second (TPS) with a latency of ~4,900 ms

Impact Intensity
neutral
Details
G

Groq

22:35 UTC • 29 Mar

Groq: GPT-OSS on Groq context window changed

Added support for high , medium , and low options for reasoning_effort when using GPT-OSS models to control their reasoning output. Learn more about how to use these options to control reasoning tokens.

Impact Intensity
neutral
Details
C

Cohere

22:35 UTC • 29 Mar

Cohere: Command A api schema changed

Command A is Cohere’s most performant model to date, excelling at real world enterprise tasks including tool use, retrieval augmented generation (RAG), agents, and multilingual use cases. With 111B parameters and a context length of 256K, Command A boasts a considerable increase in inference-time efficiency — 150% higher throughput compared to its

Impact Intensity
risky
Details
C

Cohere

22:35 UTC • 29 Mar

Cohere: Command A api schema changed

We’re excited to announce the release of Command A Reasoning , a hybrid reasoning model designed to excel at complex agentic tasks, in English and 22 other languages. With 111 billion parameters and a 256K context length, this model brings advanced reasoning capabilities to your applications through the familiar Command API interface.

Impact Intensity
risky
Details
C

Cohere

22:35 UTC • 29 Mar

Cohere: Command A api schema changed

Command A Translate ( command-a-translate-08-2025 ) is now available for all Cohere users through our standard API endpoints. For enterprise customers, private deployment options are available to ensure maximum security and control over your translation workflows.

Impact Intensity
risky
Details
G

Google Gemini

22:35 UTC • 29 Mar

Google Gemini: Gemini 2.5 deprecation announced

Released gemini-2.5-flash , our first stable 2.5 Flash model. To learn more, see Gemini 2.5 Flash . gemini-2.5-flash-preview-04-17 will be deprecated on July 15, 2025.

Impact Intensity
risky
Details