Tier 2 breadth coverage for Groq docs, pricing, changelog, and blog updates.
Compound Beta ( compound-beta ): 350 tokens per second (TPS) with a latency of ~4,900 ms
Added support for high , medium , and low options for reasoning_effort when using GPT-OSS models to control their reasoning output. Learn more about how to use these options to control reasoning tokens.