command-r-plus-08-2024 is an update of Command R+ with roughly 50% higher throughput and 25% lower latency than the previous Command R+ version, on the same hardware footprint.
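To make the relative figures concrete, the short sketch below applies them to a baseline. The function name `project_performance` and the baseline values (100 tokens/s, 800 ms) are hypothetical and chosen only for illustration; they are not measured Command R+ numbers, and the stated gains are approximate.

```python
def project_performance(
    baseline_tokens_per_s: float,
    baseline_latency_ms: float,
    throughput_gain: float = 0.50,    # "roughly 50% higher throughput"
    latency_reduction: float = 0.25,  # "25% lower latency"
) -> tuple[float, float]:
    """Apply the stated relative changes to hypothetical baseline figures."""
    new_throughput = baseline_tokens_per_s * (1.0 + throughput_gain)
    new_latency = baseline_latency_ms * (1.0 - latency_reduction)
    return new_throughput, new_latency


# Hypothetical baseline for illustration only, not a benchmark result.
throughput, latency = project_performance(100.0, 800.0)
print(f"{throughput:.1f} tokens/s, {latency:.1f} ms")  # prints "150.0 tokens/s, 600.0 ms"
```

Note that the two figures are independent: a 50% throughput gain multiplies the baseline by 1.5, while a 25% latency reduction multiplies it by 0.75.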
Read the launch post here.
Use of this model is subject to Cohere's Usage Policy and SaaS Agreement.