Dataconomy
  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Whitepapers
    • AI Models Leaderboard
  • AI toolsNEW
  • Newsletter
  • + More
    • Glossary
    • Conversations
    • Events
    • About
      • Who we are
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
Subscribe
No Result
View All Result
  • AI
  • Tech
  • Cybersecurity
  • Finance
  • DeFi & Blockchain
  • Startups
  • Gaming
Dataconomy
  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Whitepapers
    • AI Models Leaderboard
  • AI toolsNEW
  • Newsletter
  • + More
    • Glossary
    • Conversations
    • Events
    • About
      • Who we are
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
Subscribe
No Result
View All Result
Dataconomy
No Result
View All Result

Cohere’s 111B-parameter AI model can run on just two GPUs

The core architecture of Command A employs an optimized transformer design featuring three layers of sliding window attention, each with a window size of 4096 tokens

byKerem Gülen
March 17, 2025
in Artificial Intelligence, News
Home News Artificial Intelligence
Share on FacebookShare on TwitterShare on LinkedInShare on WhatsAppShare on e-mail
Google Preferred Source

Cohere has released Command A, a high-performance AI model featuring 111 billion parameters, a 256K context length, and support for 23 languages, on March 16, 2025. The model is designed for enterprise applications, promising a 50% reduction in operational costs compared to existing API-based models.

Meet Cohere Command A

Command A addresses the significant challenges posed by training and deploying large-scale AI models that often require extensive computational resources. Typical models, such as GPT-4o and DeepSeek-V3, demand up to 32 GPUs and extensive infrastructure, which poses a barrier for smaller enterprises. Command A, however, operates efficiently on just two GPUs while maintaining competitive performance levels.

The core architecture of Command A employs an optimized transformer design featuring three layers of sliding window attention, each with a window size of 4096 tokens. This structure enhances local context modeling, allowing the model to effectively manage detailed information across lengthy text inputs. Additionally, it includes a fourth layer that consists of global attention mechanisms, facilitating unrestricted token interactions throughout the entire sequence, thereby enriching its contextual understanding.

Stay Ahead of the Curve!

Don't miss out on the latest insights, trends, and analysis in the world of data, technology, and startups. Subscribe to our newsletter and get exclusive content delivered straight to your inbox.

Command A achieves a token generation rate of 156 tokens per second, which is 1.75 times faster than GPT-4o and 2.4 times faster than DeepSeek-V3. Its performance in handling instruction-following tasks, SQL queries, and retrieval-augmented generation (RAG) applications has shown exceptional accuracy in real-world evaluations, outperforming its competitors in multilingual scenarios.


Baidu just made AI cheaper: Ernie 4.5 costs 1% of GPT-4.5


The model’s multilingual capabilities extend beyond basic translation, exhibiting superior proficiency in various dialects, including Arabic, with evaluations showing enhanced contextually appropriate responses for regional dialects such as Egyptian, Saudi, Syrian, and Moroccan Arabic. This linguistic versatility is particularly beneficial for businesses operating in diverse language environments.

Performance evaluations indicate that Command A consistently outperforms its peers in fluency, faithfulness, and response utility during human assessments. It is equipped with advanced RAG capabilities that include verifiable citations, which enhance its utility for enterprise information retrieval applications. Furthermore, the model includes high-level security features designed to protect sensitive business information.

Noteworthy features of Command A include:

  • Operational efficiency on two GPUs, significantly lowering computational costs.
  • 111 billion parameters optimized for extensive text processing demands in enterprise applications.
  • Support for a 256K context length, facilitating effective processing of long-form documents.
  • Proficiency in 23 languages, ensuring high accuracy across global markets.
  • Exceptional execution in SQL, agentic tasks, and tool-based applications.
  • Private deployments being up to 50% more economical than traditional API alternatives.
  • Enterprise-grade security to safely manage sensitive data.

The introduction of Command A marks a significant advancement for businesses seeking cost-effective, efficient AI solutions that maintain robust performance standards.


Featured image credit: Kerem Gülen/Midjourney

Tags: AIGPU

Related Posts

Apple touchscreen MacBook could launch with M5 Pro chips

Apple touchscreen MacBook could launch with M5 Pro chips

June 29, 2026
Apple touchscreen MacBook could launch with M5 Pro chips

Apple touchscreen MacBook could launch with M5 Pro chips

June 29, 2026
OpenAI limits ChatGPT 5.6 access to government-approved users first

OpenAI limits ChatGPT 5.6 access to government-approved users first

June 26, 2026
Apple to skip M6 Pro and Max chips and launch M7 in 2027

Apple to skip M6 Pro and Max chips and launch M7 in 2027

June 26, 2026
IBM unveils world’s first sub-1nm chip with new nanostack architecture

IBM unveils world’s first sub-1nm chip with new nanostack architecture

June 26, 2026
Apple raises prices across Macs, iPads and home devices

Apple raises prices across Macs, iPads and home devices

June 26, 2026

LATEST NEWS

Apple touchscreen MacBook could launch with M5 Pro chips

Apple touchscreen MacBook could launch with M5 Pro chips

OpenAI limits ChatGPT 5.6 access to government-approved users first

Apple to skip M6 Pro and Max chips and launch M7 in 2027

IBM unveils world’s first sub-1nm chip with new nanostack architecture

Apple raises prices across Macs, iPads and home devices

BEST AI MODELS LEADERBOARD

See the best AI models, ranked by intelligence, benchmark results, speed and token price. Find the most suitable LLMs, Text-to-Image, Image Editing, Text-to-Speech, Text-to-Video and Image-to-Video  artificial intelligence model for your tasks and business.

LATEST TOOLS

WatchMyCompetitor

TokkingHeads

Fellow.app

Octoparse

AnyToSpeech

Vrew

Fireflies

SpeedLegal

Teachable Machine

Unriddle

Dataconomy

COPYRIGHT © DATACONOMY MEDIA GMBH, ALL RIGHTS RESERVED.

  • About
  • Imprint
  • Contact
  • Legal & Privacy

Follow Us

  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Whitepapers
    • AI Models Leaderboard
  • AI tools
  • Newsletter
  • + More
    • Glossary
    • Conversations
    • Events
    • About
      • Who we are
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
No Result
View All Result
Subscribe

This website uses cookies to improve your experience. You can choose to accept or reject them. Visit our Privacy Policy.