Frequently Asked Questions (15)
When was Granite 4.0 H Small released?
Granite 4.0 H Small was released on September 22, 2025.
Who created Granite 4.0 H Small?
Granite 4.0 H Small was created by IBM.
How intelligent is Granite 4.0 H Small?
Granite 4.0 H Small scores 11 on the Artificial Analysis Intelligence Index, placing it below average among other open weight non-reasoning models of similar size (median: 12).
How fast is Granite 4.0 H Small?
Granite 4.0 H Small generates output at 432.0 tokens per second (based on the median across providers serving the model), which is well above average compared to other open weight non-reasoning models of similar size (median: 100.5 t/s).
What is the latency of Granite 4.0 H Small?
Granite 4.0 H Small has a time to first token (TTFT) of 10.20s (based on the median across providers serving the model), which is at the higher end compared to other open weight non-reasoning models of similar size (median: 1.49s).
How much does Granite 4.0 H Small cost?
Granite 4.0 H Small costs $0.06 per 1M input tokens (very competitive, median: $0.16) and $0.25 per 1M output tokens (better than average, median: $0.40), based on the median across providers serving the model.
What is Granite 4.0 H Small API pricing?
Granite 4.0 H Small costs $0.06 per 1M input tokens and $0.25 per 1M output tokens (based on the median across providers serving the model). For a blended rate (3:1 input to output ratio), this is $0.11 per 1M tokens. Pricing may vary by provider.
How verbose is Granite 4.0 H Small?
When evaluated on the Intelligence Index, Granite 4.0 H Small generated 2.3M output tokens, which is very competitive compared to other open weight non-reasoning models of similar size (median: 5.3M).
Is Granite 4.0 H Small a reasoning model?
No, Granite 4.0 H Small is not a reasoning model. It provides direct responses without extended chain-of-thought reasoning.
What input modalities does Granite 4.0 H Small support?
Granite 4.0 H Small supports text input.
What output modalities does Granite 4.0 H Small support?
Granite 4.0 H Small supports text output.
Can Granite 4.0 H Small process images?
No, Granite 4.0 H Small does not support image input. It can only process text.
Is Granite 4.0 H Small multimodal?
No, Granite 4.0 H Small is not multimodal. It only supports text input.
What is the context window of Granite 4.0 H Small?
Granite 4.0 H Small has a context window of 130k tokens. This determines how much text and conversation history the model can process in a single request.
Is Granite 4.0 H Small open source?
Yes, Granite 4.0 H Small is open weights. The model weights are publicly available and can be downloaded for self-hosting.