ai/granite-docling

Verified Publisher

By Docker

•Updated 7 months ago

Granite Docling is a multimodal model for efficient document conversion.

Model

10K+

Overview Tags

ai/granite-docling repository overview

⁠Granite Docling

logo

⁠Description

Granite Docling is a multimodal Image-Text-to-Text model engineered for efficient document conversion. It preserves the core features of Docling while maintaining seamless integration with Docling Documents⁠ to ensure full compatibility.

⁠Characteristics

Attribute	Details
Provider	IBM Research
Architecture	Based on Idefics2-8B; vision encoder = siglip-base-patch16-512; LLM = Granite 165M
Cutoff date	-
Languages	English (with experimental support for Japanese, Arabic, Chinese)
Tool calling	❌
Input modalities	Text, Image
Output modalities	Text
License	Apache 2.0⁠

⁠Available model variants

Model variant	Parameters	Quantization	Context window	VRAM¹	Size
`ai/granite-docling:258M` `ai/granite-docling:258M-F16` `ai/granite-docling:latest`	258M	MOSTLY_F16	8K tokens	0.86 GiB	312.88 MB
`ai/granite-docling:258M-Q8_0`	258M	MOSTLY_Q8_0	8K tokens	0.72 GiB	166.28 MB

¹: VRAM estimated based on model characteristics.

latest → 258M

⁠Use this AI model with Docker Model Runner

docker model run ai/granite-docling

⁠Considerations

Best suited for document conversion and extraction workflows (PDF → Markdown/HTML/structured outputs).
Recommended to use through the Docling library or SDK for optimal integration and inference stability.
Supports English natively; Japanese, Arabic, and Chinese support is experimental.

Granite-Docling-258M emphasizes layout fidelity and content integrity over creative or open-ended generation. It is released under Apache 2.0 and integrates seamlessly with the Docling ecosystem for structured document AI workflows.

⁠Links

Tag summary

Recent tags

Content type

Model

Digest

sha256:229f83681…

Size

497.5 MB

Last updated

7 months ago

docker model pull ai/granite-docling

This week's pulls

Pulls:

176

Last week

Learn more⁠