ai/granite-docling

Verified Publisher

By Docker

Updated 7 months ago

Granite Docling is a multimodal model for efficient document conversion.

Model
2

10K+

ai/granite-docling repository overview

Granite Docling

logo

Description

Granite Docling is a multimodal Image-Text-to-Text model engineered for efficient document conversion. It preserves the core features of Docling while maintaining seamless integration with Docling Documents to ensure full compatibility.

Characteristics

AttributeDetails
ProviderIBM Research
ArchitectureBased on Idefics2-8B; vision encoder = siglip-base-patch16-512; LLM = Granite 165M
Cutoff date-
LanguagesEnglish (with experimental support for Japanese, Arabic, Chinese)
Tool calling
Input modalitiesText, Image
Output modalitiesText
LicenseApache 2.0

Available model variants

Model variantParametersQuantizationContext windowVRAM¹Size
ai/granite-docling:258M

ai/granite-docling:258M-F16

ai/granite-docling:latest
258MMOSTLY_F168K tokens0.86 GiB312.88 MB
ai/granite-docling:258M-Q8_0258MMOSTLY_Q8_08K tokens0.72 GiB166.28 MB

¹: VRAM estimated based on model characteristics.

latest258M

Use this AI model with Docker Model Runner

docker model run ai/granite-docling

Considerations

  • Best suited for document conversion and extraction workflows (PDF → Markdown/HTML/structured outputs).
  • Recommended to use through the Docling library or SDK for optimal integration and inference stability.
  • Supports English natively; Japanese, Arabic, and Chinese support is experimental.

Granite-Docling-258M emphasizes layout fidelity and content integrity over creative or open-ended generation. It is released under Apache 2.0 and integrates seamlessly with the Docling ecosystem for structured document AI workflows.

Tag summary

Content type

Model

Digest

sha256:229f83681

Size

497.5 MB

Last updated

7 months ago

docker model pull ai/granite-docling

This week's pulls

Pulls:

176

Last week