AI & ML interests
Enterprise-grade AI models
Papers
ChartNet: A Million-Scale, High-Quality Multimodal Dataset for Robust Chart Understanding
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion
IBM Granite
Granite is a family of open, enterprise-grade AI models that are performant, efficient, and trustworthy.
- Granite SLMs - Our latest language models, faster, leaner, and built for agentic workloads.
- Granite Vision - VLM with a special emphasis on document-related tasks.
- Granite Docling - Document models for enterprise document workflows.
- Granite Speech - Models for automatic speech recognition and spoken language understanding.
- Granite Embedding - High-quality embedding models for RAG and semantic search.
- Granite Guardian - Safety and content moderation models for responsible AI deployments.
- Granite Time Series - Models purpose-built for enterprise time series data.
- Granite Libraries - Libraries of adapters that supercharge a wide range of capabilities.
Resources
- Docs: ibm.com/granite/docs/models/granite
- Playground: ibm.com/granite/playground
- GitHub: github.com/ibm-granite
- ibm-granite/granite-4.1-30b
  Text Generation • 29B • Updated • 2.01k • 70
- ibm-granite/granite-4.1-8b
  Text Generation • 9B • Updated • 14.3k • 121
- ibm-granite/granite-4.1-3b
  Text Generation • 3B • Updated • 1.45k • 43
- ibm-granite/granite-4.1-30b-base
  Text Generation • 29B • Updated • 111 • 21
- ibm-granite/granite-docling-258M
  Image-Text-to-Text • 0.3B • Updated • 123k • 1.16k
- ibm-granite/granite-docling-258M-mlx
  Image-Text-to-Text • 0.3B • Updated • 3.38k • 91
- granite-docling-258M demo
  Extract and convert document content from images • 271
- Granite Docling 258M WebGPU
  Convert document images to HTML with Docling • 157
Spaces (11)
- Granite Vision Document Intelligence
  Document intelligence with Granite-Vision-4.1-4B
- Granite Speech WebGPU
  Transcribe or translate spoken audio to text in your browser
- Granite 4.0 1B Speech
  Granite 4.0 1B Speech recognition and translation demo
- Granite 4.0 Nano WebGPU
  In-browser tool calling with IBM Granite-4.0
- Granite Docling 258M WebGPU
  Convert document images to HTML with Docling