Gemma Instruct (2B)

2B instruct Gemma model by Google: lightweight, open, text-to-text LLM for QA, summarization, reasoning, and resource-efficient deployment.

Try now

read docs

About model

Gemma Instruct (2B) generates human-like text based on user input, excelling at following instructions and producing coherent responses. It is suitable for applications requiring controlled and context-specific text generation. Designed for developers and researchers, it provides a reliable tool for various natural language processing tasks.

To run this model, you first need to deploy it on a Dedicated Endpoint.

Quickstart guides

RAG

Building a RAG Workflow

Agents

Agent Workflows

Apps

Next.js Chat Quickstart

Related models

Model specifications

Model data

Model provider
Google
Type
LLM
Chat
Main use cases
Chat
Small & Fast
Deployment
Dedicated
Parameters
2B
Context length
8K
Input modalities
Text
Output modalities
Text

Released
February 8, 2024
Last updated
June 12, 2025
Quantization level
FP16
External link
Provider docs
Category
Chat

Quickstart docs

Deploy model