Models / Google
LLM

Gemma Instruct (2B)

2B instruct Gemma model by Google: lightweight, open, text-to-text LLM for QA, summarization, reasoning, and resource-efficient deployment.

About model

Gemma Instruct (2B) generates human-like text based on user input, excelling at following instructions and producing coherent responses. It is suitable for applications requiring controlled and context-specific text generation. Designed for developers and researchers, it provides a reliable tool for various natural language processing tasks.

To run this model, you first need to deploy it on a Dedicated Endpoint.

    Related models
    • Model provider
      Google
    • Type
      LLM
    • Main use cases
      Chat
      Small & Fast
    • Deployment
      On-Demand Dedicated
      Monthly Reserved
    • Parameters
      2B
    • Context length
      8K
    • Input modalities
      Text
    • Output modalities
      Text
    • Released
      February 8, 2024
    • Last updated
      June 12, 2025
    • Quantization level
      FP16
    • External link
    • Category
      Chat