RAG Cross Corpus Retrieval
RAG Cross Corpus Retrieval RAG Cross Corpus Retrieval is available in <a href="https://cloud.google.com/products#product-launch-stages">public preview</a>. This feature allows you to retrieve…
19 updates from Google Cloud.
RAG Cross Corpus Retrieval RAG Cross Corpus Retrieval is available in <a href="https://cloud.google.com/products#product-launch-stages">public preview</a>. This feature allows you to retrieve…
Anthropic's Claude Opus 4.7 <a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/partner-models/claude/opus-4-7">Claude Opus 4.7</a> is available in Model Garden.
Metadata search for RAG Engine Use schema-based metadata search in Vertex AI RAG Engine. You can define a metadata schema for a corpus, attach metadata to files within that corpus, and use this…
Vertex AI RAG Engine Serverless mode Vertex AI RAG Engine Serverless mode is now available in <a href="https://cloud.google.com/products#product-launch-stages">public preview</a>. Serverless mode…
Gemma 4 26B A4B IT is available as an experimental launch in Model Garden. This is an open model built by Google DeepMind. Gemma 4 models are multimodal, handling text and image input (with audio…
Veo 3.1 Lite Veo 3.1 Lite is available in <a href="https://cloud.google.com/products#product-launch-stages">public preview</a>. This release is our most cost-efficient Veo on Vertex AI model. For…
Gemini 2.5 model retirement dates updated The retirement dates for Gemini 2.5 Pro, Gemini 2.5 Flash-Lite, and Gemini 2.5 Flash have been updated to October 16, 2026. For more information, see <a…
Lyria 3 Lyria is available in <a href="https://cloud.google.com/products#product-launch-stages">public preview</a>. You can use lyria-3-pro-preview to generate 184 seconds of audio,…
Video generation GA endpoints deprecation The following table describes video generation endpoints that are deprecated and their replacements. We recommend updating your model endpoints before June…
Imagen generation GA endpoints deprecation The following table describes image generation endpoints that are deprecated and their replacements. We recommend updating your model endpoints before June…
Partner model evaluations The Gen AI evaluation service supports evaluating partner models, such as Anthropic and Llama models. For more information, see <a…
Gemini 3.1 Flash-Lite Gemini 3.1 Flash-Lite (gemini-3.1-flash-lite-preview) is available in <a href="https://cloud.google.com/products#product-launch-stages">public preview</a>. This release is our…
Video generation preview endpoints deprecation The following table describes video generation endpoints that are deprecated and their replacements. We recommend updating your model endpoints before…
Gemini 3.1 Flash Image Gemini 3.1 Flash Image (gemini-3.1-flash-image) is available in <a href="https://cloud.google.com/products#product-launch-stages">public preview</a>. This release enables…
Anthropic's Claude 3 Haiku Anthropic's Claude 3 Haiku is deprecated as of February 23, 2026 and will be shut down on August 23, 2026. For more information, see <a…
Gemini 3.1 Pro Preview <a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/models/gemini/3-1-pro">Gemini 3.1 Pro</a> is available in preview in Model Garden. Gemini 3.1 Pro is our…
Anthropic's Claude Sonnet 4.6 <a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/partner-models/claude/sonnet-4-6">Claude Sonnet 4.6</a> is available in Model Garden.
Image generation preview endpoints deprecation The following table describes image generation endpoints that are deprecated and their replacements. We recommend updating your model endpoints before…
GLM 5 is available as an experimental launch in Model Garden. This model is targeting complex systems engineering and long-horizon agentic tasks. GLM 5 is available as a managed API in Model Garden.…