Venice.ai Change Log - May 6, 2026

May 28th, 2026

Venice.ai Change Log - May 6, 2026 - May 25, 2026

Agentic Chat

Agentic Chat is now the default Venice chat experience, with tool
use, media generation, and multi-step workflows available directly
inside a conversation. Users can ask Venice to search, reason, generate
or edit images, create videos, and continue refining the result without
jumping between separate chats.

Open Agentic Chat: https://venice.ai/chat/agent

Here's everything else we shipped.

New Models

The following models have been added to Venice:

Text Models

Grok Build 0.1 — Text model from xAI. Private. Available to all
users
Gemini 3.5 Flash — Google DeepMind's lightweight, low-latency text
model optimized for speed. Anonymous. Available to all users.
Qwen 3.7 Max — Large-scale text model from Alibaba Cloud in the
Qwen 3.7 family. Anonymous. Available to all users.
Gemma 4 31B Instruct — Google's 31B-parameter open text model with
instruction tuning. TEE/E2EE. Available to all users.
Qwen 3.6 35B A3B FP8 — Mixture-of-experts text model from Alibaba
Cloud with 35B total parameters and 3B active parameters, served in
FP8 precision for efficient inference. TEE/E2EE. Available to all
users.
Gemma 4 26B A4B Uncensored — Uncensored, unfiltered
mixture-of-experts variant of Google's Gemma 4 with 26B total
parameters and 4B active parameters. TEE/E2EE. Available to all users.
Qwen3.6 35B A3B Uncensored — Uncensored, unfiltered variant of
Alibaba Cloud's Qwen 3.6 mixture-of-experts model with 35B total
parameters and 3B active parameters. TEE/E2EE. Available to all users.

Image & Video Models

Grok Imagine High Quality (SOTA) — Image generation model from xAI
with state-of-the-art quality output. Private. Available to all users.
Kling V3 Standard Motion Control — Kuaishou's V3 Standard video
model with motion control support, enabling camera and subject
movement direction in generated videos. Anonymous. Available to all
users.
Kling V3 Pro Motion Control — Kuaishou's V3 Pro video model with
motion control support, enabling camera and subject movement direction
in generated videos at higher quality. Anonymous. Available to Pro
users

Audio Models

Lyria 3 Pro — Google DeepMind audio and music generation model
capable of producing high-fidelity instrumental and vocal tracks.
Anonymous. Available to all users

Model Updates

Qwen Image Update — Effective June 18, Qwen Image pricing will
increase from \$0.01 to \$0.03 per generated image. The model will
also move from height / width parameters to aspect_ratio, with support
for: 1:1, 3:2, 16:9, 21:9, 9:16, 2:3, 3:4, and 4:5.
GPT Image 2 Quality Selector — New quality setting added for GPT
Image 2 image generation.

Web App

New Features & Improvements

Home Page Redesign — Redesigned home page with updated layout at
venice.ai/home
Default Agentic Chat — The default chat route now redirects to
agentic chat.
Agentic Chat Message Editing — Messages in agentic chat can now be
edited in place without resending.
Auto-Approve Video in Agentic Chat — Video generation requests in
agentic chat are now auto-approved without requiring manual
confirmation.
Drag-to-Folder for Agentic Chat — Agentic chat conversations can
now be dragged into folders in the sidebar
Video Studio Enhancements — Added start-from-shared-video,
advanced model selection, generation queue, save-to-assets, and a
"taking longer" progress indicator in Video Studio
Video Download — Videos can now be downloaded directly from the
video interface
Download All Videos — New "Download All Videos" button in the
Studio gallery to batch-download all generated videos.
In-Browser Camera Capture — Capture photos directly from the
browser camera in Chat, Video Studio, Image Studio, and multi-edit
Model Selector Enhancements — Model cards now show an agentic
filter, code-optimized indicator, capability icons, copy-ID button,
and context window size.
Image-to-Video Progress Preview — A blurred version of the source
image is now displayed while image-to-video generation is in progress.
Write Code Preset — New "Write Code" preset option added to chat
for code-focused conversations
Audio Reference Chips — Audio files can now be attached as
reference chips via the chat slash menu
Two-Factor Authentication — Added additional second-factor
authentication options for account security
Email Identity Verification — Optional email address can now be
linked for account identity verification
Unified Reference Upload — Single upload button that accepts
image, video, and audio references in one action
Clickable Creator Name — Creator name in the Social Feed post
detail header now links to the creator's profile
Emoji Reactions — Emoji reactions are now available on social
posts in the Social Feed
Pinned Featured Models — Featured Image and Video models are now
pinned and no longer change based on the user's current model
selection
Suggested Prompts for Image & Video — Pre-written prompt
suggestions now appear in the image and video generation interfaces.
Tool Call UI — New visual indicator in chat showing when the model
is executing a tool call during a conversation.
Consumption Limit Reset Period — Consumption limits can now be
configured with Monthly, Daily, or Total reset periods.
Image Generation Metadata — Generated images now include
generation metadata (model, prompt, settings) embedded in the file.
Image Model Switching — Incompatible settings such as resolution
or aspect ratio are now automatically reset when switching between
image models.
Conversation Mode Minutes Display — Remaining conversation mode
minutes are now shown in the Storage & Limits settings page.

Wallet and Payments

Crypto Subscription Credit Display — Credit balance now shown on
the crypto subscription card.
Credit Cost Granularity — Credit costs now displayed with
per-100-character precision.
Auto Top-Up — New option to automatically replenish credits when
balance falls below a set threshold.
Crypto Subscription Management — Upgrade, downgrade, or cancel
directly from the crypto subscription card.
Credit Usage History — View a detailed log of past credit usage in
the wallet section.
Solana Top-Up Support — Solana is now accepted as a payment method
for credit top-ups and wallet authentication.

Mobile App

Agentic Chat — Introduced Agentic Chat (Chat V2), a new multi-step
chat mode powered by an agent, with access to Venice tools and
features.
APK Install Dialog — New dialog prompts Android users to install
the native APK.
Referral Screen — New referral screen presented as a modal in the
mobile app.
Image Settings — Added pay-per-use configuration for image
generation in app settings.
Image Search — Image search results now appear as output items in
chat.
Video Carousel — Video carousel now supports Agentic Chat videos.
Enter Key to Send — Pressing Enter now submits messages in chat.
Image Modal Aspect Ratio Gating — The image modal now validates
selected aspect ratios before generation.
Context Search Rendering — Added inline rendering of context
search results in chat.

API

Venice MCP Server — Venice MCP server is now live, available as an
npm package and on GitHub, with 31 tools covering Chat & Embeddings,
Image, Video, Audio, and Web capabilities via the Model Context
Protocol.
Venice Video MCP — Video generation tools are now available
through an MCP-compatible workflow for agent builders using Venice
video capabilities.
x402 Solana Support — The x402 top-up endpoint and wallet
inference auth now support Solana.
Agent Tooling Docs — API docs now include an Agent Tooling section
covering Venice MCP, Skills, and the Video Harness.
Private Research Agent Guide — API docs now include a guide for
building private research agents on Venice.
Reasoning Effort Options — The API now returns which
reasoning_effort values each supported model accepts.
Max Tokens Enforcement — max_tokens is now enforced as the cap for
the total number of tokens the model generates, including reasoning
tokens.
Context-Grep Tool Access — The context-grep tool was enabled for
public use where supported.
TTS Response Format Selection — The text-to-speech endpoint now
accepts a response format parameter to specify the output audio format
per request.
Voice Usage Metering — Voice conversation usage tracking switched
from byte-counting to audio token measurement.
Voice Conversation Quotas — Voice conversation quotas are now
enforced during the session instead of only after usage is processed.
Reference Audio URLs — OpenAPI docs now expose
referenceaudiourls for supported audio-reference workflows.
Seedance R2V Audio — Seedance reference-to-video workflows can now
use reference audio through the public API.
Video API Docs — API docs now clarify video Base64 behavior and
remove misleading asset ID references.
Seedance 2.0 Usage Guide — API docs now include a Seedance 2.0
guide for API users.
GPT Image 2 Quality Parameter — GPT Image 2 now supports a quality
parameter with pricing tied to resolution and quality.
Venice Edit Multi-Edit Endpoint — Venice Edit now uses a
multi-edit endpoint, supporting multiple image edits in a single
request.
Venice Edit Resource — Venice Edit was added as an API-accessible
image editing resource.
Image Edit and Upscale Timeout — Venice image edit and upscale
requests now support longer processing timeouts.
Multipart File Uploads — Added a gRPC file chunk streaming client
for multipart upload workflows.
Scrape Limits — Added limits for scrape API usage to make web
extraction behavior more predictable.
Model Deprecation Fields — New API-only fields on model objects
indicate deprecation date and replacement model information.
Deprecation Warning Headers — API responses now include
deprecation warning headers for affected models.
Deprecated Model Hiding — Models are automatically removed from
API model listings once their deprecation date is reached.
Dynamic Deprecation Table — API docs now generate the model
deprecation table dynamically.
Private Models Download Link — API docs now include a private
models download link.
Gemma 4 31B API Rate Limits — API rate limits were increased for
google-gemma-4-31b-it
Claude API Rate Limits — API rate limits were increased for Claude
Sonnet 4.6 and Claude Opus 4.6.
Grok Imagine V2V Pricing — Fixed duration-based pricing for Grok
Imagine video-to-video usage.
Music Pricing Granularity — Character-based music pricing now uses
finer 100-character granularity.
Burn Read APIs — Added API endpoints for reading token burn data.
Payments Endpoint — Added a missing payments endpoint used by
wallet and subscription flows.
Overload Retry Headers — Overloaded image edit requests now return
429 with Retry-After where supported.

Model Deprecations

Qwen 3.5 122B A10B (TEE/E2EE) — Replacement: Qwen 3.6 35B A3B FP8 (TEE/E2EE)
Grok Imagine Pro — Replacement: Grok Imagine High Quality.
Grok 4.1 Fast — Replacement: Grok 4.3.
GLM 5 (TEE/E2EE) — Replacement: GLM 5.1 (TEE/E2EE)
Qwen 3 Coder 480B — Replacement: Qwen 3 Coder 480B Turbo.
Kimi K2 Thinking — Replacement: Kimi K2.5.