May 28th, 2026

Venice.ai Change Log - May 6, 2026 - May 25, 2026

Agentic Chat

Agentic Chat is now the default Venice chat experience, with tool
use, media generation, and multi-step workflows available directly
inside a conversation. Users can ask Venice to search, reason, generate
or edit images, create videos, and continue refining the result without
jumping between separate chats.

Open Agentic Chat: https://venice.ai/chat/agent

Here's everything else we shipped.

New Models

The following models have been added to Venice:

Text Models

  • Grok Build 0.1 — Text model from xAI. Private. Available to all
    users

  • Gemini 3.5 Flash — Google DeepMind's lightweight, low-latency text
    model optimized for speed. Anonymous. Available to all users.

  • Qwen 3.7 Max — Large-scale text model from Alibaba Cloud in the
    Qwen 3.7 family. Anonymous. Available to all users.

  • Gemma 4 31B Instruct — Google's 31B-parameter open text model with
    instruction tuning. TEE/E2EE. Available to all users.

  • Qwen 3.6 35B A3B FP8 — Mixture-of-experts text model from Alibaba
    Cloud with 35B total parameters and 3B active parameters, served in
    FP8 precision for efficient inference. TEE/E2EE. Available to all
    users.

  • Gemma 4 26B A4B Uncensored — Uncensored, unfiltered
    mixture-of-experts variant of Google's Gemma 4 with 26B total
    parameters and 4B active parameters. TEE/E2EE. Available to all users.

  • Qwen3.6 35B A3B Uncensored — Uncensored, unfiltered variant of
    Alibaba Cloud's Qwen 3.6 mixture-of-experts model with 35B total
    parameters and 3B active parameters. TEE/E2EE. Available to all users.

Image & Video Models

  • Grok Imagine High Quality (SOTA) — Image generation model from xAI
    with state-of-the-art quality output. Private. Available to all users.

  • Kling V3 Standard Motion Control — Kuaishou's V3 Standard video
    model with motion control support, enabling camera and subject
    movement direction in generated videos. Anonymous. Available to all
    users.

  • Kling V3 Pro Motion Control — Kuaishou's V3 Pro video model with
    motion control support, enabling camera and subject movement direction
    in generated videos at higher quality. Anonymous. Available to Pro
    users

Audio Models

  • Lyria 3 Pro — Google DeepMind audio and music generation model
    capable of producing high-fidelity instrumental and vocal tracks.
    Anonymous. Available to all users

Model Updates

  • Qwen Image Update — Effective June 18, Qwen Image pricing will
    increase from \$0.01 to \$0.03 per generated image. The model will
    also move from height / width parameters to aspect_ratio, with support
    for: 1:1, 3:2, 16:9, 21:9, 9:16, 2:3, 3:4, and 4:5.

  • GPT Image 2 Quality Selector — New quality setting added for GPT
    Image 2 image generation.

Web App

New Features & Improvements

  • Home Page Redesign — Redesigned home page with updated layout at
    venice.ai/home

  • Default Agentic Chat — The default chat route now redirects to
    agentic chat.

  • Agentic Chat Message Editing — Messages in agentic chat can now be
    edited in place without resending.

  • Auto-Approve Video in Agentic Chat — Video generation requests in
    agentic chat are now auto-approved without requiring manual
    confirmation.

  • Drag-to-Folder for Agentic Chat — Agentic chat conversations can
    now be dragged into folders in the sidebar

  • Video Studio Enhancements — Added start-from-shared-video,
    advanced model selection, generation queue, save-to-assets, and a
    "taking longer" progress indicator in Video Studio

  • Video Download — Videos can now be downloaded directly from the
    video interface

  • Download All Videos — New "Download All Videos" button in the
    Studio gallery to batch-download all generated videos.

  • In-Browser Camera Capture — Capture photos directly from the
    browser camera in Chat, Video Studio, Image Studio, and multi-edit

  • Model Selector Enhancements — Model cards now show an agentic
    filter, code-optimized indicator, capability icons, copy-ID button,
    and context window size.

  • Image-to-Video Progress Preview — A blurred version of the source
    image is now displayed while image-to-video generation is in progress.

  • Write Code Preset — New "Write Code" preset option added to chat
    for code-focused conversations

  • Audio Reference Chips — Audio files can now be attached as
    reference chips via the chat slash menu

  • Two-Factor Authentication — Added additional second-factor
    authentication options for account security

  • Email Identity Verification — Optional email address can now be
    linked for account identity verification

  • Unified Reference Upload — Single upload button that accepts
    image, video, and audio references in one action

  • Clickable Creator Name — Creator name in the Social Feed post
    detail header now links to the creator's profile

  • Emoji Reactions — Emoji reactions are now available on social
    posts in the Social Feed

  • Pinned Featured Models — Featured Image and Video models are now
    pinned and no longer change based on the user's current model
    selection

  • Suggested Prompts for Image & Video — Pre-written prompt
    suggestions now appear in the image and video generation interfaces.

  • Tool Call UI — New visual indicator in chat showing when the model
    is executing a tool call during a conversation.

  • Consumption Limit Reset Period — Consumption limits can now be
    configured with Monthly, Daily, or Total reset periods.

  • Image Generation Metadata — Generated images now include
    generation metadata (model, prompt, settings) embedded in the file.

  • Image Model Switching — Incompatible settings such as resolution
    or aspect ratio are now automatically reset when switching between
    image models.

  • Conversation Mode Minutes Display — Remaining conversation mode
    minutes are now shown in the Storage & Limits settings page.

Wallet and Payments

  • Crypto Subscription Credit Display — Credit balance now shown on
    the crypto subscription card.

  • Credit Cost Granularity — Credit costs now displayed with
    per-100-character precision.

  • Auto Top-Up — New option to automatically replenish credits when
    balance falls below a set threshold.

  • Crypto Subscription Management — Upgrade, downgrade, or cancel
    directly from the crypto subscription card.

  • Credit Usage History — View a detailed log of past credit usage in
    the wallet section.

  • Solana Top-Up Support — Solana is now accepted as a payment method
    for credit top-ups and wallet authentication.

Mobile App

  • Agentic Chat — Introduced Agentic Chat (Chat V2), a new multi-step
    chat mode powered by an agent, with access to Venice tools and
    features.

  • APK Install Dialog — New dialog prompts Android users to install
    the native APK.

  • Referral Screen — New referral screen presented as a modal in the
    mobile app.

  • Image Settings — Added pay-per-use configuration for image
    generation in app settings.

  • Image Search — Image search results now appear as output items in
    chat.

  • Video Carousel — Video carousel now supports Agentic Chat videos.

  • Enter Key to Send — Pressing Enter now submits messages in chat.

  • Image Modal Aspect Ratio Gating — The image modal now validates
    selected aspect ratios before generation.

  • Context Search Rendering — Added inline rendering of context
    search results in chat.

API

  • Venice MCP Server — Venice MCP server is now live, available as an
    npm package and on GitHub, with 31 tools covering Chat & Embeddings,
    Image, Video, Audio, and Web capabilities via the Model Context
    Protocol.

  • Venice Video MCP — Video generation tools are now available
    through an MCP-compatible workflow for agent builders using Venice
    video capabilities.

  • x402 Solana Support — The x402 top-up endpoint and wallet
    inference auth now support Solana.

  • Agent Tooling Docs — API docs now include an Agent Tooling section
    covering Venice MCP, Skills, and the Video Harness.

  • Private Research Agent Guide — API docs now include a guide for
    building private research agents on Venice.

  • Reasoning Effort Options — The API now returns which
    reasoning_effort values each supported model accepts.

  • Max Tokens Enforcement — max_tokens is now enforced as the cap for
    the total number of tokens the model generates, including reasoning
    tokens.

  • Context-Grep Tool Access — The context-grep tool was enabled for
    public use where supported.

  • TTS Response Format Selection — The text-to-speech endpoint now
    accepts a response format parameter to specify the output audio format
    per request.

  • Voice Usage Metering — Voice conversation usage tracking switched
    from byte-counting to audio token measurement.

  • Voice Conversation Quotas — Voice conversation quotas are now
    enforced during the session instead of only after usage is processed.

  • Reference Audio URLs — OpenAPI docs now expose
    referenceaudiourls for supported audio-reference workflows.

  • Seedance R2V Audio — Seedance reference-to-video workflows can now
    use reference audio through the public API.

  • Video API Docs — API docs now clarify video Base64 behavior and
    remove misleading asset ID references.

  • Seedance 2.0 Usage Guide — API docs now include a Seedance 2.0
    guide for API users.

  • GPT Image 2 Quality Parameter — GPT Image 2 now supports a quality
    parameter with pricing tied to resolution and quality.

  • Venice Edit Multi-Edit Endpoint — Venice Edit now uses a
    multi-edit endpoint, supporting multiple image edits in a single
    request.

  • Venice Edit Resource — Venice Edit was added as an API-accessible
    image editing resource.

  • Image Edit and Upscale Timeout — Venice image edit and upscale
    requests now support longer processing timeouts.

  • Multipart File Uploads — Added a gRPC file chunk streaming client
    for multipart upload workflows.

  • Scrape Limits — Added limits for scrape API usage to make web
    extraction behavior more predictable.

  • Model Deprecation Fields — New API-only fields on model objects
    indicate deprecation date and replacement model information.

  • Deprecation Warning Headers — API responses now include
    deprecation warning headers for affected models.

  • Deprecated Model Hiding — Models are automatically removed from
    API model listings once their deprecation date is reached.

  • Dynamic Deprecation Table — API docs now generate the model
    deprecation table dynamically.

  • Private Models Download Link — API docs now include a private
    models download link.

  • Gemma 4 31B API Rate Limits — API rate limits were increased for
    google-gemma-4-31b-it

  • Claude API Rate Limits — API rate limits were increased for Claude
    Sonnet 4.6 and Claude Opus 4.6.

  • Grok Imagine V2V Pricing — Fixed duration-based pricing for Grok
    Imagine video-to-video usage.

  • Music Pricing Granularity — Character-based music pricing now uses
    finer 100-character granularity.

  • Burn Read APIs — Added API endpoints for reading token burn data.

  • Payments Endpoint — Added a missing payments endpoint used by
    wallet and subscription flows.

  • Overload Retry Headers — Overloaded image edit requests now return
    429 with Retry-After where supported.

Model Deprecations

  • Qwen 3.5 122B A10B (TEE/E2EE) — Replacement: Qwen 3.6 35B A3B FP8 (TEE/E2EE)

  • Grok Imagine Pro — Replacement: Grok Imagine High Quality.

  • Grok 4.1 Fast — Replacement: Grok 4.3.

  • GLM 5 (TEE/E2EE) — Replacement: GLM 5.1 (TEE/E2EE)

  • Qwen 3 Coder 480B — Replacement: Qwen 3 Coder 480B Turbo.

  • Kimi K2 Thinking — Replacement: Kimi K2.5.