Changelog

Follow new updates and improvements to Venice.ai.

May 28th, 2026

Venice.ai Change Log - May 6, 2026 - May 25, 2026

Agentic Chat

Agentic Chat is now the default Venice chat experience, with tool
use, media generation, and multi-step workflows available directly
inside a conversation. Users can ask Venice to search, reason, generate
or edit images, create videos, and continue refining the result without
jumping between separate chats.

Open Agentic Chat: https://venice.ai/chat/agent

Here's everything else we shipped.

New Models

The following models have been added to Venice:

Text Models

Grok Build 0.1 — Text model from xAI. Private. Available to all
users
Gemini 3.5 Flash — Google DeepMind's lightweight, low-latency text
model optimized for speed. Anonymous. Available to all users.
Qwen 3.7 Max — Large-scale text model from Alibaba Cloud in the
Qwen 3.7 family. Anonymous. Available to all users.
Gemma 4 31B Instruct — Google's 31B-parameter open text model with
instruction tuning. TEE/E2EE. Available to all users.
Qwen 3.6 35B A3B FP8 — Mixture-of-experts text model from Alibaba
Cloud with 35B total parameters and 3B active parameters, served in
FP8 precision for efficient inference. TEE/E2EE. Available to all
users.
Gemma 4 26B A4B Uncensored — Uncensored, unfiltered
mixture-of-experts variant of Google's Gemma 4 with 26B total
parameters and 4B active parameters. TEE/E2EE. Available to all users.
Qwen3.6 35B A3B Uncensored — Uncensored, unfiltered variant of
Alibaba Cloud's Qwen 3.6 mixture-of-experts model with 35B total
parameters and 3B active parameters. TEE/E2EE. Available to all users.

Image & Video Models

Grok Imagine High Quality (SOTA) — Image generation model from xAI
with state-of-the-art quality output. Private. Available to all users.
Kling V3 Standard Motion Control — Kuaishou's V3 Standard video
model with motion control support, enabling camera and subject
movement direction in generated videos. Anonymous. Available to all
users.
Kling V3 Pro Motion Control — Kuaishou's V3 Pro video model with
motion control support, enabling camera and subject movement direction
in generated videos at higher quality. Anonymous. Available to Pro
users

Audio Models

Lyria 3 Pro — Google DeepMind audio and music generation model
capable of producing high-fidelity instrumental and vocal tracks.
Anonymous. Available to all users

Model Updates

Qwen Image Update — Effective June 18, Qwen Image pricing will
increase from \$0.01 to \$0.03 per generated image. The model will
also move from height / width parameters to aspect_ratio, with support
for: 1:1, 3:2, 16:9, 21:9, 9:16, 2:3, 3:4, and 4:5.
GPT Image 2 Quality Selector — New quality setting added for GPT
Image 2 image generation.

Web App

New Features & Improvements

Home Page Redesign — Redesigned home page with updated layout at
venice.ai/home
Default Agentic Chat — The default chat route now redirects to
agentic chat.
Agentic Chat Message Editing — Messages in agentic chat can now be
edited in place without resending.
Auto-Approve Video in Agentic Chat — Video generation requests in
agentic chat are now auto-approved without requiring manual
confirmation.
Drag-to-Folder for Agentic Chat — Agentic chat conversations can
now be dragged into folders in the sidebar
Video Studio Enhancements — Added start-from-shared-video,
advanced model selection, generation queue, save-to-assets, and a
"taking longer" progress indicator in Video Studio
Video Download — Videos can now be downloaded directly from the
video interface
Download All Videos — New "Download All Videos" button in the
Studio gallery to batch-download all generated videos.
In-Browser Camera Capture — Capture photos directly from the
browser camera in Chat, Video Studio, Image Studio, and multi-edit
Model Selector Enhancements — Model cards now show an agentic
filter, code-optimized indicator, capability icons, copy-ID button,
and context window size.
Image-to-Video Progress Preview — A blurred version of the source
image is now displayed while image-to-video generation is in progress.
Write Code Preset — New "Write Code" preset option added to chat
for code-focused conversations
Audio Reference Chips — Audio files can now be attached as
reference chips via the chat slash menu
Two-Factor Authentication — Added additional second-factor
authentication options for account security
Email Identity Verification — Optional email address can now be
linked for account identity verification
Unified Reference Upload — Single upload button that accepts
image, video, and audio references in one action
Clickable Creator Name — Creator name in the Social Feed post
detail header now links to the creator's profile
Emoji Reactions — Emoji reactions are now available on social
posts in the Social Feed
Pinned Featured Models — Featured Image and Video models are now
pinned and no longer change based on the user's current model
selection
Suggested Prompts for Image & Video — Pre-written prompt
suggestions now appear in the image and video generation interfaces.
Tool Call UI — New visual indicator in chat showing when the model
is executing a tool call during a conversation.
Consumption Limit Reset Period — Consumption limits can now be
configured with Monthly, Daily, or Total reset periods.
Image Generation Metadata — Generated images now include
generation metadata (model, prompt, settings) embedded in the file.
Image Model Switching — Incompatible settings such as resolution
or aspect ratio are now automatically reset when switching between
image models.
Conversation Mode Minutes Display — Remaining conversation mode
minutes are now shown in the Storage & Limits settings page.

Wallet and Payments

Crypto Subscription Credit Display — Credit balance now shown on
the crypto subscription card.
Credit Cost Granularity — Credit costs now displayed with
per-100-character precision.
Auto Top-Up — New option to automatically replenish credits when
balance falls below a set threshold.
Crypto Subscription Management — Upgrade, downgrade, or cancel
directly from the crypto subscription card.
Credit Usage History — View a detailed log of past credit usage in
the wallet section.
Solana Top-Up Support — Solana is now accepted as a payment method
for credit top-ups and wallet authentication.

Mobile App

Agentic Chat — Introduced Agentic Chat (Chat V2), a new multi-step
chat mode powered by an agent, with access to Venice tools and
features.
APK Install Dialog — New dialog prompts Android users to install
the native APK.
Referral Screen — New referral screen presented as a modal in the
mobile app.
Image Settings — Added pay-per-use configuration for image
generation in app settings.
Image Search — Image search results now appear as output items in
chat.
Video Carousel — Video carousel now supports Agentic Chat videos.
Enter Key to Send — Pressing Enter now submits messages in chat.
Image Modal Aspect Ratio Gating — The image modal now validates
selected aspect ratios before generation.
Context Search Rendering — Added inline rendering of context
search results in chat.

API

Venice MCP Server — Venice MCP server is now live, available as an
npm package and on GitHub, with 31 tools covering Chat & Embeddings,
Image, Video, Audio, and Web capabilities via the Model Context
Protocol.
Venice Video MCP — Video generation tools are now available
through an MCP-compatible workflow for agent builders using Venice
video capabilities.
x402 Solana Support — The x402 top-up endpoint and wallet
inference auth now support Solana.
Agent Tooling Docs — API docs now include an Agent Tooling section
covering Venice MCP, Skills, and the Video Harness.
Private Research Agent Guide — API docs now include a guide for
building private research agents on Venice.
Reasoning Effort Options — The API now returns which
reasoning_effort values each supported model accepts.
Max Tokens Enforcement — max_tokens is now enforced as the cap for
the total number of tokens the model generates, including reasoning
tokens.
Context-Grep Tool Access — The context-grep tool was enabled for
public use where supported.
TTS Response Format Selection — The text-to-speech endpoint now
accepts a response format parameter to specify the output audio format
per request.
Voice Usage Metering — Voice conversation usage tracking switched
from byte-counting to audio token measurement.
Voice Conversation Quotas — Voice conversation quotas are now
enforced during the session instead of only after usage is processed.
Reference Audio URLs — OpenAPI docs now expose
referenceaudiourls for supported audio-reference workflows.
Seedance R2V Audio — Seedance reference-to-video workflows can now
use reference audio through the public API.
Video API Docs — API docs now clarify video Base64 behavior and
remove misleading asset ID references.
Seedance 2.0 Usage Guide — API docs now include a Seedance 2.0
guide for API users.
GPT Image 2 Quality Parameter — GPT Image 2 now supports a quality
parameter with pricing tied to resolution and quality.
Venice Edit Multi-Edit Endpoint — Venice Edit now uses a
multi-edit endpoint, supporting multiple image edits in a single
request.
Venice Edit Resource — Venice Edit was added as an API-accessible
image editing resource.
Image Edit and Upscale Timeout — Venice image edit and upscale
requests now support longer processing timeouts.
Multipart File Uploads — Added a gRPC file chunk streaming client
for multipart upload workflows.
Scrape Limits — Added limits for scrape API usage to make web
extraction behavior more predictable.
Model Deprecation Fields — New API-only fields on model objects
indicate deprecation date and replacement model information.
Deprecation Warning Headers — API responses now include
deprecation warning headers for affected models.
Deprecated Model Hiding — Models are automatically removed from
API model listings once their deprecation date is reached.
Dynamic Deprecation Table — API docs now generate the model
deprecation table dynamically.
Private Models Download Link — API docs now include a private
models download link.
Gemma 4 31B API Rate Limits — API rate limits were increased for
google-gemma-4-31b-it
Claude API Rate Limits — API rate limits were increased for Claude
Sonnet 4.6 and Claude Opus 4.6.
Grok Imagine V2V Pricing — Fixed duration-based pricing for Grok
Imagine video-to-video usage.
Music Pricing Granularity — Character-based music pricing now uses
finer 100-character granularity.
Burn Read APIs — Added API endpoints for reading token burn data.
Payments Endpoint — Added a missing payments endpoint used by
wallet and subscription flows.
Overload Retry Headers — Overloaded image edit requests now return
429 with Retry-After where supported.

Model Deprecations

Qwen 3.5 122B A10B (TEE/E2EE) — Replacement: Qwen 3.6 35B A3B FP8 (TEE/E2EE)
Grok Imagine Pro — Replacement: Grok Imagine High Quality.
Grok 4.1 Fast — Replacement: Grok 4.3.
GLM 5 (TEE/E2EE) — Replacement: GLM 5.1 (TEE/E2EE)
Qwen 3 Coder 480B — Replacement: Qwen 3 Coder 480B Turbo.
Kimi K2 Thinking — Replacement: Kimi K2.5.

May 6th, 2026

Venice.ai Change Log - April 21, 2026 - May 5, 2026

Grok 4.3 on Venice

xAI's most intelligent reasoning model is now generally available on Venice. 1M-token context window, function calling, structured outputs, and multimodal support.

Voice Mode

Realtime voice conversations are now live on Venice. Talk to any model with memory sync, chat persistence, waveform visualization, push-to-talk input, and language switching. Now available on web, iOS and Android.

GPT-5.5 on Venice

OpenAI's latest-generation model family is now available on Venice. GPT-5.5 delivers improved reasoning, stronger instruction-following, and better multi-turn conversation across the board. GPT-5.5 Pro adds extended reasoning depth and a larger context window for demanding workloads. Both models are available now.

Kling 4K Video

Kuaishou's Kling V3 and O3 video models now generate native 4K output on Venice. Available in text-to-video, image-to-video, and reference-to-video modes, Kling 4K delivers sharper detail, better motion coherence, and cinematic-quality output at four times the resolution of previous generations.

Programmatic Burn Increase

Venice has increased the programmatic burns for new subscriptions: $2 in VVV for Pro, $5 in VVV for Pro+, and $10 in VVV for Max. Every new subscription triggers a buy-and-burn at these updated amounts.

New Models

The following models have been added to Venice:

Text Models

Grok 4.3 — xAI's most intelligent reasoning model with 1M-token context window, function calling, structured outputs, and multimodal support. Available to all users.
GPT-5.5 — OpenAI's latest-generation text model with improved reasoning, instruction-following, and multi-turn conversation. Available to all users.
GPT-5.5 Pro — OpenAI's higher-capability variant of GPT-5.5 with extended reasoning depth and larger context window. Pro users only.
DeepSeek V4 Pro — DeepSeek's full-size V4 reasoning model with extended context and strong performance on coding, math, and multi-step tasks. Available to all users.
DeepSeek V4 Flash — Lighter, faster variant of DeepSeek V4 optimized for speed and lower latency while retaining strong general-purpose performance. Available to all users.
Qwen 3.6 27B — Text model from Alibaba Cloud with 27 billion parameters, offering a balance of capability and efficiency with 128K context window. Available to all users.
GLM 5.1 E2EE — Zhipu AI's GLM 5.1 running with end-to-end encryption in a Trusted Execution Environment. Available to Pro users at no additional credit cost.

Image & Video Models

Kling V3 4K — Kuaishou text-to-video at native 4K resolution. Available to all users.
Kling V3 4K R2V — Kuaishou reference-to-video at native 4K resolution. Available to all users.
Kling O3 4K — Kuaishou O3-series text-to-video at native 4K resolution. Available to all users.
Kling O3 4K I2V — Kuaishou O3-series image-to-video at native 4K resolution. Available to all users.
Kling O3 4K R2V — Kuaishou O3-series reference-to-video at native 4K resolution. Available to all users.
HappyHorse 1.0 — Alibaba's text-to-video generation model. Available to all users.
HappyHorse 1.0 I2V — Image-to-video generation from a source image. Available to all users.
HappyHorse 1.0 Reference — Video generation guided by a reference image for style and content. Available to all users.
HappyHorse 1.0 Edit — Video editing model for modifying and transforming existing video. Available to all users.
Wan 2.7 Pro Edit — Alibaba DashScope image editing model for prompt-driven edits to existing images. Available to all users.

App

Improvements

Model Explorer Redesign — Refreshed layout for the Model Explorer with improved navigation and filtering.
Recommended Model Sort — New "Recommended" sort option in the model selector, prioritizing recently used models.
Model Details Modal — Model details can now be opened directly via URL in a dedicated modal.
Model Explorer Switcher — New entry point in the model switcher to navigate directly to the Model Explorer.
Prompt Enhancement Context — The prompt enhancement wand now incorporates conversation context when rewriting prompts.
Video Auto-Compression — Oversized videos are automatically compressed client-side before upload.
Per-Class PPU Toggles — Pay-per-use confirmation can now be toggled independently for each model class in chat.
Batch Delete Warning — Batch chat delete confirmation now warns that chats will be removed from other devices too.
Select All in Chat Delete — Added "Select All" option to the chat sidebar delete menu.
Image Auto-Downsize on Share — Images larger than 25 MB are automatically downsized before sharing.
Adaptive Thinking Always On — Removed the adaptive thinking toggle. Adaptive thinking is now always enabled.
Burn Type Tooltips — Tooltips now vary by burn type, with "Bought" label shown for discretionary burns.
China Server Location Flag — China flag icon now displayed for CN server locations in model details.
Sidebar Cleanup — Removed Help & Feedback button from the sidebar app menu.
PPU Confirmation Popup — Confirmation popup now shown when a pay-per-use model is routed.
Tool Call Loading Indicator — A loading spinner now appears in agentic chat while waiting for the next tool to execute.
Unified Chat History — All v1 and v2 chat history now appears in a single combined list in the sidebar.
Rate Limit Banner — A banner now appears in the chat input area when you've hit a rate limit.
Time Sent in Info Panels — Text, image, and video info panels now display a "Time Sent" row.
Cost Management Charts — Charts on the cost management dashboard now include today's spending data.
Wide Screen Layout — Improved 2-column grid layout on wide screens for better use of available space.
Today's Spend Card — New summary card on the cost management dashboard showing today's total spend.
Chat Performance — Conversation window now uses lazy rendering for off-screen messages, reducing lag in long conversations.

Wallet and Payments

Insufficient Credits Banner — Low credit warnings now appear as a dismissible banner above the input field instead of blocking interaction.
x402 Wallet View — Added a dedicated wallet view and admin top-up panel on the user page for x402 balances.
Voice Conversation Billing — Audio duration is now tracked per voice conversation session for accurate credit billing.
Video Credit Retry — Video generation credits are now automatically retried when an initial charge amount fails.

Mobile App

Android Voice Mode — Voice mode is now available on Android, with a prompt to update to the latest app version.
Uncensored Model Badges — Video model selectors now display an "Uncensored" badge where applicable.
Wallet Connect on Sign-In — Crypto wallet connection is now available on the sign-in and sign-up screens.
Pay-Per-Use in Chat — Pay-per-use purchase dialog added to the chat screen.
Pay-Per-Use Confirmation — Added a confirmation step before completing pay-per-use purchases.
iOS Native Chat Streaming — Chat responses now stream using native iOS processing.
Android Native Chat Streaming — Chat responses now stream using native Android processing.
Background Chat Sync — Chat responses that streamed while the app was backgrounded sync upon returning to the foreground.
Tablet Image Modal — Image detail modal now uses a tablet-optimized layout.
Tablet Dialogs — Dialogs now adapt to tablet screen sizes.
Tablet Settings Layout — Settings screens support split-screen and tablet-optimized layouts.
Tablet Modal Screens — Modal presentation screens now adapt to tablet screen sizes.
Dynamic Image Sizing — Images now resize dynamically based on device orientation.
Settings Navigation — Fixed navigation behavior and renamed settings screens.
Rate Limit Display — Updated rate limit information in settings.
Image & Video Info Sizing — Fixed sizing on image and video detail screens.
Privacy Warning Layout — Improved button positioning on the privacy warning dialog.
Conversation Replay Fix — Fixed a bug where already-read responses would replay when re-entering a conversation.
Android Chat Reliability — Fixed chat dropping or failing during request timeouts and mid-stream disconnects on Android.
iOS Background Image Generation — Fixed image generation failing when the app is in the background on iOS.
Android Background Image Generation — Image generation now continues running when the app is in the background on Android.
Text File Chat Sharing — Restored the ability to share chat conversations as text files.
Image Loading Indicator — Progress border on the image loader now waits briefly before appearing to avoid flicker on fast loads.
Image Error Display — Image generation errors now appear inline within chat messages.
Pro Upgrade Prompt — Restored the Pro upgrade button in the app header.
Default Playback Speed — Changed the default text-to-speech playback speed to 1.2x.
Auto Mode Image Editing — Auto mode now supports editing images referenced in the chat conversation.

API

Venice Skills GitHub Repository — Official veniceai/skills repository now live on GitHub with example skills covering the full Venice API surface.
Voice Cloning API — New POST /v1/audio/voices endpoint for MiniMax-based voice cloning.
OpenAI-Compatible File Inputs — Chat completions endpoint now accepts file inputs using the OpenAI-compatible format.
Model Overloaded Status Code — Model overloaded errors now return HTTP 429 instead of 503.
maxtokens Strict Cap on Reasoning Models — On reasoning-capable models, maxtokens is now a strict cap on total completion tokens (visible output + reasoning), restoring Venice's prior behavior across the model fleet. maxcompletiontokens is accepted as an equivalent alias and takes precedence if both are sent.
API File Inputs GA — File input support in the API is now generally available, no longer in preview.
Context Length in /v1/models — New context_length field added to each model object in /v1/models responses.
Free User Rate Limit CTA — Free users now see a call-to-action prompt when they hit rate limits.
Voice Rate Limit Headers — Voice agent responses now report the current rate limit and reset time to connected clients.
Qwen Image Deprecation — The qwen-image model has been deprecated and removed from both the app and the API.
Image Edit Resolution Parameter — New resolution parameter available on the image edit and multi-edit API endpoints.
Voice Mode Quota — The API now returns the caller's remaining voice mode quota in responses.
Disabled API Tier — Added a "Disabled" API consumption tier that blocks API access for the account.
Chatterbox HD on /models — Chatterbox HD voice cloning model is now listed and documented on the /models endpoint.
Per-Model Daily Costs — The Activity API now returns daily cost breakdowns per model.
Hermes Agent Integration — Official Venice integration guide for Hermes Agent, the open-source self-hosted AI agent by Nous Research. Point Hermes at the Venice API for access to 230+ models across text, image, video, audio, and embeddings with persistent memory and autonomous skill creation.

Token

Programmatic Burn Increase — Venice increased the programmatic VVV burn for new subscriptions: $2 for Pro, $5 for Pro+, and $10 for Max. Every new subscription now triggers a larger automatic token burn.
Emissions Reduction — Venice completed the first of three planned emissions reductions for VVV, reducing the rate of new token issuance from 6M/yr to 5M/yr. Additional reductions planned in June and July.

Model Deprecations

Kimi K2 Thinking — Retired. Traffic routed to Kimi K2.5 via alias. Existing API requests using kimi-k2-thinking now resolve to kimi-k2-5
Qwen3 Coder 480B — Deprecated April 30, fully retired May 4. Traffic routed to Qwen3 Coder 480B Turbo. The non-turbo variant is no longer visible in API or app
Venice Uncensored 1.1 — Retired. All traffic routed to Venice Uncensored 1.2. API requests using venice-uncensored transparently resolve to 1.2
HiDream — Deprecation date extended to May 7, 2026 (from May 1). Email sent to affected API users
NEAR AI GLM 5.0 (E2EE) — Retired. All traffic routed to GLM 5.1 (E2EE)

Fixes and Improvements

Improved inpainting progress animation to reflect actual model processing time
Fixed app menu being clipped in landscape mode on iPad Safari
Updated execution time display to show milliseconds
Fixed gallery header action buttons being clipped on narrow viewports
Fixed thinking indicator disappearing during reasoning-only streaming
Removed incomplete trailing bucket from Per Period volume chart
Updated PPU model acknowledgment to trigger once per account instead of per conversation
Fixed model search returning unrelated results via subsequence matches on description and use case
Updated PPU acknowledgment to trigger once per conversation for every PPU modality
Condensed the x402 wallet balance table from 6 columns to 3
Removed the automatic greeting sent when opening a voice websocket connection
Fixed inpaint auto mode behavior after a recent regression
Improved Hunyuan 3D results to render GLB and OBJ mesh outputs directly in the viewer
Fixed rate limiting not being correctly applied to background removal and upscale for free-tier users
Improved error alert positioning and added a retry button for failed messages
Fixed incorrect provider names displayed in the model explorer
Fixed incorrect label displayed for vision models
Improved agentic mode loading indicator with an animated gradient border
Fixed audio crackling caused by inconsistent sample rate
Fixed Max button rounding instead of preserving full numerical precision
Fixed auto-enhance preference not being respected during image generation
Updated copy on the Pro upgrade call-to-action
Improved Model Selector layout by pinning the View All Models button to the bottom of the dropdown
Fixed aspect ratio selector appearing during single-image edits with Grok
Fixed moderate post modal closing when the context menu is dismissed
Improved reordered items in the user dropdown menu
Fixed arrow key navigation in image zoom following incorrect left/right order
Fixed credit balance not updating immediately after completing a chat request
Improved rendering performance for long conversations
Fixed Spotlight Search not respecting the top safe-area inset on PWA
Restored Lustify v7 model availability after prior deprecation
Fixed missing API keys silently returning empty results instead of an error
Fixed an error occurring when quoting video content in conversations
Improved image search results with lightbox preview, context menu support, and better error handling
Fixed chat message queue issues that could cause messages to be processed incorrectly
Improved context window handling with more accurate token counting, cost display tooltips, and smarter message compaction
Fixed interactions not responding correctly in the Model Explorer
Fixed temperature warning displaying at an incorrect baseline threshold
Fixed inability to send messages containing only an attachment without text
Fixed errors when using Grok 4.1 Fast with characters

April 21st, 2026

Venice.ai Change Log — March 27 – April 21, 2026

Headlines

GPT Image 2 Now on Venice

OpenAI's latest image generation model is live on Venice. Industry-leading text rendering, UI generation, and photorealism with native output up to 4K.
Generate with GPT Image 2

New Subscription Tiers & Refreshing Credits

Venice now offers three subscription levels — Pro, Pro+, and Max — each with distinct usage limits, feature access, and credit allocations. Credits refresh on a monthly basis, giving subscribers ongoing access to Venice’s 230+ models and advanced features.
Explore subscription tiers

Programmatic VVV Buy & Burn

Automatic VVV token burns now execute programmatically. Every new Pro subscription triggers a buy-and-burn, with a new tracker page displaying full burn history on-chain. This is in addition to the monthly discretionary buy and burn mechanic.
View the Burn Tracker

Venice Studio

A full timeline-based video editor is now live in Venice Studio. Multi-track editing, AI-generated media import, text overlays, filters, transitions, multiple aspect ratios, auto-save, and one-click sharing to the community feed — all inside the browser.
Try Venice Studio

Seedance 2.0 Now on Venice

ByteDance's Seedance 2.0 video model is available on Venice with text-to-video, image-to-video, and reference-to-video modes. Standard and fast variants across all modes, now with 1080p resolution support.
Generate with Seedance 2.0

Venice Agent Tools

Three new open-source repositories are now available for developers building on Venice.

Agent Skills — 19 self-contained skill files for LLM agents (Cursor, Claude Code, Codex, Cline) covering every Venice API surface. GitHub
Venice CLI — Command-line interface for Venice. Generate text, images, and audio directly from the terminal. GitHub
x402 Client SDK — Client SDK for x402 micropayments. Pay for Venice API requests with USDC on Base, no account required. GitHub

New Models

The following models have been added to Venice across our app and API:

Text Models

Claude Opus 4.7 — Anthropic's latest Opus-tier model with extended context, deep reasoning, and sustained performance on long-form tasks. Available to all users.
Grok 4.20 — xAI's latest Grok text model with function calling support. Available to all users.
Grok 4.20 Multi-Agent — xAI's multi-agent variant of Grok 4.20, supporting orchestrated multi-step reasoning across coordinated agent workflows. Available to Pro users.
Venice Uncensored 1.2 — Venice.ai's proprietary uncensored, unfiltered text model. Updated version with improved coherence and instruction following. Available to all users.
Kimi K2.6 — Text model from Moonshot AI with long-context support and strong multilingual capabilities. Available to all users.
GLM 5.1 — Zhipu AI's latest flagship text model, successor to GLM 5 with improved reasoning and instruction following. Available to Pro users.
Qwen 3.5 397B — Alibaba Cloud's 397B parameter text model from the Qwen 3.5 series. Large-scale model with broad reasoning and multilingual capabilities. Available to Pro users.
Qwen 3.6 Plus — Text model from Alibaba Cloud in the Qwen 3.6 family. Mid-tier variant with strong multilingual and reasoning capabilities. Available to all users.
GLM 5 Turbo — Text model from Zhipu AI. Speed-optimized variant of the GLM 5 series with reduced latency. Available to all users.
GLM 5V Turbo — Multimodal model from Zhipu AI with vision and text capabilities. Accepts image inputs alongside text prompts. Available to all users.
Mistral Small 4 — Mistral AI's compact text model optimized for low-latency inference while maintaining strong instruction-following. Available to all users.
Google Gemma 4 31B Instruct — Google DeepMind's 31B parameter dense instruction-tuned text model. Available to all users.
Google Gemma 4 26B A4B Instruct — Google DeepMind's 26B total parameter mixture-of-experts model with 4B active parameters per forward pass. Instruction-tuned for chat and task completion. Available to all users.
Gemma 4 Uncensored — Uncensored, unfiltered variant based on Google DeepMind's Gemma 4 architecture. Removes built-in refusal behavior. Available to all users.
Aion 2.0 — Large-scale text model with multi-step reasoning and long-context support. Available to all users.

Video Models

Seedance 2.0 — ByteDance's next-generation video model with text-to-video, image-to-video, and reference-to-video support. Includes standard and fast variants across all modes. Now supports 1080p resolution. Available to all users.
Runway Gen-4.5 — Video generation model from Runway with improved visual fidelity, motion coherence, and multi-subject consistency over Gen-4. Available to all users.
Runway Gen-4 Turbo — Faster, lower-cost variant of Runway's Gen-4 video model, optimized for reduced generation time while maintaining baseline quality. Available to all users.
Grok Imagine Private — Video generation model from xAI with private mode, supporting text-to-video, image-to-video, and reference-to-video generation without public visibility on the Grok platform. Available to all users.
PixVerse C1 — Text-to-video generation model from PixVerse with support for multiple aspect ratios and consistent motion synthesis. Available to all users.
PixVerse C1 R2V — PixVerse C1 variant supporting reference-to-video generation, producing video output guided by a reference image input. Available to all users.
PixVerse C1 Transition — PixVerse C1 variant that generates smooth transition videos between two input images or scenes. Available to all users.
Wan 2.7 Edit — Video editing model from Alibaba Cloud that modifies existing video content based on text prompts, supporting region-specific edits and style changes. Available to all users.

Image Models

GPT Image 2 — OpenAI's latest image generation model with stronger text rendering, UI generation, and photorealism. Native output up to 3840px across three quality tiers, with masked editing and streaming output. Available to all users.
FireRed Image Edit 1.1 — Image editing model supporting instruction-based modifications such as object removal, style transfer, and inpainting. Available to all users.

Audio Models

MiniMax Music 2.5 — Music generation model from MiniMax capable of producing songs with vocals, lyrics, and instrumentals from text prompts. Available in Venice Studio and via API.
MiniMax Music 2.6 — Updated music generation model from MiniMax with improved audio quality and vocal synthesis over Music 2.5. Available in Venice Studio and via API.

Additional Models

xAI TTS v1 — Text-to-speech model from xAI.
Inworld TTS 1.5 Max — Text-to-speech model from Inworld AI.
Chatterbox HD — High-definition text-to-speech model.
Orpheus TTS — Text-to-speech model with expressive voice synthesis.
ElevenLabs Turbo v2.5 — Low-latency text-to-speech model from ElevenLabs.
MiniMax Speech 02 HD — High-definition text-to-speech model from MiniMax.
Gemini Flash TTS — Text-to-speech model from Google DeepMind.
xAI Speech-to-Text v1 — Speech-to-text model from xAI supporting 25 languages with word-level timestamps.
BGE-EN-ICL — English text embedding model from BAAI with in-context learning support for retrieval and semantic similarity tasks. API-only.
Qwen3 Embedding 8B — 8B-parameter text embedding model from Alibaba Cloud for search, retrieval, and classification tasks. API-only.
Qwen3 Embedding 0.6B — Lightweight 0.6B-parameter text embedding model from Alibaba Cloud, optimized for low-latency embedding workloads. API-only.
Multilingual E5 Large Instruct — Instruction-tuned multilingual text embedding model from Microsoft supporting cross-lingual retrieval and similarity tasks. API-only.
Text Embedding 3 Small — Compact text embedding model from OpenAI for search, clustering, and classification with reduced dimensionality. API-only.
Text Embedding 3 Large — Higher-dimensional text embedding model from OpenAI with stronger retrieval accuracy and flexible dimension truncation. API-only.
Gemini Embedding 2 Preview — Text embedding model from Google DeepMind supporting search, document retrieval, and classification. API-only.
Nemotron Embed VL 1B v2 — 1B-parameter vision-language embedding model from NVIDIA for multimodal retrieval across text and image inputs. API-only.

Model Upgrades

Grok Models Privacy Upgrade — All Grok models upgraded from Privacy Mode 1 (anonymous) to Privacy Mode 2 (private). Users are no longer charged for failed generations due to content restrictions.

App

New Features

Audio Generation in Venice Studio — Music, voice, and sound effect generation added to Venice Studio; users can describe desired audio and generate original tracks, voiceovers, or sound effects with in-line playback and previews.
Chat Insights — Automatically extracts and remembers key details about the user across conversations, stored locally on the user's device.
Topaz Upscaler — AI image upscaling via Topaz is now available to all users in Image Studio.
Video Upscaling — New video upscaling feature available to all users from within the Studio interface.
Mobile Studio Access — Studio is now visible and accessible on mobile devices.
Voice Conversations — Realtime voice conversation mode with memory sync, chat persistence, waveform visualization, push-to-talk input, auto-greet, and language switching support.
Privacy Mode UI Simplification — TEE and E2EE options in the Privacy Mode dropdown are now combined into a single option; TEE is the default, with an E2EE toggle available in model settings. Privacy pill display order updated to "TEE · E2EE."
Support Bot Auto-Routing — Support bot now automatically routes conversations to the appropriate support category.
Country Attestation Gate — Users in blocked countries now see a once-per-session country attestation prompt before proceeding.
Character Page OG Images & Prompt Redesign — Character pages now display branded Open Graph images for link previews. Public character prompt page redesigned with updated layout.
Model Search Persistence — The search query in the model selector now persists when the selector is closed and reopened within the same session.
Crop Image Modal — Added an image cropping modal for editing images before use.
Visualization Sharing — Added visualization support to shared content.
Video Preview Thumbnails — Added preview image thumbnails for videos.
Memoria & Character Context Uploads — Support for .md file uploads now works for Memoria and character context.
Ignore Beads — Added bead filtering to ignore list.
Usage Tab in Settings — Added a new "Usage" tab to the Settings page.

Wallet and Payments

Subscription Flow & UI Refresh — Revised subscription purchase, management, and upgrade/downgrade flows with updated UI, routing, and tier display.
Bonus Credits Dollar Display — Bonus credits are now displayed as their USD equivalent ($30 for Pro, $10 for Plus) instead of raw credit counts.
Crypto Payment Fallback — Stripe-based crypto checkout now falls back to Coinbase Payments when unavailable.
Burn Page Pagination — "Load more" button now available for additional transactions on the burn page.
Video Credit Refund Status — Credits refunded due to video inference failures now show a "refunded" state in the transaction history.
Subscription Upgrade UI — Added pending-state UI components shown during in-progress subscription upgrades.
Subscription Upgrade CTAs — Added clearer upgrade calls-to-action within the subscription management UI.
Pricing Value Badges — Added value badges (e.g., "Best Value") next to credit line items on the pricing page.
Crypto Checkout Deeplinks — Added deeplinks that route users directly to the crypto checkout flow from external surfaces.

Performance

Multi-Image Upload Compression — Per-image compression is automatically scaled down when uploading 8 or more images in a single message.
List Virtualization — Added virtualization to long scrollable lists to reduce rendering overhead.

Mobile App

Max/Plus Badge — Added badge indicators for Max and Plus subscription tiers in the UI.
Voice Settings Screen — Added a dedicated voice settings screen with reusable component shared across screens, including adjustable playback speed controls.
Reference Video Attachment — Added a UI component for attaching reference videos in the input area.
Thinking Content Dialog — Added a dialog component to display model thinking/reasoning content.
TEE Attestation Report — Added TEE attestation report link to the model selector.
System Prompt in Auto/Simple Mode — System prompt settings now appear in auto and simple mode; settings order updated.
Video Download URL — Added downloadUrl support for videos.
Android WebView Bridge — Suppressed noisy javacalljs bridge logs in Android WebView.
Android APK Download — Updated the Android APK download URL on the website.
Axios Dependency Update — Updated axios from 1.13.6 to 1.15.0, including CVE security fixes.
Music Player Seeking — Fixed inaccurate time seeking in the music player.
Model Selector Search Count — Fixed search result count display in the model selector to match actual results.
System Prompt Dialog — Constrained the input field height in the system prompt dialog to prevent overflow.
Music Bottom Sheet — Changed the generate button text color to white in the music bottom sheet.
System Prompt Sync — Fixed system prompt activation not syncing correctly on mobile.
Image-to-Video Rotation — Fixed incorrect rotation of image attachments when used for image-to-video on mobile.
ASR Button Visibility — Hide the speech recognition button when text is already present in the input field.
Light Mode Error Boundary — Fixed a styling bug in light mode on the error boundary screen.
Native Playback Speed — Playback speed setting is now passed to native start-session calls on both iOS and Android.
Video Processing Hook — Updated the video processing hook with revised handling logic.
iCloud Download Error — Added a toast notification when an iCloud file download fails.
Send Button Fix — Fixed a bug preventing the send button from functioning correctly.
Settings Layout Cleanup — Removed an unused screen from the settings navigation layout.
Video Error Handling — Updated error messaging and handling for video playback failures.
iOS Audio Playback Speed — Fixed playback speed not applying correctly on iOS.
Playback Speed Switcher — Added a selected-state checkmark indicator to the playback speed switcher.
Conversation Voice Selector — Voice selector in conversations is now visible only to Pro users.
Settings Layout — Added flex-wrap to multiple settings screens to handle varying content widths.
Venice Voice Settings Order — Reordered items in the Venice voice settings screen.
Language Selector Separation — Separated the language selector into its own component, decoupled from the TTS component.

API

Crypto RPC Proxy — New JSON-RPC proxy at POST /api/v1/crypto/rpc/:network covering 24 network slugs across 11 chains (Ethereum, Polygon, Arbitrum, Optimism, Base, Linea, Avalanche, BSC, Blast, zkSync Era, Starknet). Supports single and batch calls, tiered credit billing (1x/2x/4x by method complexity), and per-user rate limiting. Public discovery endpoint at GET /api/v1/crypto/rpc/networks.
x402 Protocol Support — Venice now accepts payments via x402, a micropayment protocol enabling per-request pay-as-you-go API access with USDC on Base. No account required.
Search Endpoint — New POST /api/v1/augment/search endpoint for web search queries.
Web Scrape Endpoint — New POST /api/v1/web/scrape pass-through endpoint for retrieving webpage content.
Base64 Audio Upload for ASR — The speech-to-text endpoint now accepts base64-encoded audio input in addition to file uploads.
Reasoning Token Usage — Chat completion responses now include token usage details for reasoning tokens.
Function Calling Expansion — Enabled function calling support on Qwen models and additional models.
Venice Uncensored 1.2 Capabilities — Enabled multimodal input and function calling for Venice Uncensored 1.2.
Multi-Image Edit API Update — Multi-image edit endpoint now accepts an array of image URLs instead of a single URL.
Embedding Model Metadata — The /models endpoint now exposes embedding dimensions and input token limits for embedding models.
Usage History Endpoint — New GET api.venice.ai/api/v1/billing/usage endpoint for querying billing usage history.
Child API Keys with Spend Caps — Support for child API keys with configurable lifetime DIEM and USD spend caps.
Video Download URL — API responses for video generation now include a direct download URL.
DIEM Staking Balance Refresh — New endpoint to refresh the cached DIEM staking balance after a user stakes.
Referrals Leaderboard — New leaderboard endpoint integrated into the referrals UI to display ranking data.

Fixes and Improvements

Fixed image crop modal exceeding its expected boundaries
Improved restored compact selected-model cards at the top of Video Studio
Updated the Burn Watch area chart component
Updated grouping logic for item organization
Renamed mobile settings menu item from "Preferences" to "General"
Fixed model selector tooltip remaining visible when dropdown is open
Fixed image details drawer falling out of sync when navigating between images in the lightbox
Added Google Search Console verification file
Reduced excess empty space below image variants in Image Studio
Removed redundant directive from the getCharacter function
Fixed an infinite redirect loop between chat and sign-in pages
Added keep-alive handling during server-sent events processing to prevent premature disconnects
Filtered out bot-driven React Server Component router-state header errors
Added missing React key to a Flex element in MultiModalUserMessageContent to resolve rendering warnings
Updated the credits icon in the navigation, replacing the previous "Purchase Credits" badge
Hidden the "Pay with Crypto" button on monthly subscription options
Fixed downloaded images not respecting the user's selected image format setting
Fixed playback speed changes incorrectly altering voice pitch in conversations
Adjusted ZaiGLM51 model configuration parameters to improve response performance
Removed irrelevant push-to-talk keyboard shortcut tooltip from mobile web interface
Fixed buttons remaining active after submission, preventing duplicate requests
Widened the API Key Created modal to prevent key text from wrapping
Improved questionnaire to allow submitting responses by pressing Enter/Return
Fixed image-to-video pricing accuracy by including image URL in video quote requests
Updated web scrape response structure to match existing API response conventions
Fixed session not resetting properly when switching wallets during web3 logout
Improved video generation to display a credits purchase modal when the user has insufficient credits
Fixed subscription upgrade modal not functioning correctly for staked users
Fixed credit balance banner displaying misleading information when balance is split across sources
Fixed past-due subscription status not clearing when a non-Stripe subscription becomes active
Fixed a navigation error loop occurring on the chat page in Mobile Safari
Fixed autocomplete anchoring for references in Safari
Fixed a regression in Safari chat image copy from the viewer
Fixed image generation progress indicator not displaying in Safari
Fixed texture rendering error occurring when seeking within videos in the video editor
Fixed copied rendered images producing invalid blob URLs instead of usable image data
Fixed images loading eagerly instead of lazily, impacting page performance
Fixed action buttons on the video grid not functioning correctly
Fixed deleting a single image removing all displayed variants instead of only the selected one
Fixed text readability on blocked content indicators in the video grid
Fixed character profile Open Graph image routing and photo lookup failures
Fixed character Open Graph images not rendering in link previews due to missing server-side pre-fetch
Fixed character profile images not appearing correctly in social media share previews
Fixed black box appearing in place of images while loading in chat
Improved render performance of the video studio
Fixed edit image modal not scrolling correctly on mobile devices
Fixed video selector not functioning correctly when choosing video inputs
Fixed audio track processing unnecessarily waiting on thumbnail generation to complete
Fixed prompt character limits not being enforced correctly across different video generation models
Fixed image generation failing when negative seed values were passed to the API
Fixed video pricing calculation failing when image-to-video requests had no aspect ratio specified
Fixed API multi-turn conversations stripping images from previous messages, breaking image analysis across turns
Fixed queued messages overlapping with chat responses
Fixed voice input not working when Brave browser's Shields feature is enabled
Fixed selected chat model not persisting correctly between sessions
Improved markdown rendering to support LaTeX and math notation in multimodal responses
Fixed rate limit notification not displaying for free-tier users
Fixed message ordering appearing incorrect when reopening a chat
Fixed unavailable models row rendering incorrectly in certain conditions
Fixed fork popup not dismissing properly in chat menus
Fixed token caching behavior that was causing issues when used outside of the API context
Fixed mobile model picker opening the description panel on row tap instead of selecting the model
Fixed model search failing to find models with version letters embedded in their names
Fixed sidebar conversation delete not working correctly
Improved overall performance and page loading times
Fixed image generation variants incorrectly using steps and CFG scale values from global settings
Fixed variant settings inheriting steps and CFG scale from the wrong model
Fixed file names with special characters causing errors during upload
Improved Memoria context accuracy and reduced repetitive memory references
Fixed generation queue not recovering properly after an error
Fixed memory context not being applied to certain eligible models
Fixed conversation titles not displaying correctly
Fixed geo-restriction notification appearing for models that are not actively selected
Fixed studio mobile header being obscured by the safe area in installed PWA mode
Fixed plan indicator displaying incorrectly on pricing cards
Fixed localized country names incorrectly including an English article
Fixed settings items missing their card-style container styling
Fixed gaps in local media cleanup that could leave orphaned files
Fixed audio output being truncated during processing
Fixed Mermaid diagram rendering errors appearing in chat
Improved queue loading animation in the header
Fixed content policy errors not being surfaced properly during music generation
Fixed a scrollbar regression that affected chat turn history display
Fixed user prompt being duplicated in music generation requests
Fixed dollar-sign currency values being incorrectly rendered as LaTeX math expressions in chat
Fixed model switcher to preserve backend-defined ordering instead of re-sorting client-side
Fixed auto-submitted prompts not clearing from the character chat input field after submission
Fixed chat input field rendering below the visible viewport on Safari
Fixed model search not matching results for queries containing spaces
Fixed photo viewer not closing when initiating background removal on an image
Updated past-due payment banner to indicate users retain access during the grace period
Fixed pricing tiers not updating when promo state changes
Added automatic redirection to Audio Studio from legacy audio routes
Fixed model fallback toast notification appearing repeatedly
Fixed incorrect model pricing display
Updated Swagger video schemas to show all available model options

March 30th, 2026

Venice.ai Change Log - March 7, 2026 - March 26, 2026

Venice Referral Program

Share your referral link and earn $10 in API credits when someone you refer upgrades to Pro. Referred users also receive $10 in credits. Track referrals from Profile settings or the new referral drawer. The program is open to all users, including anonymous visitors.

Privacy Modes: TEE & E2EE on Venice

Every model on Venice now displays a privacy mode, giving users full visibility into how their data is handled. Models running inside Trusted Execution Environments (TEE) ensure isolated processing, and end-to-end encrypted (E2EE) models go further — not even Venice can see your prompts or responses. Click any privacy badge to see exactly what each mode means.

Read the full announcement →

Explore Privacy Modes →

All models below support both TEE and E2EE. Toggle between modes in model settings.

Venice Uncensored 1.1
GLM 5
Qwen3.5 122B A10B
Gemma 3 27B
GLM 4.7
GLM 4.7 Flash
GPT OSS 20B
GPT OSS 120B
Qwen 2.5 7B
Qwen3 30B A3B
Qwen3 VL 30B A3B

Operated by NEAR AI Cloud and Phala Network.

Import Memory

Import conversation memories from ChatGPT, Claude, and other AI providers into Venice. Rolled out to all users.

https://x.com/AskVenice/status/2033989325830955458

Kling Reference-to-Video

Upload a reference image and generate video that maintains the same character across scenes. Supports character consistency and lipsync in AI-generated videos.

https://www.youtube.com/watch?v=vjA9pECWfTY

New Models

Models added to Venice during this period.

Text Models

MiniMax M2.7 — Text model from MiniMax with multi-turn conversation support.
Aion 2.0 — Text model from AionLabs. Pro only.
Grok 4.20 Beta — xAI's latest Grok text model with function calling and built-in X Search.
Grok 4.20 Multi-Agent Beta — Multi-agent variant of Grok 4.20 with X Search support.
Mistral Small 3.2 24B Instruct — Mistral AI's 24B-parameter instruction-tuned text model. API only.
Qwen 3 Next 80B — 80B parameter dense model from Alibaba Cloud. API only.
Qwen 3 235B A22B Instruct — 235B parameter mixture-of-experts model from Alibaba Cloud. API only.
Qwen 3 235B A22B Thinking — 235B parameter MoE model with extended thinking mode. API only.

Video Models

Kling O3 Pro Reference-to-Video — Character-consistent video generation from a reference image. Pro tier.
Kling O3 Standard Reference-to-Video — Character-consistent video generation. Standard tier.
Seedance 1.5 Pro — Video generation model from ByteDance. Text-to-video and image-to-video. Uncensored.

Audio Models

ElevenLabs Multilingual v2 — Multilingual text-to-speech from ElevenLabs.
ElevenLabs TTS v3 — Latest ElevenLabs text-to-speech model.
ElevenLabs Music — Music generation from ElevenLabs.
ElevenLabs Sound Effects — Sound effects generation from ElevenLabs.
Qwen 3 TTS 0.6B — Compact text-to-speech model from Alibaba Cloud.
Qwen 3 TTS 1.7B — Larger text-to-speech model from Alibaba Cloud.

Model Upgrades

DeepSeek V3.2 — Now supports function calling and structured output. Pro only.
Qwen 3.5 9B — Now supports vision input, accepting both text and image prompts.

App

New Features

Music Generation Duration — Continuous duration control for music generation using ElevenLabs models
Video Queue Progress Bar — Queue items with unknown progress now show an indeterminate loading state
Image Generation Timeout Handling — Timeout errors during image generation now display a user-facing error message instead of failing silently
HEIC Image Upload Support — HEIC images are now automatically converted on upload so iOS photos can be used directly
Drag-and-Drop for Image Edit Prompt — Images can now be dropped directly into the edit image prompt input field
Qwen3 TTS in Playground — Added Qwen3 text-to-speech models to the Playground model selector
Resolution-Based Max Duration — Video models now enforce maximum generation duration based on selected output resolution
Referral Drawer — Dedicated drawer component for viewing and managing referral details
Referrals for Anonymous Users — Anonymous users can now view and interact with referral prompts before signing up
Referral Pro Upgrade Date — Referral dashboard now displays the date each referred user upgraded to Pro
Image Info Button in Edit Tab — Info button on the selected image in the edit tab showing image metadata
Video Duration Ranges on Model Cards — Model cards now show supported video duration ranges

Wallet and Payments

Coinbase Payment Links — Added Coinbase payment links as a payment option
Coinbase API Migration — Migrated Coinbase payments from Commerce API to Payment Links API
Referral Credits in Profile — Referral credits earned now displayed in Profile settings
Out of Credits Flow — User flow for when credit balance reaches zero, prompting next steps
Bonus Credits Callout Card — New card component highlighting available bonus credits
Admin Referral Credits Page — New admin page to view and manage referral credits
Admin Referrals Endpoint — New admin API endpoint for querying user referrals and credits
Referral Credits in Session API — Referral credit data now included in the user session API response
Stripe Invoice Pagination — Added Stripe hasMore parameter to invoice list responses
Auto Top-Up Amount Update — "Amount to Add" field now auto-updates when Auto Top-Up is configured
Next Estimated Payment — Stripe users can now see their next estimated payment amount

Mobile App

Connection Error Message — Added a user-facing error message when the connection is dropped mid-request.
Document Attachment Loader — Fixed missing loading indicator when attaching documents in chat.
Document Pick Wallet Fix — Fixed an issue where selecting multiple files for document upload caused the wallet to log out.
Import Memory Prompt Localization — Added internationalization support for the import memory prompt text.
Android Header Fix — Fixed the right-side header element not rendering in the Android stack header.
Social Feed Video Playback — Fixed playback issues and improved video rendering in the social posts feed.
Social Feed Media Layout — Added inline video playback and a masonry grid layout for media in the Social Feed.
Background Removal — Added a Remove Background action for images in the mobile app.
Video Playback UI — Improved video player interface elements and expanded test coverage.
Video Loading Progress — Video loader now displays percentage complete and estimated remaining time.
Enhanced Prompt Text Selection — Text inside the enhanced prompt popover is now selectable.
Lazy-Loaded Tabs — Image and feed tabs now lazy-load their content instead of rendering on mount.
Filter Defaults and Reset — Adjusted default filter values and added a reset button to restore defaults.
Chat Memory Default — Chat memory is now enabled by default for new sessions.
Video Models in Top-Level Navigation — Video models now appear at the same navigation level as text and image models.
Android Video Tap Fix — Fixed tapping a playing video on Android not registering correctly.
Media Detail Pre-Loading — Images now pre-load before navigating to the media detail screen.
Video State on Tab Swipe — Fixed videos continuing to play when swiping between tabs.
Video Feed Screen — Added a dedicated video feed screen to the mobile app.
Image Feed Screen — Added a dedicated image feed screen to the mobile app.
Video Tab — Added a new Video tab to the main tab layout.
Prompt Screen Keyboard Dismiss — Keyboard now dismisses when leaving the prompt screen.
Video Feed Volume Control — Added a volume control to videos in the feed.
Assistant Video Error Boundaries — Wrapped assistant video components in error boundaries to prevent crashes.
Post Video Error Boundaries — Wrapped video components in social posts with error boundaries.
Prompt Screen Render Optimization — Reduced re-renders on the prompt screen to fix a Hermes runtime crash.
Import Memories Dark Mode — Applied dark mode styling to the Import Memories button; added translations for multiple languages.
Prompt Input Limits — Added character and token limits to the message input components.
Image Modal Orientation Fix — Fixed image modal orientation for large portrait images.

API

Video Transcript API — New public endpoint for video transcription at $0.02 per request.
Music Services v1 API — New /v1/music API endpoints for music generation.
Character Reviews Endpoint — New GET /v1/characters/[slug]/reviews endpoint to retrieve character reviews.
Characters Public API Expansion — Added previously missing fields and features to the characters public API.
GTM on API Settings Page — Added Google Tag Manager loader to the API settings page.
Character Author Field — Characters now use a dedicated author field in their data model.
Chat Response Time — Chat responses now display elapsed response time.
Anonymous API Settings Access — Unauthenticated users can now view the API settings page.
Characters Endpoint: author & isOwner — /characters endpoint now returns author and isOwner fields.
System-Only Message Requests — API now accepts requests containing only system messages.
Default Reasoning Effort — Default reasoning effort for inference set to medium.

Fixes and Improvements

Improved file-type support detection and validation in the chat document upload flow
Updated referral text to clarify that referred users must subscribe to Pro
Improved responsiveness when syncing available models with the server
Updated document upload panel positioning to auto-adapt to available screen space
Fixed a crash caused by the chat message timer exceeding maximum update depth
Fixed clipped content caused by overflow CSS being set to hidden
Fixed a crash when displaying the message header for conversations using retired chat models
Fixed a createTreeWalker crash on the chat page
Added timeout handling for Memoria long-term memory operations
Fixed the Learn More link on the Diem token page pointing to an incorrect destination
Improved how video outputs and scene images are stored and displayed within chat
Simplified the duration label formatting shown during video generation
Fixed badge rendering and layout on mobile web viewports
Updated memory import prompt to request markdown-formatted input
Improved memory import to batch embedding requests and avoid rate limits
Strengthened rate limiting and endpoint validation across REST API routes
Removed maxBatchSize field from API response payloads
Restored Aria voice option in text-to-speech
Fixed referral credits not displaying correctly in the referrals drawer
Fixed referral credits being incorrectly issued for referrals made after subscription
Fixed price helpers and removed deprecated three-day subscription type
Fixed Open Graph thumbnails not displaying correctly for feed posts
Fixed inpaint and upscaler error messages not showing properly for free-tier users
Fixed race condition causing image attachments to disappear from conversations
Fixed defects in the Video Editor
Improved image size error messaging
Added video transcription v1 endpoint with fixed pricing
Improved chat interface to render sub-agent steps and responses
Fixed an error on the chat page that could interrupt conversations
Fixed a loop caused by active model fallback logic in chat
Fixed excessive re-rendering in the chat timer component
Fixed referral stats displaying NaN values
Fixed character selection overlay not dismissing properly on mobile
Fixed prompt history interfering with @mention keyboard navigation
Fixed library page getting stuck in a loading state
Improved authentication loading performance
Fixed theme color selector not indicating the selected color
Fixed promo codes not applying correctly during checkout
Fixed menu items appearing highlighted when not selected
Fixed selected theme not applying on the checkout page
Fixed typos in model descriptions for several models
Fixed incorrect error message when an invalid voice is specified
Fixed message order after editing by correcting the sorting logic
Fixed image modal footer visibility
Collapsed duplicated encrypted reasoning placeholder in TEE/E2EE responses
Fixed image modal orientation for large portrait images on mobile

March 6th, 2026

Venice.ai Change Log - February 26, 2026 - March 6, 2026

GPT-5.4 Suite Now Available

OpenAI's flagship models GPT-5.4 and GPT-5.4 Pro with extended reasoning are now available to all Venice users. Here's everything else we shipped.

New Models

The following models have been added to Venice:

Text Models

GPT 5.4 — Latest, most advanced text model from OpenAI. Available to all users.
GPT-5.4 Pro — OpenAI's Pro-tier text model with extended reasoning capabilities. Available to all users.
Qwen 3.5 35B A3B — Text model from Alibaba Cloud with 35 billion parameters. Now publicly available to all users.
GLM 4.6 — Text model from Zhipu AI. Now available via API.
GPT-4o — OpenAI's multimodal text model with 128K context window, supporting text, image, and audio inputs. Available to all users.

Image & Video Models

Nano Banana 2 — Google’s image generation model with per-unit pricing. Available to all users on a pay-per-use basis
LTX 2.3 — Video generation model from Lightricks. Available to all users

App

New Features

Video Prompt Auto-Enhance — Video prompts are now automatically enhanced before generation to improve output quality
Keyboard Shortcut: Cmd/Ctrl+Enter — Submit prompts using Cmd+Enter (Mac) or Ctrl+Enter (Windows)
Session Persistence — Chat sessions are now preserved across page reloads and browser restarts
Route-Based Navigation — Navigation now uses URL-based routing, enabling browser back/forward and direct linking
Session Selection — Users can now select and switch between active sessions
Music Generation Queue Polling — Music generation now polls for completion status asynchronously, showing real-time progress updates
Content Violation UI — Improved messaging and presentation when content moderation flags a request
Verification Center Header — New header section added to the Verification Center page
GIF Export — Export generated content as GIF files
Enter Key Submission — Pressing Enter now submits prompts in applicable input fields
Logged-In User Redirect — Authenticated users are now automatically redirected away from landing/login pages
Randomized Video Feed — Videos in the public feed now display in randomized order
Video Gallery — New gallery view for browsing generated videos
Video Completion Notification — Users now receive a notification when their video generation finishes
Recent Chat Images — Recent images from chat history are now accessible in the conversation view
Variant Model Settings — New settings for configuring model parameters when generating variants
Venice Banner — New Venice-branded banner added to the interface
Music Routing and Confirmation — Music generation now includes inline confirmation before submission and updated routing

Wallet and Payments

Billing Interval Toggle — Monthly checkout now includes a toggle to switch between billing intervals.
Video Generation Credit Costs — Credit costs are now displayed in the video generator.
Monthly Crypto Subscriptions — Users can now subscribe to monthly plans using cryptocurrency.
Credit Cost Reminder Modal — A modal now reminds users of credit costs before confirming an action.

Mobile App

AI Agent Skills — Agents now support skills, enabling task-specific capabilities inside conversations.
Multi-Image Support — Send or attach multiple images in a single message.
Voice Input Reliability — Updated speech-to-text handling for the new Expo SDK, including more reliable auto-submit behavior.
Android Message Composer — The message input now grows dynamically on Android to avoid typing and layout issues.
Context Usage UI — Updated the context usage component and fixed layout issues in the context usage popover.
Conversation Rename Dialog — Added a dedicated dialog for renaming conversations.
Errored Message Actions — Assistant messages that fail now surface regenerate and delete actions.
Token Dashboard Navigation — Added authenticated navigation entry points to the token dashboard.
Settings Navigation — Added Settings to the drawer and refreshed parts of the settings UI.
Preferred Model Persistence — The model selector now preserves and displays the user's saved model label.
Credits and Pricing UI — Refreshed credits iconography, price quote styling, and related status colors.
Variant Settings Navigation — Added a back arrow to variant settings bottom sheets.
Animated Image Loader — Image generation now shows an animated loading state, with fixes to loader clipping.
Image Variant Defaults — Updated default handling when selecting image variants.
Media Item Text Layout — Adjusted line height and removed model labels from media item displays.
Chat and Search Layout Polish — Refined chat padding and search group header styling.

API

Music Generation SDK — Added async queue support for music generation in the API SDK
Auto-Mint API Key — API keys are now automatically generated for new accounts
API Key Requirement Removed — API key is no longer required for certain operations
Max Reasoning Effort — New max_reasoning_effort parameter to control reasoning depth on supported models
Background Removal Model — Replaced BEN-2 with Bria RMBG 2.0 for background removal
Checkout Session Endpoint — New endpoint for updating an existing checkout session
Multi-Image Generation — Added support for generating multiple images in a single API request
Free Tier Moderation — Content moderation now applied to free-tier user requests
Credit Grant — 1K credits granted to user accounts
Dynamic Model Selection — Router now dynamically selects the fastest available model for requests
Annual Subscription Pricing — Annual Stripe subscription price changes from $149 to $180, effective March 1, 2026
Router Token Limit — Increased the maximum token limit for model router requests
Grok 4.1 Fast Pricing — Updated per-token pricing for the grok-4.1-fast model
Stablecoin Payments — Added stablecoin as a payment method
Music Task Type — Added music as a new task type in the API

Fixes and Improvements

Improved tooltips now appear after a short delay instead of instantly on hover
Updated memory behavior in temporary chat sessions
Improved chat input field now auto-refocuses after sending a message
Improved border animation on the music generation progress indicator
Improved moved video models to a different category in the model selector
Improved video model selector now displays the first 6 models by default
Updated multimodal model status indicator rendering for new content types
Improved removed video models selection box from the default view
Improved upgrade button is no longer shown to users already on a Pro plan
Improved interface now resets to edit mode after completing an action
Improved mode switching is now disabled while a generation task is in progress
Added loading placeholder to reduce layout shift while content loads
Improved document upload and parsing speed in Memoria
Added back arrow to variant settings sheets for easier navigation
Improved adjusted line height and removed model labels from media item text displays
Updated error message for insufficient account balance
Improved access by removing the credit requirement for usage
Improved Phala verification by running it client-side in the browser
Improved model capabilities to support multi-modal inputs
Improved rendering of model thinking output in the chat interface
Improved end-to-end encryption with server key authentication
Improved privacy capabilities with end-to-end encryption support
Improved end-to-end encryption with enhanced security measures
Improved model selector to persist its state across sessions
Improved certificate security by adding CRL revocation checking

February 25th, 2026

Venice.ai Change Log - February 19, 2026 - February 25, 2026

GPT-5.3 Codex Now Available

OpenAI's code-specialized GPT-5.3 Codex model is now available on Venice, optimized for code generation, refactoring, and debugging tasks. Here's everything else we shipped.

New Models

The following models have been added to Venice:

Text Models

GPT-5.3 Codex — OpenAI's code-specialized text model optimized for code generation, refactoring, and debugging tasks. Available to all users.

Image & Video Models

Seedream V5 Lite — ByteDance image generation model for text-to-image synthesis. Available to all users

App

New Features

Batch Image Deletion — Added the ability to select and delete multiple images at once in
Image-Transition Video on Mobile — Image-transition video models are now available in the mobile app.
Image Storage Policy — Updated image storage and retention policy.
Inline Model Selector — Moved the model selector to an inline position within the generation UI.
Direct Image Generation — Images can now be generated directly from the current context without additional navigation.
Image Info Button — Added an info button on generated images to view metadata and details.
Clear Image History — Added option to clear image generation history.
Image Settings Sync — Image generation settings now sync across sessions and devices.
Prompt Character Limits — Prompt input fields now display and enforce character limits.
Prompt Enhancer Info — Added informational tooltip explaining the prompt enhancer feature.
Rate Limit Display — Added a widget showing current API rate limit usage and remaining quota.
Enhance Prompt Button — Added a button to automatically enhance/rewrite prompts before generation.
Pinned Model Selectors — Model selectors now stay pinned to the top of the panel when scrolling.
Counter Animation — Added smooth animated transitions to numerical counters (e.g., credits, counts).
Unified Combine UI — Unified the combine feature interface across image and video modes.

Wallet and Payments

PPU Credit Alert — New dialog warns users about credit costs before regenerating responses from pay-per-use models
Clickable Credit Elements — Credit indicators in the UI are now clickable
Credit Balance Display — Credit balance is now visible in the UI

Mobile App

Mobile Update Dialog — New dialog prompts users to update the app when a newer version is available

API

Web Scrape & Reasoning — Added web scraping and reasoning capabilities to the API.
Free Tier Limits Removed — Removed rate limits for the free tier.
Single Prompt Support — Added support for single-prompt requests (non-chat completions).
Character-Specific Sampling — LLM sampling parameters now vary per character/persona.
Negative Prompts Disabled — Negative prompt parameter has been disabled.
Qwen Image Model — Added Qwen Image model to the v3 inference router.
Prompt Limit Increase — Maximum prompt length increased to 4,096 tokens.
Murano API Endpoints — Added new API endpoints for the Murano product.

Fixes and Improvements

Updated error message text displayed on mobile devices
Updated image hover action buttons to a circular shape
Fixed header and footer not appearing when scrolling to the top of the page
Improved removed the 4:5 aspect ratio from available options
Improved removed the 4:3 aspect ratio from available options
Improved the metadata panel shown during variant generation
Updated the color of the favorite/star icon
Improved hidden the Variants button when auto mode is active to avoid conflicts
Improved incompatible video models are now automatically deselected when switching settings
Fixed settings modal not adjusting its height to fit content
Improved removed the redundant "New Session" button from the interface
Improved removed wallet connect button from the video landing page
Updated default model parameters for Venice models
Updated default minP sampling parameter for model inference
Improved handling and response behavior when web search fails
Improved wallet-based users are now excluded from promotional credit grants
Fixed image generation prompt not being applied correctly
Fixed image action buttons not responding to clicks
Fixed aspect ratio button not functioning correctly in image generation
Fixed web search and resolution quoting not working correctly
Improved error handling during backup restore operations
Fixed style picker and variant selection issues in image generation
Fixed bugs and improved model selector behavior
Fixed time-to-first-token timer reporting incorrect values
Improved document processing reliability
Improved API to support Anthropic structured output

February 19th, 2026

Venice.ai Change Log - February 5, 2026 - February 19, 2026

Claude Sonnet 4.6 Now Available on Venice

Anthropic's latest Claude Sonnet model is now live on Venice. This model brings improved capabilities across reasoning, coding, and conversation at competitive pricing — available to all users.

Here's a selection of everything we shipped over the past weeks.

New Models

The following models have been added to Venice app & API:

New Models

Claude Opus 4.6 — Anthropic's Opus-tier model with 1M token context window and strong performance across coding, analysis, and complex reasoning. Now available to all users
Claude Sonnet 4.6 — Anthropic's latest Sonnet with 1M token context, adaptive reasoning, and code performance approaching Opus at Sonnet pricing. Available to all users
GLM 5 — Zhipu AI's latest-generation text model, now available on Venice via DeepInfra. Available to all users
GLM 4.7 Flash Heretic — Uncensored, unfiltered text model from Zhipu AI with 128K context window. Available to all users
MiniMax 2.5 — Text model from MiniMax with multi-turn conversation and strategic reasoning capabilities. Available to all users

Model Updates

GLM 4.7 — Reasoning mode now enabled, supporting chain-of-thought processing. 128K context window
Nano Banana Pro — Now includes built-in prompt enhancement for image generation (Pro users)

App

General Updates

Aspect Ratio Selection — New aspect ratio picker added to image and video generation
Inpainting Settings — New settings panel with controls for inpainting edits
Annual Price Increase Banner — Banner notifying users of upcoming annual pricing changes
Video Studio Link — Added a direct navigation link to Video Studio
Favorite Models — Save models as favorites for quick access in the model picker
Image Paste in Video Studio — Paste images directly into Video Studio from your clipboard
Drag-and-Drop Images — Drag and drop images into Video Studio or chat
Video Variants — Generate variant versions of existing videos
Clear All Button — New "Clear All" button to reset current selections
Multi-Shot Video — Create videos with multiple scenes in a single prompt
Chat Annotations — Inline annotations now appear alongside chat responses
Chat Onboarding — Step-by-step walkthrough for new users on first visit to chat
Video Studio Walkthrough — Guided tour for new users on first visit to Video Studio
Image Input for Prompt Enhancer — Attach reference images when using the video prompt enhancer
Category Suggestions — Added category-based suggestions to model or content discovery
Preferred Model in Settings — Set a default preferred model from the Settings page
Multimodal Image Loader — Upload images alongside text prompts in multimodal chat

Wallet and Payments

3D Secure for Past-Due Payments — Payment retries on past-due invoices now support 3D Secure authentication
3D Secure for Subscription Upgrades — Subscription upgrade payments now support 3D Secure authentication

Mobile Web

Video Studio on Mobile — Video studio link is now visible and accessible on mobile web PWA.

Mobile App

Video-to-Video Generation — New video-to-video mode allows uploading an existing video as input for generation
Favorite Models — Star models to add them to a new Favorites section in the model selector
Favorites Search — Search bar added to the Favorites tab in the model selector
Memoria — On-device memory system that stores context locally on your device between conversations
Auto-Generate Memories Toggle — New setting to enable or disable automatic memory generation from conversations

API

Image Generation Tool — Image generation is now available as a tool in chat completions requests.
Multi-Prompt Video Inference — Video inference requests now support multiple prompts in a single call.
Video Inference End Image — New end_image_url parameter for video inference requests to specify a target end frame.
Image Model Prompt Enhancer Field — Model listing responses now include hasBuiltInPromptEnhancer on image models.
Image Generation Simple Mode — New simpleMode parameter added to image generation requests.
Prompt Enhancement Rate Limits — New prompt_enhance rate limit type for prompt enhancement requests.
Image Request Field — New image field added to the request schema for image-related endpoints.

Fixes and Improvements

Improved prompt input bar to auto-expand as the user types
Updated model cards in the model picker to support collapsing
Improved removed the enhance prompt toggle from the interface
Added error messages when a response stream fails
Updated pricing for the PixVerse v5.6 video model
Updated Pro badges and icons to be visible to signed-out visitors
Updated Simple mode to hide model name labels
Updated web search results in chat to support collapsing
Improved NSFW video filtering in the social feed
Improved restored the Billing History section in account settings
Improved redesigned the retry payment flow for past-due subscriptions with clearer status and actions
Improved wallet reconnection by skipping the Sign-In With Ethereum step for previously authenticated wallets
Fixed loading dots persisting in auto mode
Added loading indicators to the Clip library while content loads
Improved search to include conversations inside folders in results
Fixed conversation folders section to be scrollable when the list exceeds the visible area
Improved search to hide folder headers and show only matching conversations
Added file size error handling for audio and video uploads that exceed the allowed limit
Fixed video model sub-tabs to be horizontally scrollable in the model selector
Improved expanded image editing API access to free-tier users
Increased rate limits for unauthenticated API users
Updated pricing to tiered model for requests exceeding standard context length
Updated Opus model requests to use tiered pricing

February 4th, 2026

Venice.ai Change Log - January 27, 2026 - February 3, 2026

2 Million Users: Thank You for Building with Venice

Venice has reached 2 million users — a major milestone for our community. Thank you to everyone who has made this journey possible. Here's everything we shipped this week.

New Models

We have added several powerful new models to the Venice platform:

Text Models:

Kimi K2.5 Model — Offering enhanced performance and capabilities at competitive pricing
GLM 4.7 Flash Model — Access restrictions lifted, making this powerful model available to all API users

Image & Video Models:

xAI Grok Imagine — Enhance your creative capabilities with new models for text-to-image, image editing, and video
Vidu Q3 Video — Publicly accessible video models now available for a wide range of applications and use cases
Chroma — Now available to all Pro and API users for high-quality, uncensored image generation

Character Video Generation

Character Video Generation — Create short videos featuring your AI characters. Uses Qwen Edit for image creation and Grok for video and audio production, bringing your characters to life in engaging video format.

Generate videos with your custom characters
Integrated image and video/audio pipeline

Venice AI Skill on OpenClaw

Venice AI OpenClauw Skill — Venice is now available as a comprehensive skill on OpenClaw, integrating text, models, embeddings, audio, and web search capabilities. Link: https://clawhub.ai/jonisjongithub/venice-ai

Venice is now also discoverable on Clawdhub, making it easier to find and use Venice skills.

Combine Images

Merge multiple images into one composite and edit them together with AI-powered blending.

Multi-Image Selection — Upload and select multiple images to combine into a single composition
AI-Powered Blending — Intelligent blending that seamlessly merges your images
Model Selection — Choose from multiple image models including Nano Banana Pro, GPT Image 1.5, Flux 2 Max, Seedream 4.5, and Qwen-Edit

Available now for Pro users and via API

Venice Video Studio

Venice Video Studio — A dedicated workspace for AI video generation. Create videos from text prompts, images, or existing videos with our standout multi-model generation feature that lets you compare results side-by-side.

Key features:

Generate videos from text prompts, images, or existing videos
Multi-model generation to compare results across different models
Side-by-side comparison view
Available in the App for all users

API Playground

API Playground — Test Venice API endpoints directly in your browser without writing code. After successful alpha testing, the interactive API Playground is now available to all users.

Features:

Test text, image, and video generation endpoints
View real-time responses
Copy code snippets for your integration

App

Agent Activity — Enhanced workflow with tool calling support for improved user experience
Cloud SQL Support — Added support for Cloud SQL Socket URLs for Google Cloud SQL users
Log Enhancements — Improved logging with migration status and modified request and response logs
Container Architecture — Auto-generated container wrapper structs for improved system architecture
Model Indicator — New model indicator SVG added for better model representation
Video Transition — New feature for video transition models with start and end frame specification
Logo Visibility — Enhanced logo visibility with deliberate design choice
Image Display — Improved image display in chat with clearer images
Design Capabilities — Significant enhancement to project's design capabilities with new files and modifications
Visual Indicators — Added visual indicators for easy issue identification
Temporary Video Warning — Modified temporary video warning banner for simple mode
File Upload — Improved file upload with processing indicator and explicit accept attribute
File Picker — Updated file picker to only allow image files for image editing modes
Venice Video Studio — Introduced multi-model video generation comparison tool
Persistent Video Playback — Added persistent video playback in Venice Video Studio
Video Deletion — Introduced ability to delete individual errored videos from queue
Translation Updates — Updated translation files for new hint text
Model Privacy — Integrated existing components to display model's privacy status consistently
Keyboard Handler — Added keyboard handler to video studio prompt textarea
Thumbnail Visibility — Improved thumbnail visibility in video studio
Thumbnail Previews — Introduced thumbnail previews for video queue items
Video Audio — Introduced dynamic video audio behavior with play on click and mute on hover
Character Insights — Introduced character insights feature with automatic extraction and new 'Insights' tab
Widescreen Support — Added support for widescreen 21:9 aspect ratio
Max Prompt Limit — Introduced max prompt limit for improved user experience
Chart Component — Created chart component to visualize daily messages
Video Studio Input — Enhanced overall functionality of Video Studio input panel
Audio Support — Introduced audio support status display for models in Venice Video Studio
Browser Notifications — Introduced browser notifications for video completion
Video Studio Navigation — Modified video studio navigation to always display navigation button
Chat Inference — Improved chat inference with hidden searching and first token
Model Selection — Improved model selection process in video studio
Chat Input Mode — Introduced automatic switching of chat input mode to image mode
Video Card Width — Updated Venice Video studio UI to enforce minimum video card width
Recent Images — Introduced feature to quickly select from recent generated images
Video Generation Time — Introduced feature to log and display video generation time
Model Selector — Improved model selector with click instead of hover
Vidu Logo — Added new Vidu video model logo
Video Generation Retry — Introduced feature to retry character video generation with uncensored model
Video Direction — Introduced comprehensive video direction for character video generation
Image Placeholders — Improved image placeholders to appear immediately and render as they complete
Reasoning Enabled — Introduced 'Reasoning Enabled' toggle in Character Settings modal
Responsive Layout — Improved responsive layout for consistent user experience
Character Mode — Introduced new CharacterModeToggle component
Story Prompts — Introduced feature to provide suggested story prompts for character interactions
Image Context — Improved user experience by providing essential context for generated images
Context-Aware Prompt — Introduced setting to disable Context-Aware Prompt Suggestions
Video Playback — Improved video playback behavior in VideoStudio component
Refresh Button — Introduced refresh button to conversation suggestions feature
Prompt Collections — Made prompt collections in Generated Videos section collapsible
Video Playback Control — Introduced feature to ensure only one video plays at a time
Chat Output — Enhanced chat output rendering with new components and modifications
Confirmation Modal — Introduced consistent themed confirmation modal for video deletion actions
Image Download — Introduced feature to download images after background removal
Button Visibility — Improved visibility of 'Hide Conversation Suggestions' button
Search Button — Moved search button next to notifications button and added back button
Entity Insights — Enhanced entity insights feature with updates to useInsightExtractor.ts hook
Post Detail Modal — Improved PostDetailModal.tsx file to address scrolling issues
Story Suggestions — Changed initial value of storySuggestionsEnabled to false
Action Menu — Updated action menu to remain usable during image and video generation
Translation Updates — Updated various language files and translations.json file
Document Upload Button — Updated icon used in DocumentUploadButton component
Template Columns — Extracted duplicate template columns to a constant and updated panel border styling
Video Fullscreen — Introduced changes to VideoFullscreenProvider.tsx file and updated translation files
Recent Images Helper — Introduced feature to display video studio input images in recent images helper
Edit Image Modal — Introduced ability to paste images directly into Edit Image modal
Video Gallery — Updated video gallery to maintain 2-column layout until xl breakpoint
User Experience — Improved user experience with various updates and enhancements
Responsive Video Gallery — Introduced responsive video gallery grid that adapts to different screen widths

Wallet and Payments:

Clear Guidance — Understand why the submit button is disabled when watching videos or using credit-billed models without logging in
Easy Credit Management — Get clear explanations and direct access to purchase more credits when running low in the Video Studio
Improved Payment Options — Enhanced payment method displays and updated translations for a smoother user experience
Expanded Access — Free users can now access auto top-up and billing sections for more convenient account management

Performance:

Persistent Video Generations — Completed video creations are now saved across page refreshes for a seamless editing experience

Mobile App

Image Sharing — Easily share images directly to the app and attach them to chats
Responsive Design — Improved layout and responsiveness for a better user experience
Model Updates — Enhanced model definitions for more accurate and efficient processing
Technical Updates — Behind-the-scenes improvements for smoother app performance
App Version Display — Quickly view your app version in the account info section
Clear Pricing — Subscription prices now clearly displayed in US dollars
Screen Lock Prevention — Phone screen stays on during tasks like voice recording and AI processing
Advanced Query — Expanded query functionality for more powerful searching
Folder Management — Easily create, delete, and manage mobile folders and conversations
Toast Message Improvement — Update toast messages now fit neatly on mobile screens with reduced font size and text

API

API Playground — Rolling out to all users after successful alpha testing
Multi-File Update — Improves overall system performance and functionality by updating multiple files and adding new ones.
Single File Modification — Enhances user experience by modifying a single file to improve its functionality.
Chat System Expansion — Expands the features or improves the existing chat system for better user interaction.
Error Message Improvement — Provides helpful error messages when a CONTENTPOLICYVIOLATION error is encountered, improving user experience.
Model Access — Allows users to access the model regardless of their subscription tier, with costs being incurred on a pay-per-use basis.
Debugging Enhancement — Introduces a new error code to differentiate between model runner rejections and validation failures, enhancing debugging and observability.
Model Metadata Update — Updates the model metadata to provide a direct link to the model's Hugging Face repository.
Video Inference Capability — Enables video models to utilize start and end frame capabilities.
Video Model Enhancement — Introduces a new capability to the VideoModelSchema, specifically the supportsEndImage feature.
Insight Extraction API — Enables the extraction and return of structured profile updates from conversations.
ImagineArt 1.5 Pro — Enables public access to ImagineArt 1.5 Pro.
Technical Update — Updates the interface and prioritizes Qwen3 VL 235B for vision requests.
API Price Sheet Update — Adds ImagineArt15Pro to the API price sheet and updates its FAL analytics URL.
Model Availability — Demonstrates the ease of managing model availability through configuration updates.
Request Tracking — Improves the tracking and identification of requests.
Social Feed Update — Allows users to post images without providing any text fields.
Security Enhancement — Enhances security by adding a minion authentication header to the TTS request.
Character Image Generation — Updates the character image generation prompt logic to use the flag.
Technical Enhancement — Supports enhancements by modifying or adding files.
Tool Support — Introduces support for xAI server-side tools, specifically websearch and xsearch.
Whisper Large V3 — Makes Whisper Large V3 public.
Video Studio Access — Expands access to the video studio, potentially enhancing user experience.
Variable Addition — Adds a variable to the script and uses it in the function.
Grok Imagine Update — Updates the Grok Imagine Text-to-Video and Video-to-Video models to reflect their audio support.
Model Access — Enables public access to the models by setting the configuration.
Migration Files — Adds new migration files to ensure a smooth transition to the updated schema.
Code Update — Updates the existing functionality by adding and removing lines of code.
Cortex Data — Expands the functionality of the API, allowing for more interactions with Cortex data.
SSE Streaming — Introduces support for nested containers with arbitrarily deep nesting in SSE streaming.
Model Discovery — Allows API clients to discover and select the model, which was previously hidden.
Function Update — Updates the function with an added test case to verify the behavior.
Refresh Index — Introduces a new index to support refresh operations.
Function Update — Updates the function with additions to the files.
Technical Update — Includes renaming of queries, optimization of images, and updates to manual header settings.
Schema Update — Updates the schema.prisma file with notable changes.
Cortex Data Endpoint — Introduces a new endpoint for listing cortex data, enhancing the API's functionality.
Cortex Entities — Introduces new API endpoints for managing cortex entities, including retrieval, creation, and publication.
Image Model Specs — Enhances the image model specs with aspect ratios, providing more detailed information about the images.
Grok Imagine Defaults — Updates the default resolution to 720p and duration to 10s for Grok Imagine video models.
Documentation Update — Updates the documentation with comprehensive information.
Inpaint Model Settings — Introduces aspect ratio settings to the InpaintModelSchema, allowing for more flexible image editing.
Posts Filtering — Introduces an optional parameter to the posts list endpoint, allowing for server-side filtering of posts by media type.
Media Type Filtering — Introduces an optional parameter to the posts API client, allowing for server-side media type filtering.
Model Sets — Introduces the concept of model sets for video models, allowing for better categorization and filtering.
Video Model Enhancement — Passes the selected video model ID to the prompt enhance endpoint in Video Studio.
Prompt Enhancement — Enhances prompts for video models by using a video-specific template, including details like motion, camera movements, and audio atmosphere.
Video Model Schema — Introduces an optional modelSets array field to the VideoModelSchema, allowing video models to be categorized into sets.
Traffic Routing — Introduces percentage-based traffic routing to LLM host configurations, enabling weighted distribution.
Payment Retry — Introduces an optional parameter to the past-due invoice payment API, allowing for retrying payments with the customer's current default payment method.
API Usage CSV — Disables API usage CSV download.
Subscription Upgrade — Introduces the ability to immediately upgrade Stripe subscriptions.
Prisma Client Update — Introduces a 30-second statement timeout for PrismaClient.
Conversations Selector — Introduces a new conversations selector for folder IDs, enhancing the functionality of the useConversations hook.
API Settings — Introduces an component to the API settings page, enhancing discoverability and user experience.
RxDB Schemas — Introduces new RxDB schemas for storing audio and video attachment metadata locally.
Cache Backfilling — Enables the backfilling of missing cache rows for users from USD ledger entries, improving data consistency.
Sharp Image Processing — Calls Sharp image processing paths to reduce high event loop pauses, particularly for large images over 1MB.
User Management — Improves the overall functionality of the user management system.
List Update — Updates the list to include 'Immediate' and adds filters to exclude false positives.
Entity Insights — Introduces new fields for demographic and physical appearance to entity insights, enhancing the schema.
Demographic Fields — Adds demographic fields to entity insights, enhancing the functionality of the app's insight extraction capabilities.
Deprecation Information — Introduces a new flag, apiShowDeprecation, to control the display of deprecation information in API responses.
Code Cleanup — Cleans up import paths in enhance-image.ts and multi-edit.ts.
Multi-Image Editing — Introduces a new multi-image editing endpoint, allowing users to submit up to 3 images for editing.
Routing Enhancement — Allows for more flexible and resilient routing.
Prisma Configuration — Updates the Prisma configuration to accommodate cleartext connections.
Billing Endpoint — Introduces a new GET /api/v1/billing/balance endpoint that returns the current balance, daily usage, and epoch reset time, requiring an Admin API key for access.

Fixes and Improvements

Image & Video:

Fixed issues with image handling in video generation to support start and end images
Fixed pricing calculations for image-to-video models by adding a fixed input cost field
Fixed visual effects issues with image-to-video generation, including problems with grey backgrounds
Fixed image copy protection by adding a custom 'Copy' action that intercepts the context menu
Fixed video gallery layout to display videos in a consistent 2-column grid
Improved video studio model matching for better accuracy
Improved video display quality and playback behavior
Enhanced video generation guidance with video-specific prompts
Improved PWA video generation reliability
Adjusted video player layout for better viewing experience

Chat & UI:

Fixed post modal scrolling issues
Improved conversation suggestions visibility
Improved action menu behavior during inference
Updated ChatOutputItem renderer for better display
Improved icon consistency across the app

Wallet & Payments:

Fixed release and pricing issues
Improved insufficient credits error handling with clearer messages
Improved payment method UX with better displays
Added auto top-up indicator for better visibility

Mobile:

Improved API page mobile responsiveness
Improved image handling on mobile
Improved mobile update toast to fit better on smaller screens
Prevented screen lock during idle tasks like voice recording

Data & Backend:

Improved VIS error tracking for better debugging
Enhanced event loop monitoring to identify performance issues
Added memory leak prevention rules to streaming handlers
Added GC pause monitoring for performance optimization
Improved Redis metrics and lock tracking
Added 30 second statement timeout for database queries

User Experience:

Updated video playback behavior for smoother experience
Made prompt collections collapsible for cleaner interface
Improved grid layout responsiveness across screen sizes
Added visual feedback to action buttons
Prevented blob URL copying for better security

January 28th, 2026

Venice.ai Change Log - December 25th, 2025 - January 27th, 2026

Memoria: Your Private AI Memory

Venice introduces Memoria — a revolutionary local-first memory system that gives Venice the ability to remember context across your conversations while keeping your data completely private.

Unlike cloud-based memory systems, Memoria stores everything directly in your browser using an advanced in-browser vector database powered by FAISS (Fast Library for Approximate Nearest Neighbors). This means:

Complete Privacy: Your memories never leave your device — no server storage, ever
Intelligent Recall: Venice can reference past conversations, your preferences, documents you have shared, and important context
Seamless Experience: Memory works automatically in the background as you chat
Full Control: Manage your memory documents in the new Chat Memory tab, toggle extraction on/off, and delete anything at any time
Multi-File Upload: Upload multiple documents to Memory at once

Memoria is now enabled by default for all Pro users. Access your memory settings by clicking the memory icon in chat or visiting the Chat Memory tab in settings.

New Models

We have added several powerful new models to the Venice platform:

Text Models

Kimi K2.5 — Powerful reasoning model from Moonshot AI with excellent multi-turn conversation capabilities, now available to all users
Claude Sonnet 4.5 — Anthropic's newest balanced model, available to all users
Qwen 3 VL 235B — Advanced vision-language model with 235 billion parameters (Pro users)
Gemini 3 Flash Preview — Google's latest fast inference model, available to all users
GPT-5.2 Codex — OpenAI's latest coding-focused model (API-only)

Video Models

LTX V2 — New video generation models with improved quality (all 4 variants available)
Wan 2.6 Flash — Fast image-to-video generation

Image Models

ImagineArt 1.5 Pro — High-quality image generation with excellent prompt adherence
Flux 2 Max — Black Forest Labs' flagship model with exceptional photorealism and detail
GPT Image 1.5 — OpenAI's advanced image generation with improved text rendering

Model Upgrades

Veo 3.1 — Now supports 4K resolution output
Qwen-Edit — Now the default image editing model with improved multi-edit support
Nano Banana Pro — Now supports 1K, 2K, and 4K resolutions

App

New Features:

Background Removal — Remove backgrounds from any image with one click (Pro users)
New Edit Image Models – Now supporting Nano Banana Pro, GPT Image 1.5, Flux 2 Max, Seedream 4.5, and Qwen-Edit
Combine Images — Merge multiple images together with AI-powered blending, with model selection dropdown
Transparency Indicator — Checkered background shows transparent areas in images
Character Selfie Mode — Generate images of your custom characters (Pro users)
Vision Routing — Vision requests now route to Qwen3 VL 235B in the background
Character Visualization Button — Quick access to visualize characters
Import Character Conversations — Bring your character chat history into Venice
Audio Input for Video — Add audio tracks when generating videos (for supported models)
Venice Voice Pause — Pause and resume text-to-speech playback
Web Scrape Toggle — Control URL scraping behavior in Auto mode with smarter URL accessibility checking
Video Completion Notifications — Browser notifications when your video generation finishes
Drag-and-Drop Visual Feedback — Clear overlay when dragging files into chat
Download As Menu — Choose your preferred format when downloading images
Announcement Toasts — New toast notifications for feature updates
Past-Due Subscription Visibility — Users can now see and manage past-due subscription status
Tooltip for Disabled Input — Clear messaging when chat input is disabled and why
Video Duration Detection — System detects and displays video duration before processing
Highlight to Chat — Select assistant message content and add it directly to chat input
Search Provider Switching — Switch between Brave and Google for web search
Video Start/End Frame — New UI for video transition models
Auto Prompt Enhancer — Now available in regular image mode, not just simple mode
Edit Variations — Select 1-4 variations in the edit (inpaint) modal
Model Variant Tooltip — Tooltip shows which model is selected in variant selector
Streaming Indicator — Visual … indicator when waiting for LLM chunks
Social Feed Improvements — Beautiful new grid layout for browsing community creations
- Video Autoplay — Videos now autoplay as you scroll through the Social Feed
- Video Thumbnails — Preview videos directly in the Social Feed before clicking
- 2-Column Mobile Grid — Better layout for browsing the feed on mobile devices

Wallet and Payments:

Instant Crypto Payments — Crypto payments now credit instantly to your account
Wallet Connect Upgrade — Upgraded to latest WalletConnect packages for more seamless sign-in
Crypto to Credit Card Switch — Crypto subscribers can now switch to credit card billing
Auto Top-up — Automatically add credits when your balance is low (Pro users)
Stripe Graceful Degradation — UI gracefully handles when browser blocks Stripe

Performance:

Sidebar virtualization for faster performance with many conversations
Improved streaming with pause button and visual feedback
Context filtering optimization for better KV cache hit rates
Removed artificial 2.2 second loading wait time
Improved API page mobile responsiveness
Cached input tokens now used in web app inference
Next.js 16 upgrade — builds reduced from 4.5 to 2.5 minutes

Mobile App

Venice Voice — New text-to-speech functionality that adapts to your language
Native Markdown Renderer — Better text formatting throughout the app
Conversations Grouped by Date — Chat history organized chronologically for easier browsing
Image-to-Video — Turn any image into a video directly from mobile with improved flow
New Settings Screen — Redesigned settings with better organization
Image Picker for Combine — Add images to combine directly from your library
Token Usage Tracking — See your token usage during conversations
Rate Limit Notice — Clear messaging when you hit rate limits
iOS Share Extension — Share images directly to Venice.ai from your iOS photo library
Improved Model Selection UI — New model badges and improved selection interface
Empty State Improvements — Better handling with keyboard dismissal
Android APK 1.8.0 — New version available for direct install

API

New API Dashboard — Track your usage by model and API key with detailed breakdowns
Nano Banana Pro Resolutions — Now supports 2K and 4K resolutions via API
SST Endpoint — Speech-to-text endpoint now available for API users
New Model API Fields — Added privacy, description, betaModel, and deprecation.date to models endpoint
API Key Prefixes — Keys now prefixed with VENICE-ADMIN-KEY- or VENICE-INFERENCE-KEY- to distinguish types
API Key Editing — Support for editing existing API keys
Cache Token Pricing — Optimized pricing for GLM 4.7 and models with input caching
Claude Cache Write Tokens — Charged at 1.25x input rate
Search/Scrape Pricing — Flattened to $10/k for all models
Video Rate Limits — Updated to 40 RPM for API with tier-based limits for UI
Developer Role Support — Added "developer" role in chat completions schema for GPT-5.2 Codex compatibility
Video Start/End Frame — Support for video transition models via API
Insight Extraction API — New endpoint for extracting insights
Model Suggestions on Error — When a model is not found, similar available models are suggested

Fixes

Wallet & Payments:

Fixed wallet QR code scanning issues for Base Wallet and Coinbase Wallet
Fixed WalletConnect signing errors
Fixed missing fonts in WalletConnect modal
Improved wallet disconnect consistency
Fixed duplicate auto top-up charges
Fixed Pro user chat rate limit bypass

Browser Compatibility:

Fixed Safari CodeMirror compatibility issues
Fixed ad-blocker handling for Stripe payments
Fixed Chrome Chromebook typing issues

Image & Video:

Fixed image variant preservation during editing
Fixed image EXIF preservation when downloading
Fixed character chat model selection errors
Fixed video upload error handling
Fixed photo viewer layering issues
Fixed duplicate share button appearing
Fixed multi-image switching bug
Fixed inpaint variations button visibility
Fixed video credits not showing
Fixed invalid stored video settings reset
Fixed image dimension validation in InpaintingModal

Chat & UI:

Fixed mobile tooltip display issues
Fixed LaTeX rendering (math expressions like 2^2=4 with citations, double ^^ issues)
Fixed streaming empty response when reasoning disabled
Fixed chat messages with line breaks displaying as single line
Fixed prompt input not clearing after submission
Fixed "The selected model is temporarily offline" error display
Fixed hidden close button in video summary
Fixed divider reasoning spacing
Fixed text color (text.muted to text.subtle)
Fixed hover tabs causing premature close in model selector
Fixed action panel delay and click behavior
Fixed share conversation for character chats
Fixed privacy warning flashing and checkbox color
Fixed chat input when video generation active
Fixed new chat button not working
Fixed chat history shared among different chats
Fixed enhance message undefined check

Data & Backend:

Fixed database connection errors
Fixed network timeout handling
Fixed tag injection for Deepseek 3.2 and Kimi K2
Fixed billing history transaction time (+5 min offset)
Fixed Claude assistant message error with tool_calls
Fixed video quote requirements/schema
Fixed prompt enhance V3 rewriting issues
Fixed model router back to Venice Uncensored for proper routing
Replaced keyv-upstash with @keyv/redis for cache-utils

User Experience:

Fixed download as image format
Fixed overflow/viewport height issues (using dvh)
Fixed default settings route
Fixed public character cache on moderation
Fixed burn chart data
Updated burn history with etherscan and historical fiat prices
Padded burn history to show minimum 12 months
Removed exclamation marks on welcome modal
Updated model selector hover state handling
Added search icon while searching
Added missing featured flag for public characters
Added temporary badge to temporary character chats
Improved batch deletion UX

December 24th, 2025

Venice.ai Change Log November 25th - December 23rd

Venice Now Hosts All Leading Frontier Models

We've significantly expanded our model library to include the latest and most powerful frontier AI models available. For text, we've added Grok 4.1 Fast with reasoning and vision, Claude Opus 4.5, Gemini 3 Pro Preview, Kimi K2, GPT-5.2, and Deepseek 3.2. For video, we've launched LTX 2 Video, Kling 2.6, and the new Longcat video models. For image generation, GPT-Image-1.5 is now available for alpha users.

These models use credits, and requests through these models are anonymized. Venice is now your one-stop-shop for accessing cutting-edge AI.

App

Fixed a bug where upscaling or enhancing an image at 1x would switch the conversation type to "image". The conversation type now remains consistent.
The "Use original prompt" option for upscaling is now correctly capped at the maximum prompt length for the enhance feature.
Added the ability to mute users directly in the Social Feed.
The "Auto" model is now hidden from the model switcher when you are in the video category.
Fixed an issue where credit estimates were not displaying correctly on small screens.
Added resolution selection to the NanoBananaPro model.
Added an overflow scroll to the memory documents section to prevent UI issues.
Moderated posts now appear at the top of the Social Feed for visibility.
Added a tooltip to the locked label on the token dashboard for better clarity.
Fixed an overflow issue on the preferences modal for guest users on the profile page.
Added feedback buttons to the Support Bot responses.
Hid the mature filter in preferences for non-Pro users.
Improved system-prompt token calculation for more accurate usage reporting.
Fixed a bug in multimodal regen.
Swapped user-facing "Beta" tags to "Alpha" or "Alpha Tester" for consistency.
Added estimated pricing display for Pay Per Use LLM models in the model selector.
Added drag-and-drop image upload functionality to the edit, combine, and upscale modals.
Replaced all instances of "points" with "Venice Credits" in the user profile and settings.
Relaxed schema validation for chat completions to allow unknown fields, improving compatibility with third-party clients.
Updated the chat input loading spinner and image loading animation for a smoother experience.
Added a way to feature characters and updated the default sort on the public characters page to "Featured".
Fixed duplicate entries on the "Browse Characters" page.
Fixed the NSFW overlay regeneration logic.
Updated search provider labels for clarity.
Added resolution and websearch information to the image details view.
Fixed style preset settings for models that do not support them.
Added a message to the Support bot for non-Venice related questions.
Added a copy button for markdown tables.
Fixed buttons in the row component from shrinking incorrectly.

Mobile App

Added 3 new languages (using Google Translate for now).
Mobile credit purchases now redirect back to the native app after completion.
Fixed tooltips getting stuck on Android devices.

Models

Promoted nano banana pro to the beta API.
Increased the price of Nano Banana Pro.
Launched the new LTX 2 Video model.
Added the new Grok 4.1 Fast model to all users and the API, with reasoning and vision enabled.
Added new Pay Per Use models: Gemini 3 Pro Preview and Kimi K2.
Upgraded Seedream to version 4.5.
Launched Kling 2.6 publicly.
Launched Deepseek 3.2 as a live PRO model, with API support and reasoning enabled.
Added new Longcat video models.
Added the new Claude Opus 4.5 model.
Made GPT-Image-1.5 live for alpha users.
Removed DeepSeek R1 (beta) from the API.
Began the two-step process of retiring the Venice-hosted Qwen235B model.
Added the new GPT-5.2 model.
Removed Devstral 2 from the API and Kandinsky 5 from the app.
Enabled reasoning for Deepseek 3.2 again after provider fixes.

API

Added Fal image generations to the API.
Updated API documentation for FAL model configurations.
Added support for the reasoning_effort parameter.
Added support for the cache_control parameter.
Added support for prompt_cache_key in the LLM schema to improve cache hits.
Preserved reasoning_details blocks for advanced tool calling and reasoning.
Fixed API Dashboard usage display to handle negative values correctly.
Added support for analysing audio and video files for Gemini 3 Pro/Flash and other compatible models.
Fixed API KEY consumption, which was incorrectly limiting some users to 50% of their available Diem balance.
Added consumable balance to video API request headers.
Added support for Anthropic-format tool configurations and the tool_use/tool_result message format.
Added cache token tracking and discounted billing for models that support context caching, including Anthropic models.

Token

Fixed the diem_global_utilization view to include refund entries.
Busted the Diem usage chart cache to reflect corrected data.
Added a new "burn" section to the token page.

Fixes and Improvements

Fixed a bug where prompts longer than the enhance model's limit would cause errors.
Fixed credit estimates display on small screens.
Fixed incorrect display of the "Reasoning" tag in text details.
Fixed an image aspect ratio issue in multimodal chats.
Fixed resolution examples to use "1K", "2K", and "4K" labels.
Fixed social user name validation to allow hyphens.
Fixed FAL image model pricing.
Fixed regen for character images.
Fixed aspect ratio issues in selfie mode and the image loader.
Fixed a bug where the info and feedback buttons were hidden in grid view.