Character's Gallery and Dynamic Story Imaging

Characters and role-play scenarios have been popular since LLMs came out. One feature associated with them, though often not well implemented, is character and story galleries. I'd like to see the Characters area get these updates in the future, along with others I haven't thought of. I'm basing my suggestions on character cards whose authors include links to galleries, and on features available in SillyTavern.

These galleries are used in many ways, from providing backgrounds that the LLM changes based on context to simply letting the author include multiple images of a character, several characters, items, locations, etc.

Allow galleries to be included in a character profile, essentially allowing more images to be uploaded to the character by the author or the user.

Allow image generation with a button click that triggers a set prompt to generate a contextually appropriate background, a character portrait, or the current story scene. For example, the button could have the LLM write an image generation prompt using instructions like "create a portrait image of user's character (often denoted as {{user}}) in the current scenario using recent context". The generated prompt is then either sent directly for image generation or shown to the user for editing.
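To make the button idea a bit more concrete, here is a rough Python sketch of how the request sent to the LLM might be assembled. Everything in it is an assumption for illustration: the template wording, the `TEMPLATES` dictionary, the `build_prompt_request` function and the `max_messages` setting are hypothetical names, not anything Venice or SillyTavern actually exposes.

```python
# Hypothetical instruction templates, one per button. The wording is illustrative only.
TEMPLATES = {
    "portrait": "Create a portrait image prompt of {{user}} in the current scenario using recent context.",
    "background": "Create an image prompt for a background that fits the current scene.",
    "scene": "Create an image prompt depicting the current story scenario.",
}


def build_prompt_request(kind, user_name, recent_messages, max_messages=6):
    """Build the text that asks the LLM to write an image generation prompt."""
    # Swap the {{user}} placeholder for the user's character name.
    instruction = TEMPLATES[kind].replace("{{user}}", user_name)
    # Only the most recent messages are included as context.
    context = "\n".join(recent_messages[-max_messages:])
    return f"{instruction}\n\nRecent context:\n{context}"


request = build_prompt_request(
    "portrait",
    "Mira",
    ["Mira steps into the tavern.", "Rain drips from her cloak."],
)
print(request)
```

The LLM's reply to this request would be the image prompt itself, which could then go straight to the image model or into an editable text box first.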

If a model supports image analysis, it could describe an existing image of a character persona/avatar and use that description to generate more images of that character. Alternatively, a specific image could be provided for adaptation into a newly generated image.

Background switching would run as a background process in which an LLM is given the story context and picks a gallery image to use as the background. These would either be pre-existing background images, provided by Venice or the user and given appropriate names, or newly generated backgrounds based on context if the user actively enables that option. Background tags could also be used to note time of day, location, etc.
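As a rough illustration of the selection step, here is a Python sketch that picks a background by tag overlap and only falls back to generation when the user has opted in. It assumes the LLM has already turned the recent context into a set of scene tags. The `Background` class, the `pick_background` function and the `allow_generation` flag are hypothetical names made up for this example.

```python
from dataclasses import dataclass, field


@dataclass
class Background:
    name: str
    tags: set = field(default_factory=set)  # e.g. {"tavern", "night"}


def pick_background(scene_tags, gallery, allow_generation=False):
    """Return ("use", background), ("generate", None) or ("keep", None)."""
    best, best_score = None, 0
    for bg in gallery:
        # Score each image by how many tags it shares with the scene.
        score = len(bg.tags & scene_tags)
        if score > best_score:
            best, best_score = bg, score
    if best is not None:
        return ("use", best)
    # Nothing in the gallery matched: generate only if the user enabled it.
    if allow_generation:
        return ("generate", None)
    return ("keep", None)


gallery = [
    Background("forest_day", {"forest", "day"}),
    Background("tavern_night", {"tavern", "night", "indoor"}),
]
action, chosen = pick_background({"tavern", "night"}, gallery)
print(action, chosen.name if chosen else None)
```

Keeping the current background when nothing matches avoids swapping in an unrelated image, and it means generation costs are only incurred when the user has asked for them.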

Gallery images could be tagged so the LLM can find them more easily, rather than relying only on appropriate names. If storage is an issue, the images could be kept in WebP format, or galleries could have an option to be linked via URL.

Allow the user to adjust these pre-existing prompts in settings, and to set rules on when images are generated. For example, "generate an image of character every 3 responses".
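A rule like the one above could be stored as a small settings record and checked after each reply. This is only a sketch under my own assumptions: `GenerationRule`, `should_generate` and the idea of counting responses from 1 are hypothetical, not an existing Venice setting.

```python
from dataclasses import dataclass


@dataclass
class GenerationRule:
    subject: str            # what to generate, e.g. "character"
    every_n_responses: int  # generate after every Nth response


def should_generate(rule, response_count):
    """Return True when the rule fires for this response (counted from 1)."""
    if rule.every_n_responses <= 0:
        return False  # a non-positive interval turns the rule off
    return response_count > 0 and response_count % rule.every_n_responses == 0


rule = GenerationRule(subject="character", every_n_responses=3)
fired = [n for n in range(1, 8) if should_generate(rule, n)]
print(fired)  # [3, 6]
```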

Character/user avatars could potentially work the same way: if the gallery includes avatar images with different expressions, then the LLM could switch between them based on story context as well.

A transparency setting that lets users control the transparency of the UI would also help them get the most out of backgrounds.

Font style and size adjustments would allow users to tailor the text to their screen and to any impairments or disabilities.

Some of these features could be made Pro.

Venice.ai
