Realtime, emotional and natural sounding Conversational Speech Model

This is the best realtime, natural sounding voice chat with AI that I’ve ever heard. They say its going to be open source too. This would be incredible and gamechanging to have in Venice. You have to check it out!

  • Emotional intelligence: reading and responding to emotional contexts.

  • Conversational dynamics: natural timing, pauses, interruptions and emphasis.

  • Contextual awareness: adjusting tone and style to match the situation.

  • Consistent personality: maintaining a coherent, reliable and appropriate presence.

Really natural sounding, it sometimes mispronounces words, it grows in confidence over time, it has memory for 2 weeks, it responds immediately, has emotion.. Its better than chatGPT’s too.

Try the demo here:

https://www.sesame.com/research/crossing_the_uncanny_valley_of_voice

is there any chance of this if it does go open source?
The devs do promise it’ll all be open source.

Please authenticate to join the conversation.

Upvoters
Status

Backlog

Board
💡

Feature Requests

Date

11 months ago

Author

JaeSwift

Subscribe to post

Get notified by email when there are changes.