https://ift.tt/VqkrBEO
from Techmeme https://ift.tt/jx0Q1Pd
Damir Yalalov / Metaverse Post:
Microsoft unveils text-to-speech AI model VALL-E, which was trained on English speech data and can simulate a person's voice with three seconds of sample audio — IN BRIEF — With just a three-second sample of any voice, the transformer-based TTS model VALL-E can produce speech in every voice.
from Techmeme https://ift.tt/jx0Q1Pd
No comments:
Post a Comment