News

VALL-E is a neural codec language model using discrete codes derived from an off-the-shelf neural audio codec model, and regard TTS as a conditional language modeling task rather. VALL-E emerges ...
It's the kind of back-and-forth that probably wasn't meant to cause a stir — but online, things rarely stay that simple. A casual comment from Maya Jama during a chat with Chunkz on his podcast has ...