"Having lifelike text-to-speech requires models to operate with very low latency and very high quality. We chose Baseten as our preferred inference provider for Orpheus TTS because we want our customers to have the best performance possible. Baseten’s Inference Stack allows our customers to create voice applications that sound as close to human as possible."
"I write differently when texting my mom, my wife, and my colleagues. One early challenge was encapsulating this behavior and making it controllable, matching Flow’s output to the user’s context and preferences."