Google Expands Vertex AI with Chirp 3 for Advanced Speech-to-Text and Text-to-Speech

google-Chirp3
0
0

Generative AI has primarily focused on text-based applications for generating written content, images, and more. However, the next big wave appears to be voice AI, and it’s rapidly advancing. In a major move, Google has announced the integration of Chirp 3, its latest speech-to-text and HD text-to-speech model, into Vertex AI, starting next week.

Chirp 3 Brings Enhanced Voice Capabilities

Google has been quietly expanding Chirp 3’s capabilities, introducing eight new voices across 31 languages in a recent rollout. The model is designed for a wide range of applications, including:

  • Voice assistants
  • Audiobook narration
  • Customer support agents
  • Voice-overs for video content

The announcement was made at an event held at Google DeepMind’s headquarters in London.

AI Voice Technology Gains Momentum

Google’s advancements in voice AI come at a time when other companies are also making rapid progress. Sesame, the startup behind the viral AI voices “Maya” and “Miles,” recently introduced a new model allowing developers to create custom voice applications.

Meanwhile, ElevenLabs, a major player in AI voice technology, has secured significant funding to expand its AI-powered speech services.

Google’s Focus on Responsible AI Development

While the launch of Chirp 3 marks a significant milestone, Google is taking a cautious approach to prevent potential misuse. Google Cloud CEO Thomas Kurian confirmed that the company is actively working with its safety team to establish appropriate usage guidelines.

Chirp 3 joins Google’s expanding suite of AI tools, including:

  • Gemini, its flagship large language model (LLM)
  • Imagen, its image-generation model
  • Veo 2, its premium video-generation tool

However, it remains to be seen whether Chirp 3 can match the ultra-realistic voice synthesis of other AI models, such as Sesame’s work. DeepMind CEO Demis Hassabis emphasized that AI advancements remain a long-term journey rather than an immediate transformation.

“AI won’t be a silver bullet for everything in the next couple of years. We’re still some years away from achieving AGI. Change will happen gradually over the next decade,” Hassabis stated.

Google’s Expanding AI Ecosystem

Launched in 2021, Vertex AI was originally designed as a cloud-based platform for machine learning development. However, with the surge of interest in generative AI—sparked by OpenAI’s GPT models—Google has increasingly positioned Vertex AI as a key competitor to similar offerings from Microsoft and Amazon.

Beyond generative AI tools, developers can use Vertex AI for data classification, model training, and production deployment. Whether Google will eventually integrate third-party AI models into the platform remains an open question.

The Evolution of Google’s “Chirp” AI

Google’s Chirp voice technology has been in development for years, with early iterations serving as an internal project to compete with Amazon’s Alexa. Now, with Chirp 3’s integration into Vertex AI, Google is making a clear push to establish itself as a leader in AI-powered voice solutions.

As voice AI gains traction, Google’s latest move signals a major shift towards multimodal AI, where text, images, and voice interact seamlessly, paving the way for more immersive and natural AI experiences.