Meta comes in guns blazing with SeamlessM4T!

In its effort to develop cutting-edge AI that works across many languages, Meta has created SeamlessM4T, a model that can translate and transcribe nearly 100 languages.

Meta says SeamlessM4T could change the lives of millions with its ability to translate across both speech and text: speech to speech, speech to text, text to speech, and text to text!

In some ways, SeamlessM4T is the spiritual successor to two earlier Meta projects: Universal Speech Translator, one of the few direct speech-to-speech translation systems to support Hokkien, and No Language Left Behind, a text-to-text machine translation model. It also builds on Meta’s Massively Multilingual Speech framework, which provides speech recognition, language identification, and speech synthesis across more than 1,100 languages.

Meta built the training dataset for SeamlessM4T, dubbed SeamlessAlign, from text and speech scraped from the web. By aligning 443,000 hours of speech with text and creating 29,000 hours of “speech-to-speech” alignments, the researchers “taught” SeamlessM4T to transcribe speech to text, translate text, generate speech from text, and even translate words spoken in one language into words spoken in another.


Source: Meta

