Connect with us

News

Google Expands AI Capabilities With 1000 Language Model

Google is getting closer to its aim of building an AI language model that works for 1,000 languages. On Monday, it gave more info about the Universal Speech Model (USM). They consider it a “critical first step” in achieving their plans.

November last year, Google unveiled the 1,000 Languages Initiative; a bold pledge to develop an ML model that caters to the 1,000 most spoken languages in the world, enabling billions of people to experience inclusion.

Google – Universal Speech Model (USM)

Google introduces Universal Speech Model (USM), a breakthrough in supporting 1,000 languages. USM is a family of speech models, leveraging 12 million hours of speech and 28 billion sentences of text from 300+ languages. It enables automatic speech recognition (ASR) for widely-spoken languages like English and Mandarin, as well as under-resourced languages like Amharic, Cebuano, Assamese, and Azerbaijani.

Google USM: Scaling Automatic Speech Recognition beyond 100 Languages explains how pre-training the encoder of the model on a large unlabeled multilingual dataset and fine-tuning on a smaller set of labeled data can recognize under-represented languages. Their model training process is effective in adapting to new languages and data.

Self-supervised learning: Google’s approach to speech recognition combines self-supervised learning with fine-tuning. USM uses the encoder-decoder architecture, where the decoder can be CTC, RNN-T, or LAS. The encoder employs the Conformer, a convolution-augmented utilizes a Conformer block with attention, feed-forward, and convolutional modules.


USM’s overall training pipeline.

This block takes the log-mel spectrogram of the speech signal as input, performs convolutional sub-sampling, and ultimately produces the final embeddings.

Pre-trained encoder enables 300+ languages

Google’s encoder utilizes over 300 languages through pre-training and has been tested on YouTube Caption’s multilingual speech data. This data consists of 73 languages, each with an average of fewer than 3,000 hours of data. Despite the limited amount of supervised data, the model has achieved a WER of less than 30% on average with all 73 languages.


USM supports all 73 languages in the YouTube Captions’ Test Set and outperforms Whisper on the languages it can support with lower than 40% WER. Lower WER is better.

Generalization to downstream ASR tasks

Google reported that their model had lower WER on CORAAL (African American Vernacular English), SpeechStew (en-US), and FLEURS (102 languages) datasets compared to Whisper.


Comparison of USM (with or without in-domain data) and Whisper results on ASR benchmarks. Lower WER is better.

Their model showed lower WER with and without training on in-domain data. The comparison on FLEURS was done on the subset of languages (62) supported by the Whisper model.

Automatic speech translation (AST)

Google states that their fine-tuned USM model on the CoVoST dataset has achieved state-of-the-art quality with minimal supervision. To evaluate its performance, the CoVoST dataset was divided into high, medium and low resource availability segments, and the BLEU score (higher is better) was calculated for each


CoVoST BLEU score. Higher BLEU is better.

Google Translate has recently been upgraded with AI features such as contextual translation and more. USM outperformed Whisper in all segments, as displayed below.

Marching Toward 1,000 languages

Google is striving hard to fulfill its mission of organizing the world’s information and making it universally accessible via its USM development.

Google plans to use the base model architecture and training pipeline of USM as a foundation to expand speech modeling to the next 1000 languages. To learn more – click here.

Recent

entertainment10 hours ago

Bollywood Slammed For Their Silence Wrestlers Protest Shah Rukh Khan Tweeted

Bollywood slammed for their silence wrestlers protest Shah Rukh Khan tweeted for the new Parliament but no support for our...

IPL 2023 final Picture of Ziva Dhoni praying ahead CSK vs GT goes viral netizens say her prayers were heard IPL 2023 final Picture of Ziva Dhoni praying ahead CSK vs GT goes viral netizens say her prayers were heard
entertainment10 hours ago

IPL 2023 Final Picture of Ziva Dhoni Praying Ahead CSK vs GT Goes Viral

IPL 2023 final Picture of Ziva Dhoni praying ahead CSK vs GT goes viral netizens say her prayers were heard...

NEW STATE Mobile gets Revamped Bounty Royale mode 90 FPS all graphics modes and more NEW STATE Mobile gets Revamped Bounty Royale mode 90 FPS all graphics modes and more
News10 hours ago

NEW STATE Mobile gets Revamped Bounty Royale mode 90

NEW STATE Mobile gets Revamped Bounty Royale mode 90 FPS all graphics modes and more KRAFTON has released the May...

Pebble Cosmos Vault performance, Pebble Cosmos Vault price, Pebble Cosmos Vault specs, Pebble Cosmos Vault review, Pebble Cosmos Vault features, Pebble Cosmos Vault release date Pebble Cosmos Vault performance, Pebble Cosmos Vault price, Pebble Cosmos Vault specs, Pebble Cosmos Vault review, Pebble Cosmos Vault features, Pebble Cosmos Vault release date
News10 hours ago

Pebble Cosmos Vault Price And Specifications

Pebble Cosmos Vault 1.43″ AMOLED display, Bluetooth calling launched for Rs. 2,999 Pebble has launched a new Bluetooth calling smartwatch...

Philips Airfryer 5.6L 14 in1 cooking digital window Price And Specifications Philips Airfryer 5.6L 14 in1 cooking digital window Price And Specifications
Accessories10 hours ago

Philips Airfryer 5.6L 14 in1 cooking digital window Price And Specifications

Philips Airfryer 5.6L 14 in1 cooking digital window launched in India Price And Specifications Versuni, a domestic appliance maker, has...

Accessories11 hours ago

Hitachi launches new Yoshi iZen Senpai airHome series ACs in India

Hitachi launches new Yoshi iZen Senpai airHome series ACs in India In India, Hitachi has added 3 new product lines...

vivo S17 And S17 Pro performance, vivo S17 And S17 Pro price, vivo S17 And S17 Pro specs, vivo S17 And S17 Pro review, vivo S17 And S17 Pro features, vivo S17 And S17 Pro release date vivo S17 And S17 Pro performance, vivo S17 And S17 Pro price, vivo S17 And S17 Pro specs, vivo S17 And S17 Pro review, vivo S17 And S17 Pro features, vivo S17 And S17 Pro release date
Latest Devices11 hours ago

vivo S17 And S17 Pro Announced Price And Specifications

vivo S17 And S17 Pro 6.78″ 1.5K 120Hz curved AMOLED screen and vivo TWS Air Pro ANC earbuds to be...

OPPO announces Kaká as brand ambassador UEFA Champions League partnership OPPO announces Kaká as brand ambassador UEFA Champions League partnership
News11 hours ago

OPPO Announces Kaká as Brand Ambassador UEFA Champions League Partnership

OPPO announces Kaká as brand ambassador UEFA Champions League partnership OPPO has announced Ricardo Izecson Dos Santos Leite, better known...

Sara Ali Khan movies,  Sara Ali Khan latest news,  Sara Ali Khan,  Sara Ali Khan upcoming movies,  Sara Ali Khan hairstyle,  Sara Ali Khan net worth, Sara Ali Khan movies,  Sara Ali Khan latest news,  Sara Ali Khan,  Sara Ali Khan upcoming movies,  Sara Ali Khan hairstyle,  Sara Ali Khan net worth,
entertainment23 hours ago

Sara Ali Khan reflects on her memorable debut Cannes 2023

Sara Ali Khan reflects on her memorable debut at Cannes 2023 says I even met Leonardo DiCaprio Bollywood actress Sara...

Ranbir Kapoor movies,Ranbir Kapoor latest news,Ranbir Kapoor wife,Ranbir Kapoor upcoming movies,Ranbir Kapoor hairstyle,Ranbir Kapoor net worth, Ranbir Kapoor movies,Ranbir Kapoor latest news,Ranbir Kapoor wife,Ranbir Kapoor upcoming movies,Ranbir Kapoor hairstyle,Ranbir Kapoor net worth,
entertainment23 hours ago

After Salman Khan opts out Champions Aamir Khan approaches Ranbir Kapoor

After Salman Khan opts out Champions Aamir Khan approaches Ranbir Kapoor Earlier this year in February it was reported that...