Indianext

New Universal Speech Model From Google Outperforms OpenAI Whisper In Tests

March 10, 2023

As part of Google’s effort to support 1,000 languages, its researchers have released an update to their Universal Speech Model (USM). According to the researchers, the model outperforms OpenAI Whisper across the board on automatic speech recognition (ASR).

Researchers can request access to the USM API.

According to the paper, “Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages,” the model’s encoder is pre-trained on a large, unlabeled multilingual dataset and then fine-tuned on a smaller amount of labeled data, which allows it to recognize under-represented languages. The training procedure also adapts effectively to new languages and data.

The researchers demonstrated the effectiveness of the pre-trained encoder by fine-tuning it on multilingual speech data from YouTube Captions. Despite the limited supervised data available from YouTube, the model achieves an average word error rate (WER) of under 30% across 73 languages, which the researchers describe as a milestone. On the 18 languages used for the comparison, USM has, on average, a 32.7% lower relative WER than Whisper (large-v2), which was trained on more than 400k hours of labeled data. The researchers also report that USM outperforms Whisper on automatic speech recognition across the board.
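The two metrics quoted above can be made concrete with a short sketch. WER is the word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words, and a relative WER reduction compares two systems’ WERs. The function names and the example sentences and numbers below are hypothetical illustrations, not values from the paper.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j]: edit distance between the reference words seen so far
    # (before the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words matches an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


def relative_wer_reduction(baseline_wer: float, model_wer: float) -> float:
    """Fraction by which model_wer is lower than baseline_wer."""
    return (baseline_wer - model_wer) / baseline_wer


# Hypothetical example: one substitution ("sat" -> "sit") and one deletion ("the")
# against a six-word reference.
wer = word_error_rate("the cat sat on the mat", "the cat sit on mat")

# Hypothetical WERs chosen only to show what a 32.7% relative reduction looks like.
reduction = relative_wer_reduction(baseline_wer=0.20, model_wer=0.1346)
```

Note that a relative reduction is not a difference in percentage points: in the made-up numbers above, the absolute gap between the two systems is about 6.5 points of WER.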

The 1,000 Languages Initiative was introduced in November 2022 with the goal of developing a machine learning model that supports the 1,000 most spoken languages in the world, for greater inclusivity on a global scale. The main challenge is supporting languages with few speakers or little available data; some of these languages are spoken by fewer than twenty million people.

USM is a family of speech models with two billion parameters, trained on 12 million hours of speech and 28 billion sentences of text spanning more than 300 languages. The models can automatically recognize speech in low-resource languages such as Amharic, Cebuano, Assamese, and Azerbaijani, to name a few. They are used on YouTube for closed captions.

The updated model uses the standard encoder-decoder architecture. The encoder is the Conformer, or convolution-augmented transformer. Its key component is the Conformer block, which consists of attention, feed-forward, and convolutional modules. The encoder sub-samples the input, then applies a series of Conformer blocks and a projection layer to produce the final embeddings.

Training begins with self-supervised learning on speech audio from hundreds of languages. For this stage the researchers use BEST-RQ, which performs well on multilingual tasks and is efficient when working with very large amounts of unlabeled audio data.
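For readers unfamiliar with BEST-RQ, its core idea is a random-projection quantizer: each speech frame is projected by a frozen random matrix, matched to the nearest entry of a frozen random codebook, and the index of that entry becomes the label the encoder learns to predict for masked frames. The sketch below illustrates only that labeling step. The class name, dimensions, and Gaussian initialization are assumptions made for illustration and are not taken from the USM implementation.

```python
import math
import random


def _unit(vec):
    """Scale a vector to unit length (an all-zero vector is returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in vec)) or 1.0
    return [x / norm for x in vec]


class RandomProjectionQuantizer:
    """Frozen random projection plus frozen random codebook; neither is trained."""

    def __init__(self, feature_dim, code_dim, codebook_size, seed=0):
        rng = random.Random(seed)
        # Assumption: Gaussian initialization, chosen here for simplicity.
        self.projection = [
            [rng.gauss(0.0, 1.0) for _ in range(feature_dim)]
            for _ in range(code_dim)
        ]
        self.codebook = [
            _unit([rng.gauss(0.0, 1.0) for _ in range(code_dim)])
            for _ in range(codebook_size)
        ]

    def label(self, frame):
        """Return the index of the codebook entry nearest to the projected frame."""
        projected = _unit(
            [sum(w * x for w, x in zip(row, frame)) for row in self.projection]
        )

        def squared_distance(code):
            return sum((p - c) ** 2 for p, c in zip(projected, code))

        return min(
            range(len(self.codebook)),
            key=lambda k: squared_distance(self.codebook[k]),
        )


# Hypothetical toy input: 5 frames of 4-dimensional features.
rng = random.Random(1)
frames = [[rng.uniform(-1.0, 1.0) for _ in range(4)] for _ in range(5)]
quantizer = RandomProjectionQuantizer(feature_dim=4, code_dim=3, codebook_size=8)
targets = [quantizer.label(frame) for frame in frames]
```

During pre-training, a subset of frames would be masked and the encoder trained to predict the corresponding entries of `targets`; because the quantizer is never updated, the labels stay fixed throughout training.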

In the second, optional stage, the researchers use multi-objective supervised pre-training to incorporate additional text data, which improves the model’s quality and language coverage. Whether this stage is used depends on whether text data is available, although USM performs best with it.

In the final stage, the model is fine-tuned on the downstream tasks. Thanks to pre-training, it performs well with only a small amount of task-specific supervised data.
