Current directory: /home3/bjinbymy/public_html/indianext/wp-content/mu-plugins A Microsoft Research Initiative Promotes The Survival And Growth Of Languages - AI Next
Indianext
No Result
View All Result
Subscribe
  • News
    • Project Watch
    • Policy
  • AI Next
  • People
    • Interviews
    • Profiles
  • Companies
  • Make In India
    • Solutions
    • State News
  • About Us
    • Editors Corner
    • Mission
    • Contact Us
    • Work Culture
  • Events
  • Guest post
  • News
    • Project Watch
    • Policy
  • AI Next
  • People
    • Interviews
    • Profiles
  • Companies
  • Make In India
    • Solutions
    • State News
  • About Us
    • Editors Corner
    • Mission
    • Contact Us
    • Work Culture
  • Events
  • Guest post
No Result
View All Result
Latest News on AI, Healthcare & Energy updates in India
No Result
View All Result
Home AI Next

A Microsoft Research Initiative Promotes The Survival And Growth Of Languages

February 1, 2023
artificial-intelligence

The final survivor of the 65,000-year-old pre-Neolithic society on the Andaman Islands in the Indian Ocean was a woman by the name of Boa Sr. She passed away in 2010, and the Bo language also perished and went extinct.

If that sounds like a singular occurrence, it’s not. Somewhere in the globe, a language is lost every two weeks.

Consider the Mundas, a group of a million or so people who live in the eastern Indian states of Jharkhand, Orissa, and West Bengal.

According to Dr. Meenakshi Munda, a Munda community member and assistant professor in the anthropology department of a university in Ranchi, Jharkhand, “I learned Mundari very late in life as my parents were in another state where they were working, thus we didn’t speak the language at home.” “I recognise how important identity is to a community, and our younger generation is losing that identity due to language barriers.”

The Munda community is worried about the future of their language because children in schools are only exposed to well-known languages like Bengali, Hindi, and Odiya.

Even though Mundari has a written script, it has very little digital content or an online presence, which provides even less motivation for people to invest in learning the language.

At the Microsoft Research (MSR) lab in India, a few researchers have been working on developing digital ecosystems for languages like Mundari that don’t have enough presence online.

According to Kalika Bali of MSR India, “the way I describe my job for myself is that no person in this world should be prohibited from adopting any technology because they speak a different language.”

The branch of linguistics and artificial intelligence (AI) that focuses on teaching computers to comprehend spoken and written languages, Bali is a specialist in natural language processing.

Her team develops the foundational datasets needed to build AI systems for underrepresented languages in collaboration with local groups and native speakers. They intend to produce a dataset that is accurate and culturally relevant by incorporating the community in the data collection procedure.

English has been the primary language of the internet since its inception. Since then, seven other widely spoken languages, including Chinese and Spanish, may partially rival English in terms of technological compatibility due to better internet availability and a desire for material in native languages. However, that only represents eight of the world’s almost 6,000 languages.

This indicates that only 88% of the languages spoken in the world have sufficient online presence. Additionally, it means that 1.2 billion people, or 20% of the world’s population, are unable to utilise their language to interact with the internet.

As a result, “the gap between the haves and the have-nots got fairly obvious,” says Monojit Choudhury, Bali’s colleague and principal data and applied scientist at Microsoft’s Turing India.
Low-resource languages are those, according to the experts, that lack the resources needed to create technology for a digital presence.

Building digital resources has two goals under Project ELLORA— Enabling Low Resource Languages: In addition to ensuring that speakers of these languages may engage and communicate in the digital world, it is also a step toward conserving a language for future generations.

Launched in 2015, Project ELLORA began with the fundamentals. Identifying existing resources, such as printed materials like books and the degree of a digital presence, was the first step. Bali and her coworkers presented a six-tier classification in a 2020 study, with the top tier representing languages with abundant resources, such as English and Spanish, and the bottom tiers indicating languages with few to no resources.

Project ELLORA’s effort involves gathering the necessary materials for these languages and creating language models to satisfy the digital needs of their speakers.

The researchers of Project ELLORA collaborate with the local populations to identify this demand and the foundational technologies that can help to meet it. According to Bali, “No language technology can be detached from the users.”

In order to determine what the Mundari community needs to preserve the language, the researchers in 2018 financed a study in partnership with IIT Kharagpur.

What began as a straightforward word game for schoolchildren to help them learn the language quickly evolved into complex technology undertakings.

The community will have access to additional Mundari content thanks to MSR researchers’ work on a Hindi-to-Mundari text translation and a speech recognition model.

The Deutsche Gesellschaft für Internationale Zusammenarbeit (GIZ), acting on behalf of the German Ministry for Economic Cooperation and Development, is funding a text-to-speech model as part of the “Forward – Artificial Intelligence for all” programme.

However, it is difficult to develop language translation models for a language for which there is little relevant digital resources on which to train machine learning models.

Initially, the team worked with locals to have them manually translate words from Hindi to Mundari. The team was led by professors from IIT Kharagpur.

Interneural Machine Translation (INMT), a new technology created by MSR researchers to expedite translation, aids in word prediction while someone is translating between languages.

“It (INMT) makes it possible for people to translate between languages more successfully. When I begin typing in Mundari when translating from Hindi, it offers me predictive ideas in Mundari. Similar to the predictive text found in smartphone keyboards, except it works in two languages, according to Bali.

They worked with Karya, which began as a research effort by Vivek Seshadri, a principal researcher at MSR, to create the dataset for text to speech. Karya is a digital platform for working that allows users to record, tag, and annotate data in order to create machine learning and AI models.

The translators were given the translated sentences to record for a male Mundari speaker, identified by the team, and Dr. Munda as the female speaker. On Android cellphones, they recorded the sentences using the Karya app.

For the purpose of training text to voice models, the recordings and the associated text are safely uploaded to the cloud.

To build these three technologies for Mundari, Bali explains, “the idea is that between Microsoft Research, Karya, and IIT Kharagpur, we will have data for machine translation, speech recognition, and text-to-speech synthesis.”

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Editors Corner

How can Artificial Intelligence tools be a blessing for recruiters?

Will Artificial Intelligence ever match human intelligence?

Artificial Intelligence: Features of peer-to-peer networking

What not to share or ask on Chatgpt?

How can Machine Learning help in detecting and eliminating poverty?

How can Artificial Intelligence help in treating Autism?

Speech Recognition and its Wonders in your corporate life

Most groundbreaking Artificial Intelligence-based gadgets to vouch for in 2023

Recommended News

AI Next

Google: AI From All Perspectives

Alphabet subsidiary Google may have been slower than OpenAI to make its AI capabilities publicly available in the past, but...

by India Next
May 31, 2024
AI Next

US And UK Doctors Think Pfizer Is Setting The Standard For AI And Machine Learning In Drug Discovery

New research from Bryter, which involved over 200 doctors from the US and the UK, including neurologists, hematologists, and oncologists,...

by India Next
May 31, 2024
Solutions

An Agreement Is Signed By MEA, MeitY, And CSC To Offer E-Migration Services Via Shared Service Centers

Three government agencies joined forces to form a synergy in order to deliver eMigrate services through Common Services Centers (CSCs)...

by India Next
May 31, 2024
AI Next

PR Handbook For AI Startups: How To Avoid Traps And Succeed In A Crowded Field

The advent of artificial intelligence has significantly changed the landscape of entrepreneurship. The figures say it all. Global AI startups...

by India Next
May 31, 2024

Related Posts

Google
AI Next

Google: AI From All Perspectives

May 31, 2024
Pfizer
AI Next

US And UK Doctors Think Pfizer Is Setting The Standard For AI And Machine Learning In Drug Discovery

May 31, 2024
Artificial-Intelligence
AI Next

PR Handbook For AI Startups: How To Avoid Traps And Succeed In A Crowded Field

May 31, 2024
openai
AI Next

OpenAI Creates An AI Safety Committee Following Significant Departures

May 31, 2024
Load More
Next Post
chatGPT

Chatbots Like ChatGPT Are Most Likely To Replace These Positions In The Future

IndiaNext Logo
IndiaNext Brings you latest news on artificial intelligence, Healthcare & Energy sector from all top sources in India and across the world.

Recent Posts

Google: AI From All Perspectives

US And UK Doctors Think Pfizer Is Setting The Standard For AI And Machine Learning In Drug Discovery

An Agreement Is Signed By MEA, MeitY, And CSC To Offer E-Migration Services Via Shared Service Centers

PR Handbook For AI Startups: How To Avoid Traps And Succeed In A Crowded Field

OpenAI Creates An AI Safety Committee Following Significant Departures

Tags

  • AI
  • EV
  • Mental WellBeing
  • Clean Energy
  • TeleMedicine
  • Healthcare
  • Electric Vehicles
  • Artificial Intelligence
  • Chatbots
  • Data Science
  • Electric Vehicles
  • Energy Storage
  • Machine Learning
  • Renewable Energy
  • Green Energy
  • Solar Energy
  • Solar Power

Follow us

  • Facebook
  • Linkedin
  • Twitter
© India Next. All Rights Reserved.     |     Privacy Policy      |      Web Design & Digital Marketing by Heeren Tanna
No Result
View All Result
  • About Us
  • Activate
  • Activity
  • Advisory Council
  • Archive
  • Career Page
  • Companies
  • Contact Us
  • cryptodemo
  • Energy next
  • Energy Next Archive
  • Home
  • Interviews
  • Make in India
  • Market
  • Members
  • Mission
  • News
  • News Update
  • People
  • Policy
  • Privacy Policy
  • Register
  • Reports
  • Subscription Page
  • Technology
  • Top 10
  • Videos
  • White Papers
  • Work Culture
  • Write For Us

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In

Add New Playlist

IndiaNext Logo

Join Our Newsletter

Get daily access to news updates

no spam, we hate it more than you!