Current directory: /home3/bjinbymy/public_html/indianext/wp-content/mu-plugins AI Is Used By India To Record Its 121 Languages - AI Next
Indianext
No Result
View All Result
Subscribe
  • News
    • Project Watch
    • Policy
  • AI Next
  • People
    • Interviews
    • Profiles
  • Companies
  • Make In India
    • Solutions
    • State News
  • About Us
    • Editors Corner
    • Mission
    • Contact Us
    • Work Culture
  • Events
  • Guest post
  • News
    • Project Watch
    • Policy
  • AI Next
  • People
    • Interviews
    • Profiles
  • Companies
  • Make In India
    • Solutions
    • State News
  • About Us
    • Editors Corner
    • Mission
    • Contact Us
    • Work Culture
  • Events
  • Guest post
No Result
View All Result
Latest News on AI, Healthcare & Energy updates in India
No Result
View All Result
Home AI Next

AI Is Used By India To Record Its 121 Languages

December 5, 2023
Artificial-Intelligence

As part of a project to create the nation’s first AI-based chatbot for tuberculosis, residents of the Indian state of Karnataka read aloud dozens of words in their native Kannada language into an app over a few weeks in 2023.

Kannada is one of the 22 official languages of India, where there are over 40 million native speakers. It is also one of over 121 languages spoken by 10,000 or more people in the most populous country in the world.

However, natural language processing, the area of artificial intelligence that makes it possible for computers to comprehend spoken and written language, does not encompass many of these languages.

As a result, hundreds of millions of Indians are shut out of numerous economic opportunities and helpful information.

According to Ms. Kalika Bali, lead researcher at Microsoft Research India, “AI tools need to cater to people who don’t speak English, French, or Spanish in order to work for everyone.”

However, we would have to wait an additional ten years if we had to gather all the data in Indian languages needed for a big language model like GPT. Hence, we may build layers on top of generative AI models, like Llama or ChatGPT,” Ms. Bali said in a statement to the Thomson Reuters Foundation.

The Karnataka peasants are among the thousands of speakers of various Indian languages who provide voice data to the tech company Karya. Karya creates datasets that companies like Microsoft and Google utilize in their artificial intelligence models for healthcare, education, and other services.

With Bhashini, an AI-driven language translation system that generates open source datasets in regional languages for the development of AI tools, the Indian government, which seeks to provide more services digitally, is also producing language datasets.

Through the platform’s crowdsourcing project, users can check audio or text transcriptions made by others, translate texts, label photographs, and submit words in a variety of languages. For Bhashini, tens of thousands of Indians have donated.

According to Mr. Pushpak Bhattacharyya, head of the Computation for Indian Language Technology Lab in Mumbai, “the government is pushing very strongly to create datasets to train large language models in Indian languages, and these are already in use in translation tools for education, tourism, and in the courts.”

However, there are other obstacles, including the oral legacy of most Indian languages, the scarcity of computerized data, and the prevalence of code mixing. Additionally, gathering data in less widely spoken languages is challenging and calls for extra work.

Financial worth
Less than 100 of the more than 7,000 live languages in the world are represented in significant databases, with English being the most developed.

A surge of interest in generative AI was sparked by the 2022 release of ChatGPT. English is ChatGPT’s primary training language. Only three of the nine languages that Amazon’s Alexa can speak are non-European: Arabic, Hindi, and Japanese. Google’s Bard can only speak English.

Start-ups and governments are attempting to close this gap. A new huge language model named Jais can enable generative AI applications in Arabic in the United Arab Emirates, while grassroots organization Masakhane seeks to advance research on natural language processes in African languages.

Crowdsourcing is a useful tool for gathering voice and language data in a nation like India, according to Ms. Bali, who was listed by Time magazine in September as one of the 100 most influential people in AI.

Language, cultural, and socioeconomic subtleties are also better captured through crowdsourcing, according to Ms. Bali.

However, she added, “it has to be done ethically, by paying the workers, educating them, and making a specific effort to collect smaller languages. There also needs to be awareness of gender, ethnic, and socioeconomic bias.” “If not, it doesn’t scale.”

According to Safiya Husain, co-founder of Karya, there is a need for languages that “we haven’t even heard of” due to the quick development of AI, especially from academics who want to preserve them.

In order to find workers who are below the poverty line or have an annual income of less than US$325 (S$433), Karya collaborates with non-profit organizations. In exchange, the workers are paid approximately US$5 (S$6.70) per hour, which is significantly more than the Indian minimum wage.

Employees may receive royalties for a portion of the data they produce, and Ms. Husain stated that this data may be used to create AI solutions for the community in industries like farming and healthcare.

Village voice
Out of the 1.4 billion people in India, less than 11% speak English. Since many people find it difficult to read and write, many AI models concentrate on speech and speech recognition.

About a million Indians’ speech data is being gathered by Google-funded Project Vaani, or voice, and made publicly available for use in speech-to-speech and automatic speech recognition systems.

The Supreme Courts of Bangladesh and India both use AI-based translation tools from Bengaluru-based EkStep Foundation, while the government-backed AI4Bharat center has introduced Jugalbandi, an AI-based chatbot that can respond to queries on welfare programs in many Indian languages.

The AI4Bharat and Microsoft reasoning models are employed by the bot, which is called after a duet in which two artists riff off one another. The bot can be accessed on WhatsApp, which is used by roughly 500 million people in India.

In addition to working with farmers, the social venture Gram Vaani, also known as voice of the village, employs AI-powered chatbots to answer inquiries about welfare benefits.

According to Mr. Shubhmoy Kumar Garg, a product lead of Gram Vaani, “automatic speech recognition technologies are helping to mitigate language barriers and provide outreach at the grassroots level.”

“They will support the empowerment of communities that most need it.”

The rising need for speech data in her home Odia has also given Ms. Swarnalata Nayak in the Raghurajpur district of Odisha a much-needed boost in revenue from her job for Karya.

“I work on it at night when I have free time. I can converse on the phone and provide for my family,” she remarked.

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Editors Corner

How can Artificial Intelligence tools be a blessing for recruiters?

Will Artificial Intelligence ever match human intelligence?

Artificial Intelligence: Features of peer-to-peer networking

What not to share or ask on Chatgpt?

How can Machine Learning help in detecting and eliminating poverty?

How can Artificial Intelligence help in treating Autism?

Speech Recognition and its Wonders in your corporate life

Most groundbreaking Artificial Intelligence-based gadgets to vouch for in 2023

Recommended News

AI Next

Google: AI From All Perspectives

Alphabet subsidiary Google may have been slower than OpenAI to make its AI capabilities publicly available in the past, but...

by India Next
May 31, 2024
AI Next

US And UK Doctors Think Pfizer Is Setting The Standard For AI And Machine Learning In Drug Discovery

New research from Bryter, which involved over 200 doctors from the US and the UK, including neurologists, hematologists, and oncologists,...

by India Next
May 31, 2024
Solutions

An Agreement Is Signed By MEA, MeitY, And CSC To Offer E-Migration Services Via Shared Service Centers

Three government agencies joined forces to form a synergy in order to deliver eMigrate services through Common Services Centers (CSCs)...

by India Next
May 31, 2024
AI Next

PR Handbook For AI Startups: How To Avoid Traps And Succeed In A Crowded Field

The advent of artificial intelligence has significantly changed the landscape of entrepreneurship. The figures say it all. Global AI startups...

by India Next
May 31, 2024

Related Posts

Google
AI Next

Google: AI From All Perspectives

May 31, 2024
Pfizer
AI Next

US And UK Doctors Think Pfizer Is Setting The Standard For AI And Machine Learning In Drug Discovery

May 31, 2024
Artificial-Intelligence
AI Next

PR Handbook For AI Startups: How To Avoid Traps And Succeed In A Crowded Field

May 31, 2024
openai
AI Next

OpenAI Creates An AI Safety Committee Following Significant Departures

May 31, 2024
Load More
Next Post
data-science

Ten Smartphone Apps For Fans Of Data Science And Data Analysis

IndiaNext Logo
IndiaNext Brings you latest news on artificial intelligence, Healthcare & Energy sector from all top sources in India and across the world.

Recent Posts

Google: AI From All Perspectives

US And UK Doctors Think Pfizer Is Setting The Standard For AI And Machine Learning In Drug Discovery

An Agreement Is Signed By MEA, MeitY, And CSC To Offer E-Migration Services Via Shared Service Centers

PR Handbook For AI Startups: How To Avoid Traps And Succeed In A Crowded Field

OpenAI Creates An AI Safety Committee Following Significant Departures

Tags

  • AI
  • EV
  • Mental WellBeing
  • Clean Energy
  • TeleMedicine
  • Healthcare
  • Electric Vehicles
  • Artificial Intelligence
  • Chatbots
  • Data Science
  • Electric Vehicles
  • Energy Storage
  • Machine Learning
  • Renewable Energy
  • Green Energy
  • Solar Energy
  • Solar Power

Follow us

  • Facebook
  • Linkedin
  • Twitter
© India Next. All Rights Reserved.     |     Privacy Policy      |      Web Design & Digital Marketing by Heeren Tanna
No Result
View All Result
  • About Us
  • Activate
  • Activity
  • Advisory Council
  • Archive
  • Career Page
  • Companies
  • Contact Us
  • cryptodemo
  • Energy next
  • Energy Next Archive
  • Home
  • Interviews
  • Make in India
  • Market
  • Members
  • Mission
  • News
  • News Update
  • People
  • Policy
  • Privacy Policy
  • Register
  • Reports
  • Subscription Page
  • Technology
  • Top 10
  • Videos
  • White Papers
  • Work Culture
  • Write For Us

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In

Add New Playlist

IndiaNext Logo

Join Our Newsletter

Get daily access to news updates

no spam, we hate it more than you!