Current directory: /home3/bjinbymy/public_html/indianext/wp-content/mu-plugins Interesting ML Algorithms: State–Action–Reward–State–Action, Lasso And Self-Play - AI Next
Indianext
No Result
View All Result
Subscribe
  • News
    • Project Watch
    • Policy
  • AI Next
  • People
    • Interviews
    • Profiles
  • Companies
  • Make In India
    • Solutions
    • State News
  • About Us
    • Editors Corner
    • Mission
    • Contact Us
    • Work Culture
  • Events
  • Guest post
  • News
    • Project Watch
    • Policy
  • AI Next
  • People
    • Interviews
    • Profiles
  • Companies
  • Make In India
    • Solutions
    • State News
  • About Us
    • Editors Corner
    • Mission
    • Contact Us
    • Work Culture
  • Events
  • Guest post
No Result
View All Result
Latest News on AI, Healthcare & Energy updates in India
No Result
View All Result
Home AI Next

Interesting ML Algorithms: State–Action–Reward–State–Action, Lasso And Self-Play

August 27, 2022
AI

Machine learning allows computers to mimic human behaviour by training them with historical and predicted data. This section will examine some interesting machine learning algorithms such as state-action-reward-state-action, Lasso, and Self-play.

State–action–reward–state–action

The state-action-reward-state-action (SARSA) algorithm is a reinforcement learning tool for learning a Markov decision process policy. Rummery and Niranjan put the idea in a technical note under “Modified Connectionist Q-Learning” (MCQ-L). Rich Sutton’s alternate name, SARSA, was only mentioned in a footnote. As an on-policy learning algorithm, a SARSA agent interacts with the environment and updates the policy based on actions taken. For example, an error, adjusted by the learning rate alpha, updates the Q value for a state-action. The Q values represent the potential reward received in the next step for acting state s and the discounted future reward received from the following state-action observation. 

Furthermore, due to the iterative nature of SARSA, an initial condition is implicitly assumed before the first update. The update rule causes any action always to have higher values than the other alternative, increasing their chance of making their choice. This process is known as having a low (infinite) initial value or “optimistic initial conditions” and can encourage exploration. 

Researchers proposed in 2013 that the initial conditions using the first reward, or r. This theory states that the reward is to determine Q’s value. In the case of fixed deterministic rewards, this enables instant learning. In repeated binary choice experiments, the resetting-of-initial-conditions (RIC) strategy appears consistent with human behaviour.

Lasso

Less absolute shrinkage and selection operator (LASSO) is a regression analysis technique used in statistics and machine learning. It performs variable selection and regularization. In addition, it improves the predictability and understandability of the resulting statistical model. Robert Tibshirani, who coined the term, first used it in geophysics and later.

Lasso is for models of linear regression. Its connections to ridge regression, best subset selection, lasso coefficient estimates and soft thresholding are a few examples. Additionally, it demonstrates that if covariates are collinear, the coefficient estimates do not necessarily need to be unique, unlike in standard linear regression. Furthermore, the LASSO and basis pursuit denoising are closely related.

Self-play

Self-play is a method for enhancing reinforcement learning agents’ performance. Agents naturally learn to perform better by competing with themselves. Researchers attempt to maximize a learning agent’s performance on a task in multi-agent reinforcement learning experiments in collaboration or competition with one or more agents. Researchers may decide to have the learning algorithm take on the roles of two or more different agents as these agents learn through trial and error. When used effectively, this technique offers two benefits:

  • It offers a simple method to ascertain what the other agents are doing, creating a substantial challenge.
  • Since we can use the perspectives of the various agents for learning, it multiplies the amount of experience we can use to improve the policy by a factor of two or more.

Furthermore, the AlphaZero program uses self-play to enhance its abilities in the games of go, shogi, and chess. In addition, the epistemological idea of tabula rasa, which describes how people learn from a “blank slate,” has been compared to the act of self-play.

Source: indiaai.gov.in

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Editors Corner

How can Artificial Intelligence tools be a blessing for recruiters?

Will Artificial Intelligence ever match human intelligence?

Artificial Intelligence: Features of peer-to-peer networking

What not to share or ask on Chatgpt?

How can Machine Learning help in detecting and eliminating poverty?

How can Artificial Intelligence help in treating Autism?

Speech Recognition and its Wonders in your corporate life

Most groundbreaking Artificial Intelligence-based gadgets to vouch for in 2023

Recommended News

AI Next

Google: AI From All Perspectives

Alphabet subsidiary Google may have been slower than OpenAI to make its AI capabilities publicly available in the past, but...

by India Next
May 31, 2024
AI Next

US And UK Doctors Think Pfizer Is Setting The Standard For AI And Machine Learning In Drug Discovery

New research from Bryter, which involved over 200 doctors from the US and the UK, including neurologists, hematologists, and oncologists,...

by India Next
May 31, 2024
Solutions

An Agreement Is Signed By MEA, MeitY, And CSC To Offer E-Migration Services Via Shared Service Centers

Three government agencies joined forces to form a synergy in order to deliver eMigrate services through Common Services Centers (CSCs)...

by India Next
May 31, 2024
AI Next

PR Handbook For AI Startups: How To Avoid Traps And Succeed In A Crowded Field

The advent of artificial intelligence has significantly changed the landscape of entrepreneurship. The figures say it all. Global AI startups...

by India Next
May 31, 2024

Related Posts

Google
AI Next

Google: AI From All Perspectives

May 31, 2024
Pfizer
AI Next

US And UK Doctors Think Pfizer Is Setting The Standard For AI And Machine Learning In Drug Discovery

May 31, 2024
Artificial-Intelligence
AI Next

PR Handbook For AI Startups: How To Avoid Traps And Succeed In A Crowded Field

May 31, 2024
openai
AI Next

OpenAI Creates An AI Safety Committee Following Significant Departures

May 31, 2024
Load More
Next Post
edtech

Edtech Startup Jackett Raises $1 Million In Funding Led By Forge Ventures

IndiaNext Logo
IndiaNext Brings you latest news on artificial intelligence, Healthcare & Energy sector from all top sources in India and across the world.

Recent Posts

Google: AI From All Perspectives

US And UK Doctors Think Pfizer Is Setting The Standard For AI And Machine Learning In Drug Discovery

An Agreement Is Signed By MEA, MeitY, And CSC To Offer E-Migration Services Via Shared Service Centers

PR Handbook For AI Startups: How To Avoid Traps And Succeed In A Crowded Field

OpenAI Creates An AI Safety Committee Following Significant Departures

Tags

  • AI
  • EV
  • Mental WellBeing
  • Clean Energy
  • TeleMedicine
  • Healthcare
  • Electric Vehicles
  • Artificial Intelligence
  • Chatbots
  • Data Science
  • Electric Vehicles
  • Energy Storage
  • Machine Learning
  • Renewable Energy
  • Green Energy
  • Solar Energy
  • Solar Power

Follow us

  • Facebook
  • Linkedin
  • Twitter
© India Next. All Rights Reserved.     |     Privacy Policy      |      Web Design & Digital Marketing by Heeren Tanna
No Result
View All Result
  • About Us
  • Activate
  • Activity
  • Advisory Council
  • Archive
  • Career Page
  • Companies
  • Contact Us
  • cryptodemo
  • Energy next
  • Energy Next Archive
  • Home
  • Interviews
  • Make in India
  • Market
  • Members
  • Mission
  • News
  • News Update
  • People
  • Policy
  • Privacy Policy
  • Register
  • Reports
  • Subscription Page
  • Technology
  • Top 10
  • Videos
  • White Papers
  • Work Culture
  • Write For Us

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In

Add New Playlist

IndiaNext Logo

Join Our Newsletter

Get daily access to news updates

no spam, we hate it more than you!