User Submission: Cross-Industry Standard Process For Data Science And Machine Learning Projects

August 11, 2022
AI

How would you systematically go about planning a machine learning/data science project? As a data scientist or ML engineer, you don’t want to make the mistake of diving straight into designing the solution or modeling without first understanding the problem and objectives at hand. It’s also crucial to spend more time on the data itself. If you like frameworks and would prefer to build some discipline around structuring your data science/machine learning process, CRISP-DM (Cross-Industry Standard Process for Data Mining) can help you. CRISP-DM is one of the well-known and widely used industry-standard processes that can help modularize your data science/machine learning project into iterative steps. It breaks a machine learning project down into six iterative phases:

  1. Business understanding: This involves understanding the problem statement and the target user(s). Think about what you’re solving and why it matters. Identify the gaps in the current state and understand how the problem is solved today. Then quantify the business impact you expect to achieve once you solve the problem for your target user(s). It’s crucial to translate this business impact into outcome and output metrics. Make sure you define your success and failure metrics before diving into the data. It behooves you to understand your problem, gather domain expertise, and identify relevant factors.
  2. Data understanding: This involves gathering data, validating data, and performing exploratory data analysis (EDA). Gathering data entails sourcing the relevant data from identified sources, labeling the data if it’s not already labeled, and creating the relevant features. Validating data involves deciding how you or the business would like to handle missing, empty, or erroneous data and outliers. This step also includes performing quality control over your data to ensure the values are what you expect, and cleaning the data appropriately. Finally, EDA involves exploring the data, performing statistical analysis and visualizations, identifying any relationships and patterns, and reducing dimensionality where that helps.
  3. Data preparation: Data preparation entails feature engineering and selection, and subsequently, the steps that will get your data prepped for modeling. This includes splitting your data into training and test sets, scaling/standardizing your data set (with the scaling statistics computed on the training set only, so that no information from the test set leaks into training), resolving any class imbalances, and optionally encoding categorical features.
  4. Modeling: Modeling involves both model selection and tuning. The modeling process involves evaluating various algorithms via cross-validation, optimizing/tuning hyperparameters, documenting and versioning the model, and subsequently re-training the model. This also involves making the necessary trade-offs (performance, interpretability, and computational cost) when choosing the optimal algorithm as part of your model selection.
  5. Evaluation: This is where you’d score your model on the test set, interpret its output, and evaluate its performance. It’s a good idea to write unit and integration tests to test your model and make it more robust. Subsequently, user tests would build more confidence before operationalizing your model.
  6. Deployment: After you have evaluated the performance of your model and tested your solution, it’s time to deploy your model, adhering to the software deployment process and security measures already in place. Your final model could be exposed as an API or integrated within a current product/service. Ensure that you have a plan in place to monitor the performance of your model and re-train it if there is a need to do so.
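
To make the data preparation, modeling, and evaluation phases a little more concrete, here is a minimal sketch in plain Python (standard library only). Everything in it is an assumption made for illustration and not part of CRISP-DM itself: the toy two-class data set, the k-nearest-neighbors classifier, the candidate values of k, and helper names such as `fit_scaler` and `cross_val_accuracy`. In a real project you would more likely reach for an established ML library. The point is only the order of the steps: split first, fit the scaler on training data only, pick a hyperparameter via cross-validation, and touch the test set once at the end.

```python
import math
import random
from collections import Counter

def make_toy_data(n_per_class=40, seed=42):
    """Hypothetical data: two classes, two features on very different scales."""
    rng = random.Random(seed)
    rows = []
    for label, center in ((0, 0.0), (1, 6.0)):
        for _ in range(n_per_class):
            rows.append(([rng.gauss(center, 1.0), rng.gauss(center * 100, 100.0)], label))
    return rows

def train_test_split(rows, test_fraction=0.25, seed=0):
    """Shuffle a copy of the rows and hold out the last test_fraction as the test set."""
    rng = random.Random(seed)
    shuffled = rows[:]
    rng.shuffle(shuffled)
    n_test = max(1, int(len(shuffled) * test_fraction))
    return shuffled[:-n_test], shuffled[-n_test:]

def fit_scaler(rows):
    """Compute per-feature mean and standard deviation from the given rows only."""
    columns = list(zip(*(features for features, _ in rows)))
    means = [sum(col) / len(col) for col in columns]
    stds = [math.sqrt(sum((v - m) ** 2 for v in col) / len(col)) or 1.0
            for col, m in zip(columns, means)]
    return means, stds

def apply_scaler(rows, scaler):
    """Standardize features using statistics computed elsewhere (e.g. on the training set)."""
    means, stds = scaler
    return [([(v - m) / s for v, m, s in zip(features, means, stds)], label)
            for features, label in rows]

def knn_predict(train_rows, features, k):
    """Majority vote among the k nearest training points (Euclidean distance)."""
    nearest = sorted(train_rows, key=lambda row: math.dist(row[0], features))[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]

def accuracy(train_rows, eval_rows, k):
    """Fraction of eval_rows whose label is predicted correctly."""
    hits = sum(knn_predict(train_rows, f, y_k := k) == y for f, y in eval_rows)
    return hits / len(eval_rows)

def cross_val_accuracy(rows, k, n_folds=4):
    """Mean validation accuracy over n_folds; the scaler is refit inside every fold."""
    scores = []
    for i in range(n_folds):
        valid = rows[i::n_folds]
        train = [r for j, r in enumerate(rows) if j % n_folds != i]
        scaler = fit_scaler(train)
        scores.append(accuracy(apply_scaler(train, scaler), apply_scaler(valid, scaler), k))
    return sum(scores) / len(scores)

# Data preparation: split first, so the test set stays unseen until the end.
data = make_toy_data()
train_rows, test_rows = train_test_split(data)

# Modeling: choose k by cross-validation on the training set only.
candidate_ks = [1, 3, 5]
best_k = max(candidate_ks, key=lambda k: cross_val_accuracy(train_rows, k))

# Evaluation: refit the scaler on all training rows, then score the test set once.
scaler = fit_scaler(train_rows)
test_accuracy = accuracy(apply_scaler(train_rows, scaler),
                         apply_scaler(test_rows, scaler), best_k)
print(f"best k = {best_k}, test accuracy = {test_accuracy:.2f}")
```

Refitting the scaler inside each fold mirrors what happens at test time: the validation rows are standardized with statistics they did not contribute to, which keeps the cross-validation estimate honest.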

Again, remember this is an iterative process, so you might find yourself switching back and forth if project requirements change and/or you’re not satisfied with the outcomes. You might want to modify some steps depending on your project requirements and timelines. Overall, CRISP-DM is a domain-agnostic process that can help you build discipline and a framework around executing your data science/machine learning projects.
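
One place where this iteration shows up in practice is after deployment, when monitoring tells you it’s time to loop back to earlier phases. The sketch below is one possible way to flag that, assuming you eventually receive the true label for each prediction. The class name `PerformanceMonitor`, the window size, and the accuracy threshold are all hypothetical choices for illustration; what counts as “a need to re-train” depends on the success and failure metrics you defined during business understanding.

```python
from collections import deque

class PerformanceMonitor:
    """Tracks rolling accuracy over the most recent labeled predictions."""

    def __init__(self, window_size=50, min_accuracy=0.8):
        # deque with maxlen drops the oldest outcome once the window is full.
        self.outcomes = deque(maxlen=window_size)
        self.min_accuracy = min_accuracy

    def record(self, predicted, actual):
        """Store whether a single prediction matched the label that arrived later."""
        self.outcomes.append(predicted == actual)

    def rolling_accuracy(self):
        """Accuracy over the current window, or None if nothing is recorded yet."""
        if not self.outcomes:
            return None
        return sum(self.outcomes) / len(self.outcomes)

    def needs_retraining(self):
        """True only once the window is full and accuracy is below the threshold."""
        if len(self.outcomes) < self.outcomes.maxlen:
            return False
        return self.rolling_accuracy() < self.min_accuracy


# Usage with a deliberately tiny window so the behavior is easy to follow.
monitor = PerformanceMonitor(window_size=4, min_accuracy=0.75)
for predicted, actual in [(1, 1), (0, 0), (1, 1), (0, 0)]:
    monitor.record(predicted, actual)
healthy = monitor.needs_retraining()    # window is full and every prediction was right

for predicted, actual in [(1, 0), (0, 1)]:
    monitor.record(predicted, actual)
degraded = monitor.needs_retraining()   # window now holds 2 hits and 2 misses
print(healthy, degraded)
```

Waiting for a full window before raising the flag avoids triggering a re-train on the first couple of unlucky predictions.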

Source: indiaai.gov.in
