Current directory: /home3/bjinbymy/public_html/indianext/wp-content/mu-plugins PASS: An ImageNet Replacement Augmenting Ethics in AI Datasets - AI Next
Indianext
No Result
View All Result
Subscribe
  • News
    • Project Watch
    • Policy
  • AI Next
  • People
    • Interviews
    • Profiles
  • Companies
  • Make In India
    • Solutions
    • State News
  • About Us
    • Editors Corner
    • Mission
    • Contact Us
    • Work Culture
  • Events
  • Guest post
  • News
    • Project Watch
    • Policy
  • AI Next
  • People
    • Interviews
    • Profiles
  • Companies
  • Make In India
    • Solutions
    • State News
  • About Us
    • Editors Corner
    • Mission
    • Contact Us
    • Work Culture
  • Events
  • Guest post
No Result
View All Result
Latest News on AI, Healthcare & Energy updates in India
No Result
View All Result
Home AI Next

PASS: An ImageNet Replacement Augmenting Ethics in AI Datasets

November 5, 2021
ImageNet

The progress made by AI over the last few years is remarkable. What started as an alien technology has now become embedded in every walk of life. AI is helping people and organizations augment human intelligence almost everywhere. However, this progress wouldn’t have been possible without the availability of increasingly large and diverse research datasets.

These datasets are collections of images sampled from the internet that provide a better representation of the statistics than images taken in the laboratory. It’s because of these datasets that we’re able to generalize to the real world better. They also allow reproducible and quantitative comparison of algorithms enabling researchers to efficiently build on each other’s work.

Modern machine learning relies on these diverse datasets to function. However, these datasets have technical and ethical shortcomings:

One, there is a copyright issue. While some datasets contain images licensed for use, some do not. They contain personal information collected via unsupervised methods and unclear license usage.

Two, it violates data protection. The majority of these images are collected by humans for their consumption and thus a large number of them contain people. As it is nearly impossible to obtain consent from all those people, the data is collected without consent.

To address these challenges and give the datasets a technical, ethical, and legal perspective, the Visual Geometry Group, University of Oxford has proposed unlabeled datasets. Named PASS: Pictures without humAns for Self-Supervision, the dataset only contains images with CC-BY license and complete attribution metadata. Also, it contains no images of people at all and avoids all images problematic for ethics and data protection.

“We do so by starting from a large-scale (100 million random Flickr images) dataset— YFCC100M meaning that the data is better randomized and identify a ‘safer’ subset within it. We also focus on data made available under the most permissive Creative Commons license (CC-BY) to address copyright concerns. Given this data, we then conduct an extensive evaluation of SSL methods, discussing performance differences when these are trained using ImageNet and PASS” the research team conceded in its paper.

“The annotators were asked to identify images that contain people or body parts, as well as personal information such as IDs, names, license plates, signatures, etc. Additionally, the annotators were asked to flag images with problematic content such as drugs, nudity, blood, and other offensive content. From the remaining images (1.46M) we further removed duplicates and randomly selected a subset with approximately the same size as IN-1k (1.440.191 images)”, the team specified.

Compared to ImageNet, PASS datasets enjoys certain advantages:

  • It three essential differences: lack of class-level curation, lack of community optimization, and lack of people
  • The self-supervised approaches such as MoCo, SwAV, and DINO train very well on the PASS dataset
  • The lack of humans does not cause an effect on downstream task performances
  • Models trained on the PASS dataset have better results than ImageNet in 8/13 frozen encoder evaluation benchmarks

However, it still has a fair share of limitations. First, despite filtering the images, some harmful content might have slipped through. Second, given the fact that PASS does not contain the existence of people, the model cannot be used to learn models of people, like pose recognition. Third, as PASS contains no labels, it cannot be used alone for training and benchmarking. This means the curated datasets that carry privacy and copyright issues remain necessary.

In such a case it would be interesting to witness how far PASS can go to reduce ethical and legal risks in datasets. Only time can tell whether it can actually curate and improve our datasets and introduce a more realistic training scenario of utilizing images obtained from labelled detests.

Source: indiaai.gov.in

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Editors Corner

How can Artificial Intelligence tools be a blessing for recruiters?

Will Artificial Intelligence ever match human intelligence?

Artificial Intelligence: Features of peer-to-peer networking

What not to share or ask on Chatgpt?

How can Machine Learning help in detecting and eliminating poverty?

How can Artificial Intelligence help in treating Autism?

Speech Recognition and its Wonders in your corporate life

Most groundbreaking Artificial Intelligence-based gadgets to vouch for in 2023

Recommended News

AI Next

Google: AI From All Perspectives

Alphabet subsidiary Google may have been slower than OpenAI to make its AI capabilities publicly available in the past, but...

by India Next
May 31, 2024
AI Next

US And UK Doctors Think Pfizer Is Setting The Standard For AI And Machine Learning In Drug Discovery

New research from Bryter, which involved over 200 doctors from the US and the UK, including neurologists, hematologists, and oncologists,...

by India Next
May 31, 2024
Solutions

An Agreement Is Signed By MEA, MeitY, And CSC To Offer E-Migration Services Via Shared Service Centers

Three government agencies joined forces to form a synergy in order to deliver eMigrate services through Common Services Centers (CSCs)...

by India Next
May 31, 2024
AI Next

PR Handbook For AI Startups: How To Avoid Traps And Succeed In A Crowded Field

The advent of artificial intelligence has significantly changed the landscape of entrepreneurship. The figures say it all. Global AI startups...

by India Next
May 31, 2024

Related Posts

Google
AI Next

Google: AI From All Perspectives

May 31, 2024
Pfizer
AI Next

US And UK Doctors Think Pfizer Is Setting The Standard For AI And Machine Learning In Drug Discovery

May 31, 2024
Artificial-Intelligence
AI Next

PR Handbook For AI Startups: How To Avoid Traps And Succeed In A Crowded Field

May 31, 2024
openai
AI Next

OpenAI Creates An AI Safety Committee Following Significant Departures

May 31, 2024
Load More
Next Post
AI for Data-Driven Governance

AI for Data-Driven Governance

IndiaNext Logo
IndiaNext Brings you latest news on artificial intelligence, Healthcare & Energy sector from all top sources in India and across the world.

Recent Posts

Google: AI From All Perspectives

US And UK Doctors Think Pfizer Is Setting The Standard For AI And Machine Learning In Drug Discovery

An Agreement Is Signed By MEA, MeitY, And CSC To Offer E-Migration Services Via Shared Service Centers

PR Handbook For AI Startups: How To Avoid Traps And Succeed In A Crowded Field

OpenAI Creates An AI Safety Committee Following Significant Departures

Tags

  • AI
  • EV
  • Mental WellBeing
  • Clean Energy
  • TeleMedicine
  • Healthcare
  • Electric Vehicles
  • Artificial Intelligence
  • Chatbots
  • Data Science
  • Electric Vehicles
  • Energy Storage
  • Machine Learning
  • Renewable Energy
  • Green Energy
  • Solar Energy
  • Solar Power

Follow us

  • Facebook
  • Linkedin
  • Twitter
© India Next. All Rights Reserved.     |     Privacy Policy      |      Web Design & Digital Marketing by Heeren Tanna
No Result
View All Result
  • About Us
  • Activate
  • Activity
  • Advisory Council
  • Archive
  • Career Page
  • Companies
  • Contact Us
  • cryptodemo
  • Energy next
  • Energy Next Archive
  • Home
  • Interviews
  • Make in India
  • Market
  • Members
  • Mission
  • News
  • News Update
  • People
  • Policy
  • Privacy Policy
  • Register
  • Reports
  • Subscription Page
  • Technology
  • Top 10
  • Videos
  • White Papers
  • Work Culture
  • Write For Us

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In

Add New Playlist

IndiaNext Logo

Join Our Newsletter

Get daily access to news updates

no spam, we hate it more than you!