Semi-Supervised Learning in ML

Semi-supervised learning is a machine learning approach that combines a small amount of labeled data with a large amount of unlabeled data to train models. It is particularly useful in scenarios where labeling data is expensive or impractical, such as in text and image classification, and anomaly detection. The method relies on assumptions like continuity, cluster, and manifold to effectively utilize both labeled and unlabeled data for better prediction accuracy.

Uploaded by Sajid Hussain

23/10/2024, 22:43 Semi-Supervised Learning in ML - GeeksforGeeks

Semi-Supervised Learning in ML
Last Updated : 21 May, 2024

Today’s Machine Learning algorithms can be broadly classified into three
categories: Supervised Learning, Unsupervised Learning, and Reinforcement
Learning. Setting Reinforcement Learning aside, the two primary categories of
Machine Learning problems are Supervised and Unsupervised Learning. The
basic difference between the two is that Supervised Learning datasets have
an output label associated with each tuple while Unsupervised Learning
datasets do not.

What is Semi-Supervised Learning?


Semi-supervised learning is a type of machine learning that falls in between
supervised and unsupervised learning. It is a method that uses a small
amount of labeled data and a large amount of unlabeled data to train a
model. The goal of semi-supervised learning is to learn a function that can
accurately predict the output variable based on the input variables, similar to
supervised learning. However, unlike supervised learning, the algorithm is
trained on a dataset that contains both labeled and unlabeled data.

Semi-supervised learning is particularly useful when there is a large amount
of unlabeled data available, but it’s too expensive or difficult to label all of
it.

Semi-Supervised Learning Flow Chart


Intuitively, one may imagine the three types of learning algorithms as
follows: in Supervised learning, a student is under the supervision of a
teacher at both home and school; in Unsupervised learning, a student has to
figure out a concept on their own; and in Semi-Supervised learning, a teacher
teaches a few concepts in class and assigns homework questions that are
based on similar concepts.

Examples of Semi-Supervised Learning


Text classification: In text classification, the goal is to classify a given text
into one or more predefined categories. Semi-supervised learning can be
used to train a text classification model using a small amount of labeled
data and a large amount of unlabeled text data.
Image classification: In image classification, the goal is to classify a given
image into one or more predefined categories. Semi-supervised learning
can be used to train an image classification model using a small amount
of labeled data and a large amount of unlabeled image data.
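Anomaly detection: In anomaly detection, the goal is to detect patterns
or observations that are unusual or different from the norm. Semi-supervised
learning can be used here when only a few observations are labeled as
normal or anomalous and the rest of the data is unlabeled.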

Assumptions followed by Semi-Supervised Learning


A Semi-Supervised algorithm assumes the following about the data:


1. Continuity Assumption: The algorithm assumes that points which are
close to each other are more likely to have the same output label.
2. Cluster Assumption: The data can be divided into discrete clusters and
points in the same cluster are more likely to share an output label.
3. Manifold Assumption: The data lie approximately on a manifold of a
much lower dimension than the input space. This assumption allows the
use of distances and densities which are defined on a manifold.
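As a rough illustration of the continuity assumption, the sketch below gives each unlabeled point the label of its closest labeled point. It is a minimal sketch using only the Python standard library, not a method taken from this article: the function name nearest_labeled_neighbor_labels and the toy coordinates are hypothetical, and Euclidean distance is assumed as the notion of "closeness".

```python
import math


def nearest_labeled_neighbor_labels(labeled, unlabeled):
    """Give each unlabeled point the label of its closest labeled point.

    labeled   -- list of (point, label) pairs, where point is a tuple of floats
    unlabeled -- list of points without labels
    """
    predictions = []
    for point in unlabeled:
        # Continuity assumption: the nearest labeled point is the best guess.
        _, label = min(labeled, key=lambda pair: math.dist(pair[0], point))
        predictions.append(label)
    return predictions


# Hypothetical toy data: two labeled points and four unlabeled ones.
labeled = [((0.0, 0.0), "A"), ((10.0, 10.0), "B")]
unlabeled = [(1.0, 0.5), (0.5, 2.0), (9.0, 9.5), (11.0, 8.0)]

predicted = nearest_labeled_neighbor_labels(labeled, unlabeled)
print(predicted)
```

The first two unlabeled points sit near the point labeled "A" and the last two near the point labeled "B", so they inherit those labels. Real data rarely separates this cleanly, which is why the cluster and manifold assumptions are usually considered alongside continuity.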

Applications of Semi-Supervised Learning


1. Speech Analysis: Since labeling audio files is a very labor-intensive
task, Semi-Supervised learning is a natural approach to this problem.
2. Internet Content Classification: Labeling every webpage is impractical,
so Semi-Supervised learning algorithms are used instead. Even the Google
search algorithm is said to use a variant of Semi-Supervised learning to
rank the relevance of a webpage for a given query.
3. Protein Sequence Classification: Since DNA strands are typically very
large in size, labeling them by hand is costly, and Semi-Supervised
learning has become prominent in this field.

Disadvantages of Semi-Supervised Learning


The most basic disadvantage of any Supervised Learning algorithm is that
the dataset has to be hand-labeled, either by a Machine Learning Engineer or
by a Data Scientist. This is a very costly process, especially when dealing
with large volumes of data. The most basic disadvantage of any Unsupervised
Learning algorithm is that its range of applications is limited.

To counter these disadvantages, the concept of Semi-Supervised Learning
was introduced. In this type of learning, the algorithm is trained on a
combination of labeled and unlabeled data. Typically, this combination
contains a very small amount of labeled data and a very large amount of
unlabeled data. The basic procedure is that the programmer first clusters
similar data using an unsupervised learning algorithm and then uses the
existing labeled data to label the rest of the unlabeled data. The typical
use cases of this type of algorithm share a common property: acquiring
unlabeled data is relatively cheap, while labeling that data is very
expensive.
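The cluster-then-label procedure described above can be sketched roughly as follows. This is a minimal illustration using only the Python standard library, not a definitive implementation: k-means is assumed as the unsupervised clustering step (the article does not name one), a majority vote is assumed as the rule for spreading labels within a cluster, and the names kmeans, cluster_then_label, and the toy data are hypothetical.

```python
import math
from collections import Counter


def kmeans(points, initial_centroids, iterations=10):
    """Plain k-means; returns the cluster index assigned to each point."""
    centroids = list(initial_centroids)  # copy so the caller's list is untouched
    assignments = []
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        assignments = [
            min(range(len(centroids)), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for c in range(len(centroids)):
            members = [p for p, a in zip(points, assignments) if a == c]
            if members:  # keep the old centroid if a cluster is empty
                centroids[c] = tuple(sum(axis) / len(members) for axis in zip(*members))
    return assignments


def cluster_then_label(points, known_labels, initial_centroids):
    """known_labels maps a point's index to its label for the few labeled points."""
    assignments = kmeans(points, initial_centroids)
    # Majority vote among the labeled points that fall inside each cluster.
    votes = {}
    for index, label in known_labels.items():
        votes.setdefault(assignments[index], Counter())[label] += 1
    cluster_label = {c: counts.most_common(1)[0][0] for c, counts in votes.items()}
    # Points in a cluster with no labeled member stay unlabeled (None).
    return [cluster_label.get(a) for a in assignments]


# Hypothetical toy data: six points, only two of which carry a label.
points = [(1.0, 1.0), (1.5, 2.0), (2.0, 1.0), (8.0, 8.0), (9.0, 8.5), (8.5, 9.5)]
known_labels = {0: "cat", 3: "dog"}

labels = cluster_then_label(points, known_labels, [points[0], points[3]])
print(labels)
```

Here two labeled points are enough to label all six, because the data falls into two well-separated groups. The result depends on the cluster assumption holding: if a cluster mixes several true classes, the majority vote will mislabel some of its points.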
