summary/main.tex

\documentclass[sigconf]{acmart}
\usepackage{amsmath}
\usepackage{bbm}
\usepackage{mathtools}

\usepackage[inline]{enumitem}

\settopmatter{printacmref=false} % Removes citation information below abstract
\renewcommand\footnotetextcopyrightpermission[1]{} % removes footnote with conference information in first column
\pagestyle{plain} % removes running headers

%%
%% \BibTeX command to typeset BibTeX logo in the docs
\AtBeginDocument{%
  \providecommand\BibTeX{{%
    \normalfont B\kern-0.5em{\scshape i\kern-0.25em b}\kern-0.8em\TeX}}}

\acmConference{Cross-Model Pseudo-Labeling}{2023}{Linz}

%%
%% end of the preamble, start of the body of the document source.
\begin{document}

%%
%% The "title" command has an optional parameter,
%% allowing the author to define a "short title" to be used in page headers.
\title{Cross-Model Pseudo-Labeling for Semi-Supervised Action recognition}

%%
%% The "author" command and its associated commands are used to define
%% the authors and their affiliations.
%% Of note is the shared affiliation of the first two authors, and the
%% "authornote" and "authornotemark" commands
%% used to denote shared contribution to the research.
\author{Lukas Heiligenbrunner}
\email{k12104785@students.jku.at}
\affiliation{%
  \institution{Johannes Kepler University Linz}
  \city{Linz}
  \state{Upperaustria}
  \country{Austria}
  \postcode{4020}
}

%%
%% By default, the full list of authors will be used in the page
%% headers. Often, this list is too long, and will overlap
%% other information printed in the page headers. This command allows
%% the author to define a more concise list
%% of authors' names for this purpose.
\renewcommand{\shortauthors}{Lukas Heilgenbrunner}

%%
%% The abstract is a short summary of the work to be presented in the
%% article.
\begin{abstract}
  Cross-Model Pseudo-Labeling is a new framework for generating Pseudo-Labels
  for supervised learning tasks where only a subset of true labels is known.
  It builds upon the existing approach of FixMatch and improves it further by
  using two different sized models complementing each other.
\end{abstract}

%%
%% Keywords. The author(s) should pick words that accurately describe
%% the work being presented. Separate the keywords with commas.
\keywords{neural networks, videos, pseudo-labeling, action recognition}

%\received{20 February 2007}
%\received[revised]{12 March 2009}
%\received[accepted]{5 June 2009}

%%
%% This command processes the author and affiliation and title
%% information and builds the first part of the formatted document.
\maketitle

\section{Introduction}\label{sec:introduction}
For most supervised learning tasks are lots of training samples essential.
With too less training data the model will gerneralize not well and not fit a real world task.
Labeling datasets is commonly seen as an expensive task and wants to be avoided as much as possible.
Thats why there is a machine-learning field called Semi-Supervised learning.
The general approach is to train a model that predicts Pseudo-Labels which then can be used to train the main model.

The goal of this paper is video action recognition.
Given are approximately 10 seconds long videos which should be classified.
In this paper datasets with 400 and 101 different classes are used.
The proposed approach is tested with 1\% and 10\% of known labels of all data points.

\section{Semi-Supervised learning}\label{sec:semi-supervised-learning}
In traditional supervised learning we have a labeled dataset.
Each datapoint is associated with a corresponding target label.
The goal is to fit a model to predict the labels from datapoints.

In traditional unsupervised learning there are also datapoints but no labels are known.
The goal is to find patterns or structures in the data.
Moreover, it can be used for clustering or downprojection.

Those two techniques combined yield semi-supervised learning.
Some of the labels are known, but for most of the data we have only the raw datapoints.
The basic idea is that the unlabeled data can significantly improve the model performance when used in combination with the labeled data.

\section{FixMatch}\label{sec:fixmatch}
There exists an already existing approach called FixMatch.
This was introduced in a Google Research paper from 2020~\cite{fixmatch}.
The key idea of FixMatch is to leverage the unlabeled data by predicting pseudo-labels out of the known labels.
Then both, the known labels and the predicted ones are used side by side to train the model.
The labeled samples guide the learning process and the unlabeled samples gain additional information.

Not every pseudo prediction is kept to train the model further.
A confidence threshold is defined to evaluate how `confident` the model is about its prediction.
The prediction is dropped if the model is too less confident.
The quantity and quality of the obtained labels is crucial and they have an significant impact on the overall accuracy.
This means improving the pseudo-label framework as much as possible is essential.

FixMatch results in some major limitations.
It relies on a single model for generating pseudo-labels which can introduce errors and uncertainty in the labels.
Incorrect pseudo-labels may effect the learning process negatively.
Furthermore, Fixmatch uses a compareably small model for label prediction which has a limited capacity.
This can negatively affect the learning process as well.
%There is no measure defined how certain the model is about its prediction.
%Such a measure improves overall performance by filtering noisy and unsure predictions.
Cross-Model Pseudo-Labeling tries to address all of those limitations.

\subsection{Math of FixMatch}\label{subsec:math-of-fixmatch}
Equation~\ref{eq:fixmatch} defines the loss-function that trains the model.
The sum over a batch size $B_u$ takes the average loss of this batch and should be familiar.
The input data is augmented in two different ways.
At first there is a weak augmentation $\mathcal{T}_{\text{weak}}(\cdot)$ which only applies basic transformation such as filtering and bluring.
Moreover, there is the strong augmentation $\mathcal{T}_{\text{strong}}(\cdot)$ which does cropouts and random augmentations.

\begin{equation}
  \label{eq:fixmatch}
  \mathcal{L}_u = \frac{1}{B_u} \sum_{i=1}^{B_u} \mathbbm{1}(\max(p_i) \geq \tau) \mathcal{H}(\hat{y}_i,F(\mathcal{T}_{\text{strong}}(u_i)))
\end{equation}

The indicator function $\mathbbm{1}(\cdot)$ applies a principle called `confidence-based masking`.
It retains a label only if its largest probability is above a threshold $\tau$.
Where $p_i \coloneqq F(\mathcal{T}_{\text{weak}}(u_i))$ is a model evaluation with a weakly augmented input.

\begin{equation}
  \label{eq:crossentropy}
  \mathcal{H}(\hat{y}_i, y_i) = -\sum_{i=1} y_i \cdot log(\hat{y}_i)
\end{equation}

The second part $\mathcal{H}(\cdot, \cdot)$ is a standard Cross-entropy loss function which takes two inputs, the predicted and the true label.
$\hat{y}_i$, the obtained pseudo-label and $F(\mathcal{T}_{\text{strong}}(u_i))$, a model evaluation with strong augmentation.
The indicator function evaluates in $0$ if the pseudo prediction is not confident and the current loss evaluation will be dropped.
Otherwise it evaluates to 1 and it will be kept and trains the model further.

\section{Cross-Model Pseudo-Labeling}\label{sec:cross-model-pseudo-labeling}
The newly invented approach of this paper is called Cross-Model Pseudo-Labeling (CMPL)\cite{Xu_2022_CVPR}.
Figure~\ref{fig:cmpl-structure} visualizs the structure of CMPL\@.
Two different models, a smaller auxiliary model and a larger model are defined.
They provide pseudo-labels for each other.
The two different models have a different structural bias which leads to complementary representations.
This symetric design performs a boost in performance.
The SG label means stop gradient.
The loss function evaluations are fed into the opposite model as loss.
The two models train each other.


\begin{figure}[h]
  \centering
  \includegraphics[width=\linewidth]{../presentation/rsc/structure}
  \caption{Architecture of Cross-Model Pseudo-Labeling}
  \label{fig:cmpl-structure}
\end{figure}

\subsection{Math of CMPL}\label{subsec:math}
The loss function of CMPL is similar to that one explaind above.
But we have to differ from the loss generated from the supervised samples where the labels are known and the unsupervised loss where no labels are knonw.

The two equations~\ref{eq:cmpl-losses1} and~\ref{eq:cmpl-losses2} are normal Cross-Entropy loss functions generated with the supervised labels of the two seperate models.


\begin{align}
  \label{eq:cmpl-losses1}
  \mathcal{L}_s^F &= \frac{1}{B_l} \sum_{i=1}^{B_l} \mathcal{H}(y_i,F(\mathcal{T}^F_{\text{standard}}(v_i)))\\
  \label{eq:cmpl-losses2}
  \mathcal{L}_s^A &= \frac{1}{B_l} \sum_{i=1}^{B_l} \mathcal{H}(y_i,A(\mathcal{T}^F_{\text{standard}}(v_i)))
\end{align}

Equation~\ref{eq:cmpl-loss3} and~\ref{eq:cmpl-loss4} are the unsupervised losses.
They are very similar to FastMatch, but important to note is that the confidence-based masking is applied to the opposite corresponding model.

\begin{align}
  \label{eq:cmpl-loss3}
  \mathcal{L}_u^F &= \frac{1}{B_u} \sum_{i=1}^{B_u} \mathbbm{1}(\max(p_i^A) \geq \tau) \mathcal{H}(\hat{y}_i^A,F(\mathcal{T}_{\text{strong}}(u_i)))\\
  \label{eq:cmpl-loss4}
  \mathcal{L}_u^A &= \frac{1}{B_u} \sum_{i=1}^{B_u} \mathbbm{1}(\max(p_i^F) \geq \tau) \mathcal{H}(\hat{y}_i^F,A(\mathcal{T}_{\text{strong}}(u_i)))
\end{align}

Finally to train the main objective an overall loss is calculated by simply summing all the losses.
The loss is regulated by an hyperparamter $\lambda$ to enhance the importance of the supervised loss.

\begin{equation}
  \label{eq:loss-main-obj}
  \mathcal{L} = (\mathcal{L}_s^F + \mathcal{L}_s^A) + \lambda(\mathcal{L}_u^F + \mathcal{L}_u^A)
\end{equation}

\section{Architecture}\label{sec:Architecture}
The used model architectures depend highly on the task to be performed.
In this case the task is video action recognition.
A 3D-ResNet50 was chosen for the main model and a smaller 3D-ResNet18 for the auxiliary model.

\section{Performance}\label{sec:performance}

In figure~\ref{fig:results} a performance comparison is shown between just using the supervised samples for training against some different pseudo label frameworks.
One can clearly see that the performance gain with the new CMPL framework is quite significant.
For evaluation the Kinetics-400 and UCF-101 datasets are used.
And as a backbone model a 3D-ResNet18 and 3D-ResNet50 are used.
Even when only 1\% of true labels are known for the UCF-101 dataset 25.1\% of the labels could be predicted right.

\begin{figure}[h]
  \centering
  \includegraphics[width=\linewidth]{../presentation/rsc/results}
  \caption{Performance comparisons between CMPL, FixMatch and supervised learning only}
  \label{fig:results}
\end{figure}

\section{Further schemes}\label{sec:further-schemes}
How the pseudo-labels are generated may impact the overall performance.
In this paper the pseudo-labels are obtained by the cross-model approach.
But there might be other strategies.
For example:
\begin{enumerate*}
  \item Self-First: Each network uses just its own prediction if its confident enough.
  If not, it uses its sibling net prediction.
  \item Opposite-First: Each net prioritizes the prediction of the sibling network.
  \item Maximum: The most confident prediction is leveraged.
  \item Average: The two predictions are averaged before deriving the pseudo-label
\end{enumerate*}.

Those are just other approaches one can keep in mind.
This doesn't mean they are better, in fact they performed even worse in this study.

\section{Conclusion}\label{sec:conclusion}
In conclusion, Cross-Model Pseudo-Labeling demonstrates the potential to significantly advance the field of semi-supervised action recognition.
Cross-Model Pseudo-Labeling outperforms the supervised-only approach over several experiments by a multiple.
It surpasses most of the other existing pseudo-labeling frameworks.
Through the integration of main and auxiliary models, consistency regularization, and uncertainty estimation, CMPL offers a powerful framework for leveraging unlabeled data and improving model performance.
It paves the way for more accurate and efficient action recognition systems.

%%
%% The next two lines define the bibliography style to be used, and
%% the bibliography file.
\bibliographystyle{ACM-Reference-Format}
\bibliography{sources}

%%
%% If your work has an appendix, this is the place to put it.
\appendix

% appendix

\end{document}
\endinput
add summary template 2023-03-29 14:14:05 +02:00			`\documentclass[sigconf]{acmart}`
shorten the template 2023-03-29 14:52:30 +02:00			`\usepackage{amsmath}`
			`\usepackage{bbm}`
add stuff about semi-supervised learning and fixmatch 2023-05-19 17:11:47 +02:00			`\usepackage{mathtools}`
move to subdir add ci 2023-03-14 22:26:54 +01:00
fix some typos and add some stuff 2023-05-27 11:40:13 +02:00			`\usepackage[inline]{enumitem}`

add stuff 2023-06-10 12:11:21 +02:00			`\settopmatter{printacmref=false} % Removes citation information below abstract`
			`\renewcommand\footnotetextcopyrightpermission[1]{} % removes footnote with conference information in first column`
			`\pagestyle{plain} % removes running headers`

add summary template 2023-03-29 14:14:05 +02:00			`%%`
			`%% \BibTeX command to typeset BibTeX logo in the docs`
			`\AtBeginDocument{%`
			`\providecommand\BibTeX{{%`
			`\normalfont B\kern-0.5em{\scshape i\kern-0.25em b}\kern-0.8em\TeX}}}`
move to subdir add ci 2023-03-14 22:26:54 +01:00
add stuff 2023-06-10 12:11:21 +02:00			`\acmConference{Cross-Model Pseudo-Labeling}{2023}{Linz}`
add summary template 2023-03-29 14:14:05 +02:00
			`%%`
			`%% end of the preamble, start of the body of the document source.`
move to subdir add ci 2023-03-14 22:26:54 +01:00			`\begin{document}`

add summary template 2023-03-29 14:14:05 +02:00			`%%`
			`%% The "title" command has an optional parameter,`
			`%% allowing the author to define a "short title" to be used in page headers.`
			`\title{Cross-Model Pseudo-Labeling for Semi-Supervised Action recognition}`

			`%%`
			`%% The "author" command and its associated commands are used to define`
			`%% the authors and their affiliations.`
			`%% Of note is the shared affiliation of the first two authors, and the`
			`%% "authornote" and "authornotemark" commands`
			`%% used to denote shared contribution to the research.`
			`\author{Lukas Heiligenbrunner}`
			`\email{k12104785@students.jku.at}`
			`\affiliation{%`
shorten the template 2023-03-29 14:52:30 +02:00			`\institution{Johannes Kepler University Linz}`
			`\city{Linz}`
			`\state{Upperaustria}`
			`\country{Austria}`
			`\postcode{4020}`
add summary template 2023-03-29 14:14:05 +02:00			`}`

			`%%`
			`%% By default, the full list of authors will be used in the page`
			`%% headers. Often, this list is too long, and will overlap`
			`%% other information printed in the page headers. This command allows`
			`%% the author to define a more concise list`
			`%% of authors' names for this purpose.`
add stuff 2023-06-10 12:11:21 +02:00			`\renewcommand{\shortauthors}{Lukas Heilgenbrunner}`
add summary template 2023-03-29 14:14:05 +02:00
			`%%`
			`%% The abstract is a short summary of the work to be presented in the`
			`%% article.`
			`\begin{abstract}`
fix some typos and add some stuff 2023-05-27 11:40:13 +02:00			`Cross-Model Pseudo-Labeling is a new framework for generating Pseudo-Labels`
add stuff about semi-supervised learning and fixmatch 2023-05-19 17:11:47 +02:00			`for supervised learning tasks where only a subset of true labels is known.`
add summary template 2023-03-29 14:14:05 +02:00			`It builds upon the existing approach of FixMatch and improves it further by`
			`using two different sized models complementing each other.`
			`\end{abstract}`

			`%%`
			`%% Keywords. The author(s) should pick words that accurately describe`
			`%% the work being presented. Separate the keywords with commas.`
			`\keywords{neural networks, videos, pseudo-labeling, action recognition}`

fix some typos and add some stuff 2023-05-27 11:40:13 +02:00			`%\received{20 February 2007}`
			`%\received[revised]{12 March 2009}`
			`%\received[accepted]{5 June 2009}`
add summary template 2023-03-29 14:14:05 +02:00
			`%%`
			`%% This command processes the author and affiliation and title`
			`%% information and builds the first part of the formatted document.`
			`\maketitle`

add remaining loss formulas fix some typos 2023-05-22 18:28:41 +02:00			`\section{Introduction}\label{sec:introduction}`
write introduction 2023-05-03 16:04:46 +02:00			`For most supervised learning tasks are lots of training samples essential.`
add stuff about semi-supervised learning and fixmatch 2023-05-19 17:11:47 +02:00			`With too less training data the model will gerneralize not well and not fit a real world task.`
			`Labeling datasets is commonly seen as an expensive task and wants to be avoided as much as possible.`
write introduction 2023-05-03 16:04:46 +02:00			`Thats why there is a machine-learning field called Semi-Supervised learning.`
			`The general approach is to train a model that predicts Pseudo-Labels which then can be used to train the main model.`

add stuff 2023-06-10 12:11:21 +02:00			`The goal of this paper is video action recognition.`
add some dataset infos 2023-06-03 23:20:07 +02:00			`Given are approximately 10 seconds long videos which should be classified.`
			`In this paper datasets with 400 and 101 different classes are used.`
add stuff 2023-06-10 12:11:21 +02:00			`The proposed approach is tested with 1\% and 10\% of known labels of all data points.`
add some dataset infos 2023-06-03 23:20:07 +02:00
add remaining loss formulas fix some typos 2023-05-22 18:28:41 +02:00			`\section{Semi-Supervised learning}\label{sec:semi-supervised-learning}`
add stuff about semi-supervised learning and fixmatch 2023-05-19 17:11:47 +02:00			`In traditional supervised learning we have a labeled dataset.`
			`Each datapoint is associated with a corresponding target label.`
			`The goal is to fit a model to predict the labels from datapoints.`

add conclusion 2023-06-18 09:17:16 +02:00			`In traditional unsupervised learning there are also datapoints but no labels are known.`
add stuff 2023-06-10 12:11:21 +02:00			`The goal is to find patterns or structures in the data.`
fix some typos and add some stuff 2023-05-27 11:40:13 +02:00			`Moreover, it can be used for clustering or downprojection.`
add stuff about semi-supervised learning and fixmatch 2023-05-19 17:11:47 +02:00
			`Those two techniques combined yield semi-supervised learning.`
			`Some of the labels are known, but for most of the data we have only the raw datapoints.`
			`The basic idea is that the unlabeled data can significantly improve the model performance when used in combination with the labeled data.`
write introduction 2023-05-03 16:04:46 +02:00
			`\section{FixMatch}\label{sec:fixmatch}`
			`There exists an already existing approach called FixMatch.`
			`This was introduced in a Google Research paper from 2020~\cite{fixmatch}.`
add stuff about semi-supervised learning and fixmatch 2023-05-19 17:11:47 +02:00			`The key idea of FixMatch is to leverage the unlabeled data by predicting pseudo-labels out of the known labels.`
			`Then both, the known labels and the predicted ones are used side by side to train the model.`
			`The labeled samples guide the learning process and the unlabeled samples gain additional information.`

			`Not every pseudo prediction is kept to train the model further.`
add stuff 2023-06-10 12:11:21 +02:00			A confidence threshold is defined to evaluate how `confident` the model is about its prediction.
add stuff about semi-supervised learning and fixmatch 2023-05-19 17:11:47 +02:00			`The prediction is dropped if the model is too less confident.`
			`The quantity and quality of the obtained labels is crucial and they have an significant impact on the overall accuracy.`
add stuff 2023-06-10 12:11:21 +02:00			`This means improving the pseudo-label framework as much as possible is essential.`

			`FixMatch results in some major limitations.`
			`It relies on a single model for generating pseudo-labels which can introduce errors and uncertainty in the labels.`
			`Incorrect pseudo-labels may effect the learning process negatively.`
			`Furthermore, Fixmatch uses a compareably small model for label prediction which has a limited capacity.`
			`This can negatively affect the learning process as well.`
add conclusion 2023-06-18 09:17:16 +02:00			`%There is no measure defined how certain the model is about its prediction.`
			`%Such a measure improves overall performance by filtering noisy and unsure predictions.`
add stuff 2023-06-10 12:11:21 +02:00			`Cross-Model Pseudo-Labeling tries to address all of those limitations.`
add stuff about semi-supervised learning and fixmatch 2023-05-19 17:11:47 +02:00
			`\subsection{Math of FixMatch}\label{subsec:math-of-fixmatch}`
add remaining loss formulas fix some typos 2023-05-22 18:28:41 +02:00			`Equation~\ref{eq:fixmatch} defines the loss-function that trains the model.`
add stuff 2023-06-10 12:11:21 +02:00			`The sum over a batch size $B_u$ takes the average loss of this batch and should be familiar.`
add stuff about semi-supervised learning and fixmatch 2023-05-19 17:11:47 +02:00			`The input data is augmented in two different ways.`
			`At first there is a weak augmentation $\mathcal{T}_{\text{weak}}(\cdot)$ which only applies basic transformation such as filtering and bluring.`
fix some typos and add some stuff 2023-05-27 11:40:13 +02:00			`Moreover, there is the strong augmentation $\mathcal{T}_{\text{strong}}(\cdot)$ which does cropouts and random augmentations.`
add cmpl stuff and structure image 2023-05-19 18:18:57 +02:00
			`\begin{equation}`
			`\label{eq:fixmatch}`
			`\mathcal{L}_u = \frac{1}{B_u} \sum_{i=1}^{B_u} \mathbbm{1}(\max(p_i) \geq \tau) \mathcal{H}(\hat{y}_i,F(\mathcal{T}_{\text{strong}}(u_i)))`
			`\end{equation}`

add stuff 2023-06-10 12:11:21 +02:00			The indicator function $\mathbbm{1}(\cdot)$ applies a principle called `confidence-based masking`.
add stuff about semi-supervised learning and fixmatch 2023-05-19 17:11:47 +02:00			`It retains a label only if its largest probability is above a threshold $\tau$.`
			`Where $p_i \coloneqq F(\mathcal{T}_{\text{weak}}(u_i))$ is a model evaluation with a weakly augmented input.`
add conclusion 2023-06-18 09:17:16 +02:00
			`\begin{equation}`
			`\label{eq:crossentropy}`
			`\mathcal{H}(\hat{y}_i, y_i) = -\sum_{i=1} y_i \cdot log(\hat{y}_i)`
			`\end{equation}`

add remaining loss formulas fix some typos 2023-05-22 18:28:41 +02:00			`The second part $\mathcal{H}(\cdot, \cdot)$ is a standard Cross-entropy loss function which takes two inputs, the predicted and the true label.`
add stuff about semi-supervised learning and fixmatch 2023-05-19 17:11:47 +02:00			`$\hat{y}_i$, the obtained pseudo-label and $F(\mathcal{T}_{\text{strong}}(u_i))$, a model evaluation with strong augmentation.`
			`The indicator function evaluates in $0$ if the pseudo prediction is not confident and the current loss evaluation will be dropped.`
add stuff 2023-06-10 12:11:21 +02:00			`Otherwise it evaluates to 1 and it will be kept and trains the model further.`
add stuff about semi-supervised learning and fixmatch 2023-05-19 17:11:47 +02:00
add remaining loss formulas fix some typos 2023-05-22 18:28:41 +02:00			`\section{Cross-Model Pseudo-Labeling}\label{sec:cross-model-pseudo-labeling}`
			`The newly invented approach of this paper is called Cross-Model Pseudo-Labeling (CMPL)\cite{Xu_2022_CVPR}.`
add stuff 2023-06-10 12:11:21 +02:00			`Figure~\ref{fig:cmpl-structure} visualizs the structure of CMPL\@.`
add conclusion 2023-06-18 09:17:16 +02:00			`Two different models, a smaller auxiliary model and a larger model are defined.`
			`They provide pseudo-labels for each other.`
			`The two different models have a different structural bias which leads to complementary representations.`
			`This symetric design performs a boost in performance.`
fix some typos and add some stuff 2023-05-27 11:40:13 +02:00			`The SG label means stop gradient.`
			`The loss function evaluations are fed into the opposite model as loss.`
			`The two models train each other.`

add cmpl stuff and structure image 2023-05-19 18:18:57 +02:00
			`\begin{figure}[h]`
			`\centering`
			`\includegraphics[width=\linewidth]{../presentation/rsc/structure}`
add stuff 2023-06-10 12:11:21 +02:00			`\caption{Architecture of Cross-Model Pseudo-Labeling}`
add cmpl stuff and structure image 2023-05-19 18:18:57 +02:00			`\label{fig:cmpl-structure}`
			`\end{figure}`
add summary template 2023-03-29 14:14:05 +02:00
add cmpl stuff and structure image 2023-05-19 18:18:57 +02:00			`\subsection{Math of CMPL}\label{subsec:math}`
add remaining loss formulas fix some typos 2023-05-22 18:28:41 +02:00			`The loss function of CMPL is similar to that one explaind above.`
add stuff 2023-06-10 12:11:21 +02:00			`But we have to differ from the loss generated from the supervised samples where the labels are known and the unsupervised loss where no labels are knonw.`
add remaining loss formulas fix some typos 2023-05-22 18:28:41 +02:00
			`The two equations~\ref{eq:cmpl-losses1} and~\ref{eq:cmpl-losses2} are normal Cross-Entropy loss functions generated with the supervised labels of the two seperate models.`


			`\begin{align}`
			`\label{eq:cmpl-losses1}`
			`\mathcal{L}_s^F &= \frac{1}{B_l} \sum_{i=1}^{B_l} \mathcal{H}(y_i,F(\mathcal{T}^F_{\text{standard}}(v_i)))\\`
			`\label{eq:cmpl-losses2}`
			`\mathcal{L}_s^A &= \frac{1}{B_l} \sum_{i=1}^{B_l} \mathcal{H}(y_i,A(\mathcal{T}^F_{\text{standard}}(v_i)))`
			`\end{align}`

			`Equation~\ref{eq:cmpl-loss3} and~\ref{eq:cmpl-loss4} are the unsupervised losses.`
add stuff 2023-06-10 12:11:21 +02:00			`They are very similar to FastMatch, but important to note is that the confidence-based masking is applied to the opposite corresponding model.`
add remaining loss formulas fix some typos 2023-05-22 18:28:41 +02:00
			`\begin{align}`
			`\label{eq:cmpl-loss3}`
			`\mathcal{L}_u^F &= \frac{1}{B_u} \sum_{i=1}^{B_u} \mathbbm{1}(\max(p_i^A) \geq \tau) \mathcal{H}(\hat{y}_i^A,F(\mathcal{T}_{\text{strong}}(u_i)))\\`
			`\label{eq:cmpl-loss4}`
			`\mathcal{L}_u^A &= \frac{1}{B_u} \sum_{i=1}^{B_u} \mathbbm{1}(\max(p_i^F) \geq \tau) \mathcal{H}(\hat{y}_i^F,A(\mathcal{T}_{\text{strong}}(u_i)))`
			`\end{align}`

			`Finally to train the main objective an overall loss is calculated by simply summing all the losses.`
			`The loss is regulated by an hyperparamter $\lambda$ to enhance the importance of the supervised loss.`

fix eq 2023-03-30 00:29:23 +02:00			`\begin{equation}`
add remaining loss formulas fix some typos 2023-05-22 18:28:41 +02:00			`\label{eq:loss-main-obj}`
			`\mathcal{L} = (\mathcal{L}_s^F + \mathcal{L}_s^A) + \lambda(\mathcal{L}_u^F + \mathcal{L}_u^A)`
fix eq 2023-03-30 00:29:23 +02:00			`\end{equation}`
add summary template 2023-03-29 14:14:05 +02:00
add stuff 2023-06-10 12:11:21 +02:00			`\section{Architecture}\label{sec:Architecture}`
			`The used model architectures depend highly on the task to be performed.`
			`In this case the task is video action recognition.`
			`A 3D-ResNet50 was chosen for the main model and a smaller 3D-ResNet18 for the auxiliary model.`

add remaining loss formulas fix some typos 2023-05-22 18:28:41 +02:00			`\section{Performance}\label{sec:performance}`

			`In figure~\ref{fig:results} a performance comparison is shown between just using the supervised samples for training against some different pseudo label frameworks.`
			`One can clearly see that the performance gain with the new CMPL framework is quite significant.`
fix some typos and add some stuff 2023-05-27 11:40:13 +02:00			`For evaluation the Kinetics-400 and UCF-101 datasets are used.`
			`And as a backbone model a 3D-ResNet18 and 3D-ResNet50 are used.`
add stuff 2023-06-10 12:11:21 +02:00			`Even when only 1\% of true labels are known for the UCF-101 dataset 25.1\% of the labels could be predicted right.`
add summary template 2023-03-29 14:14:05 +02:00
			`\begin{figure}[h]`
			`\centering`
shorten the template 2023-03-29 14:52:30 +02:00			`\includegraphics[width=\linewidth]{../presentation/rsc/results}`
write introduction 2023-05-03 16:04:46 +02:00			`\caption{Performance comparisons between CMPL, FixMatch and supervised learning only}`
			`\label{fig:results}`
add summary template 2023-03-29 14:14:05 +02:00			`\end{figure}`

fix some typos and add some stuff 2023-05-27 11:40:13 +02:00			`\section{Further schemes}\label{sec:further-schemes}`
add some dataset infos 2023-06-03 23:20:07 +02:00			`How the pseudo-labels are generated may impact the overall performance.`
fix some typos and add some stuff 2023-05-27 11:40:13 +02:00			`In this paper the pseudo-labels are obtained by the cross-model approach.`
			`But there might be other strategies.`
			`For example:`
			`\begin{enumerate*}`
			`\item Self-First: Each network uses just its own prediction if its confident enough.`
			`If not, it uses its sibling net prediction.`
			`\item Opposite-First: Each net prioritizes the prediction of the sibling network.`
			`\item Maximum: The most confident prediction is leveraged.`
			`\item Average: The two predictions are averaged before deriving the pseudo-label`
			`\end{enumerate*}.`

			`Those are just other approaches one can keep in mind.`
			`This doesn't mean they are better, in fact they performed even worse in this study.`
add conclusion 2023-06-18 09:17:16 +02:00
			`\section{Conclusion}\label{sec:conclusion}`
			`In conclusion, Cross-Model Pseudo-Labeling demonstrates the potential to significantly advance the field of semi-supervised action recognition.`
			`Cross-Model Pseudo-Labeling outperforms the supervised-only approach over several experiments by a multiple.`
			`It surpasses most of the other existing pseudo-labeling frameworks.`
			`Through the integration of main and auxiliary models, consistency regularization, and uncertainty estimation, CMPL offers a powerful framework for leveraging unlabeled data and improving model performance.`
			`It paves the way for more accurate and efficient action recognition systems.`

add summary template 2023-03-29 14:14:05 +02:00			`%%`
			`%% The next two lines define the bibliography style to be used, and`
			`%% the bibliography file.`
			`\bibliographystyle{ACM-Reference-Format}`
			`\bibliography{sources}`

			`%%`
			`%% If your work has an appendix, this is the place to put it.`
			`\appendix`

add remaining loss formulas fix some typos 2023-05-22 18:28:41 +02:00			`% appendix`
move to subdir add ci 2023-03-14 22:26:54 +01:00
			`\end{document}`
add summary template 2023-03-29 14:14:05 +02:00			`\endinput`