Arguably, the main reason that deep nets became so powerful is self-supervision. In many domains, from images to text to DNA analysis, self-supervision alone was enough to generate practically infinite “labelled” data for training deep models. The idea is simple yet extremely powerful: just hide some parts of the (unlabelled) data and turn the hidden parts into the labels to predict.
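A minimal sketch of the idea (a toy example of my own, not from any particular paper): take an unlabelled token sequence, hide one token at a time, and use each hidden token as a label.

```python
def make_self_supervised_pairs(tokens, mask="<MASK>"):
    """For each position, hide that token and use it as the label."""
    pairs = []
    for i in range(len(tokens)):
        hidden = tokens[:i] + [mask] + tokens[i + 1:]  # input with one token hidden
        pairs.append((hidden, tokens[i]))              # (corrupted input, label)
    return pairs

sentence = ["deep", "nets", "love", "data"]
pairs = make_self_supervised_pairs(sentence)
# e.g., pairs[0] == (["<MASK>", "nets", "love", "data"], "deep")
```

No human annotation is needed: every unlabelled sequence yields as many training pairs as it has tokens.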
Here are some notes (mostly to myself) about self-supervision.
There are two standard ways to do self-supervision: auto-regression and de-noising. Auto-regression typically involves a causal model, i.e., we aim to predict the next word given the previous words, without using words “from the future” (e.g., if we predict the nth word of a sentence, we have access only to the first n-1 words). De-noising typically does not rely on a causal model. An example of de-noising would be taking an image, hiding or adding noise to some part of it, and then predicting the noise-free (i.e., original) image.
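The two recipes can be sketched on a toy sentence (the variable names and the 30% masking rate below are my own choices for illustration):

```python
import random

tokens = ["the", "cat", "sat", "on", "the", "mat"]

# Auto-regression (causal): the target at step n is token n, and the
# model may only look at the first n-1 tokens -- no words "from the future".
ar_pairs = [(tokens[:n], tokens[n]) for n in range(1, len(tokens))]
# e.g., ar_pairs[1] == (["the", "cat"], "sat")

# De-noising (non-causal): corrupt some positions at random, then predict
# the whole clean sequence; the model may look at every surviving token.
random.seed(0)
corrupted = [t if random.random() > 0.3 else "<MASK>" for t in tokens]
dn_pair = (corrupted, tokens)  # (noisy input, clean target)
```

Note the asymmetry: auto-regression yields one prediction per prefix, while de-noising yields one corrupted-input/clean-output pair per corruption draw.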
More to be added to this post:
- Discrete vs. Continuous (i.e., the power of embedding)
- Conv-nets vs. Attention-based nets
- Probabilistic vs. deterministic
- Latent space vs. original space