What this paper is about
The paper proposes prototypical networks for the problem of few-shot classification.[S1] The paper describes few-shot classification as a setting where a classifier must generalize to new classes not seen in the training set.[S1] The paper states that this generalization must happen given only a small number of examples of each new class.[S1] The paper describes prototypical networks as learning a metric space in which classification can be performed by computing distances to prototype representations of each class.[S1]
The paper positions its approach relative to recent few-shot learning methods by stating that prototypical networks reflect a simpler inductive bias that is beneficial in a limited-data regime.[S1] The paper reports that prototypical networks achieve excellent results compared to recent approaches for few-shot learning.[S1] The paper also reports an analysis showing that simple design decisions can yield substantial improvements over recent approaches involving complicated architectural choices and meta-learning.[S1]
The paper further extends prototypical networks to zero-shot learning.[S1] The paper reports state-of-the-art results on the CU-Birds dataset for this zero-shot extension.[S1]
Core claims to remember
Prototypical networks are proposed for few-shot classification in which a classifier must generalize to new classes not seen in the training set using only a small number of examples per new class.[S1] The method learns a metric space where classification is carried out by computing distances to prototype representations of each class.[S1]
The paper states that, compared to recent few-shot learning approaches, prototypical networks reflect a simpler inductive bias that is beneficial in a limited-data regime.[S1] The paper reports excellent results from prototypical networks compared to recent approaches for few-shot learning.[S1]
The paper reports an analysis in which simple design decisions yield substantial improvements over recent approaches that involve complicated architectural choices and meta-learning.[S1] The paper extends prototypical networks to zero-shot learning and reports state-of-the-art results on the CU-Birds dataset.[S1]
Limitations and caveats
The paper’s primary problem setting is few-shot classification where only a small number of examples are available for each new class.[S1] The paper’s few-shot setting is defined around generalizing to new classes that are not seen in the training set.[S1]
The paper’s zero-shot learning results are reported on the CU-Birds dataset in the extension described in the paper.[S1] The paper’s comparison language is expressed against “recent approaches for few-shot learning” and “recent approaches involving complicated architectural choices and meta-learning,” as stated in the paper’s description.[S1]
How to apply this in study or projects
Read the paper’s definition of few-shot classification as generalizing to new classes not seen in the training set given only a small number of examples per new class.[S1] Trace the paper’s description of prototypical networks as learning a metric space in which classification is performed by computing distances to prototype representations of each class.[S1]
Locate the section where the paper compares prototypical networks to recent few-shot learning approaches and records the claim that a simpler inductive bias is beneficial in a limited-data regime.[S1] Review the analysis the paper reports about simple design decisions yielding substantial improvements over approaches involving complicated architectural choices and meta-learning.[S1]
Follow the paper’s extension from few-shot learning to zero-shot learning and note how the paper reports state-of-the-art results on the CU-Birds dataset for that extension.[S1]