This is a PyTorch implementation of the paper *Optimization as a Model for Few-Shot Learning* (Ravi & Larochelle, 2017).
- Gradient-based optimization algorithms are not designed to work with only a limited number of updates, especially when the objective function is non-convex (so only a local optimum can be reached).
- Because the parameters are randomly initialized, a limited number of updates will not lead to a good solution. (Relying on pre-trained network parameters also greatly reduces accuracy as the pre-training task diverges from the target task.)

Meta-learning therefore works on two levels. The first is the quick acquisition of knowledge within each separate task presented. This process is guided by the second, which involves a slower extraction of the information learned across all the tasks.
- We leverage the observation that the gradient-descent update resembles the cell-state update of an LSTM (see the correspondence sketched after this list).
- Avoiding batch normalization during meta-testing
- The preprocessing method of Andrychowicz et al. (2016) works well when applied to both the gradients and the losses at each time step (sketched below).
- We set the cell state of the LSTM to be the parameters of the learner, i.e. c_t = θ_t.
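Concretely, the gradient-descent step and the LSTM cell-state update line up as follows (notation as in the paper):

```latex
% Gradient-descent update of the learner's parameters
\theta_t = \theta_{t-1} - \alpha_t \nabla_{\theta_{t-1}} \mathcal{L}_t

% LSTM cell-state update
c_t = f_t \odot c_{t-1} + i_t \odot \tilde{c}_t

% The two coincide when
f_t = 1, \qquad c_{t-1} = \theta_{t-1}, \qquad i_t = \alpha_t, \qquad \tilde{c}_t = -\nabla_{\theta_{t-1}} \mathcal{L}_t
```

Instead of fixing i_t and f_t, the meta-learner LSTM learns them as functions of the current gradient, loss, learner parameters, and previous gate values, so it effectively learns an adaptive learning rate (i_t) and a weight-decay-like forgetting term (f_t).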
💡 The learner and the meta-learner are two different things: the learner is a neural-network classifier, and the meta-learner is an LSTM-based optimizer trained to optimize the learner through an update analogous to the LSTM cell-state update.
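For the preprocessing mentioned in the list above, here is a minimal sketch of the log/sign transform from Andrychowicz et al. (2016); the function name, the constant p = 10, and the tensor layout are illustrative assumptions rather than this repository's exact code:

```python
import torch

def preprocess(x, p=10.0):
    """Log/sign preprocessing of Andrychowicz et al. (2016), applied element-wise
    to the gradients and losses before they are fed to the meta-learner LSTM.
    Each scalar v is mapped to a 2-D vector:
        (log|v| / p, sign(v))     if |v| >= exp(-p)
        (-1,         exp(p) * v)  otherwise
    """
    p = torch.tensor(p, dtype=x.dtype)
    large = x.abs() >= torch.exp(-p)
    first = torch.where(large, x.abs().clamp_min(1e-12).log() / p, torch.full_like(x, -1.0))
    second = torch.where(large, torch.sign(x), torch.exp(p) * x)
    return torch.stack([first, second], dim=-1)

# A tiny gradient falls into the "small" branch, larger values into the log/sign branch.
g = torch.tensor([1e-8, 0.5, -2.0])
print(preprocess(g))   # shape (3, 2)
```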
- Here the output produced by the meta-learner is used by both the learner and the meta-learner at the next iteration; hence the two arrows leaving the meta-learner.
- The dashed line indicates that the gradient of the learner's loss with respect to its parameters is used in the meta-learner's output equation.
- Randomly initialize the meta-learner parameters (the meta-learner's cell state is initialized with the parameters of the learner network)
- Iterate over the datasets in the meta-train set (a minimal sketch of this training loop appears below):
- Randomly sample D_train and D_test from the meta-train set
- Initialize the learner parameters
- Train on D_train: randomly pick a batch from D_train
- Compute the loss using the current learner parameters
- Update the learner parameters via the meta-learner's LSTM cell update
- Compute the loss on D_test using the updated learner parameters
- Update the meta-learner parameters using gradient descent
- Parameter sharing (the meta-learner's weights are shared across all coordinates of the learner's parameters)
- Avoiding batch normalization during meta-testing
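Putting the steps above together, here is a minimal, self-contained sketch of the meta-training loop. It illustrates the structure only and is not this repository's code: the learner is shrunk to a linear classifier on synthetic data, the meta-learner LSTM is reduced to a single per-coordinate gating layer, and all names (MetaLearner, learner_loss) and sizes are made up for the example.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MetaLearner(nn.Module):
    """Toy stand-in for the meta-learner. The paper's meta-learner is an LSTM whose
    weights are shared across all coordinates of the learner's parameters; here a
    single linear layer maps (gradient, parameter) -> (input gate i_t, forget gate f_t)
    per coordinate, which is enough to show the structure of the training loop."""
    def __init__(self, num_params):
        super().__init__()
        # c_0: the learner's initial parameters are part of the meta-learner's state.
        self.theta0 = nn.Parameter(torch.zeros(num_params))
        self.gate = nn.Linear(2, 2)

    def step(self, theta, grad):
        x = torch.stack([grad, theta], dim=-1)            # (num_params, 2)
        i_t, f_t = torch.sigmoid(self.gate(x)).unbind(-1)
        return f_t * theta - i_t * grad                   # c_t = f_t*c_{t-1} + i_t*(-grad)

def learner_loss(theta, x, y, in_dim=8, n_classes=5):
    """Functional forward pass of a linear classifier whose weights live in `theta`,
    so gradients of the meta-loss can flow back through the parameter updates."""
    W = theta[: in_dim * n_classes].view(n_classes, in_dim)
    b = theta[in_dim * n_classes:]
    return F.cross_entropy(x @ W.t() + b, y)

in_dim, n_classes, inner_steps = 8, 5, 5
meta = MetaLearner(num_params=in_dim * n_classes + n_classes)
meta_opt = torch.optim.Adam(meta.parameters(), lr=1e-3)

for episode in range(100):                                # iterate over meta-train episodes
    # Randomly sampled D_train / D_test for this episode (synthetic data as placeholder).
    x_tr, y_tr = torch.randn(25, in_dim), torch.randint(0, n_classes, (25,))
    x_te, y_te = torch.randn(15, in_dim), torch.randint(0, n_classes, (15,))

    theta = meta.theta0                                   # initialize learner parameters from c_0
    for _ in range(inner_steps):                          # train the learner on D_train
        loss = learner_loss(theta, x_tr, y_tr)            # loss with current learner parameters
        grad, = torch.autograd.grad(loss, theta, create_graph=True)
        theta = meta.step(theta, grad)                    # LSTM-style cell update of the parameters

    meta_loss = learner_loss(theta, x_te, y_te)           # evaluate updated learner on D_test
    meta_opt.zero_grad()
    meta_loss.backward()                                  # gradient descent on the meta-learner
    meta_opt.step()
```

Note that this sketch simply backpropagates through everything; the paper additionally makes the simplifying assumption that the learner's gradient does not depend on the meta-learner parameters.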
[Reading the images from each folder separately using the ImageDataset class] In each episode, n_shot + n_eval = batch_size images are read from each folder.
ImageDataset is instantiated while creating an episode and is responsible for reading the images in each folder.
The episode sampler is used as a batch sampler while loading the data: BatchSampler takes indices from your Sampler() instance (in this case 3 of them) and returns them as a list, so they can be used in your MyDataset `__getitem__` method (see the sketch below).
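Below is a minimal, self-contained sketch of this dataset/sampler pattern. The class names (ClassFolderDataset, EpisodeSampler) and all sizes are illustrative stand-ins for the repo's ImageDataset and episode sampler, and random tensors replace the actual image reading:

```python
import torch
from torch.utils.data import Dataset, Sampler, BatchSampler, DataLoader

class ClassFolderDataset(Dataset):
    """Hypothetical stand-in for ImageDataset: index i corresponds to one class
    folder, and __getitem__ returns n_shot + n_eval images sampled from it."""
    def __init__(self, n_classes=20, n_shot=5, n_eval=15, image_shape=(3, 84, 84)):
        self.n_classes = n_classes
        self.batch_size = n_shot + n_eval            # images read per class per episode
        self.image_shape = image_shape

    def __len__(self):
        return self.n_classes

    def __getitem__(self, class_idx):
        # The real loader reads image files from the class folder;
        # random tensors stand in for decoded images here.
        images = torch.randn(self.batch_size, *self.image_shape)
        labels = torch.full((self.batch_size,), class_idx, dtype=torch.long)
        return images, labels

class EpisodeSampler(Sampler):
    """Yields the class indices that make up each episode (e.g. 5-way -> 5 indices)."""
    def __init__(self, n_classes, n_way, n_episodes):
        self.n_classes, self.n_way, self.n_episodes = n_classes, n_way, n_episodes

    def __iter__(self):
        for _ in range(self.n_episodes):
            yield from torch.randperm(self.n_classes)[: self.n_way].tolist()

    def __len__(self):
        return self.n_episodes * self.n_way

dataset = ClassFolderDataset()
sampler = BatchSampler(EpisodeSampler(len(dataset), n_way=5, n_episodes=10),
                       batch_size=5, drop_last=False)    # one batch = one episode's classes
loader = DataLoader(dataset, batch_sampler=sampler)

for images, labels in loader:
    # images: (n_way, n_shot + n_eval, C, H, W); split into D_train / D_test per class
    print(images.shape, labels.shape)
    break
```

Used this way, each batch handed out by the DataLoader is exactly the set of classes for one episode: the first n_shot images of every class form D_train and the remaining n_eval form D_test.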
If you have any doubts, contact me: [email protected]