This repository contains the code and report for a replication of the paper: Müller, Rafael, Simon Kornblith, and Geoffrey E. Hinton. "When does label smoothing help?" Advances in Neural Information Processing Systems 32 (2019).
This implementation uses Jupyter notebooks to visualize the results directly while keeping the logs of the training process accessible. The different neural networks are implemented in PyTorch.
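The technique under investigation, label smoothing, replaces the one-hot training targets with a mixture of the one-hot vector and a uniform distribution over the classes. As a point of reference only (the notebooks and architectures in this repository may implement it differently), recent PyTorch versions expose the idea directly through the `label_smoothing` argument of `nn.CrossEntropyLoss`:

```python
import torch
import torch.nn as nn

# Hard-target cross-entropy vs. label-smoothed cross-entropy.
# With label_smoothing=0.1 and K classes, the true class gets target
# probability (1 - 0.1) + 0.1 / K and every other class gets 0.1 / K.
logits = torch.randn(8, 10)            # batch of 8 examples, 10 classes
targets = torch.randint(0, 10, (8,))   # hard integer class labels

hard_loss = nn.CrossEntropyLoss()(logits, targets)
smooth_loss = nn.CrossEntropyLoss(label_smoothing=0.1)(logits, targets)

print(f"hard-target loss:    {hard_loss.item():.4f}")
print(f"label-smoothed loss: {smooth_loss.item():.4f}")
```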
Step One: Clone this repository to a location of your liking.
Step Two: Install the required packages (using Python 3.8-3.11):
pip install -r requirements.txt
After completing these steps, you should be able to run the Jupyter notebooks, provided you have an editor that can open them, such as Visual Studio Code with the appropriate extensions or JupyterLab.
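Independent of the editor, you can quickly confirm that the core dependencies installed correctly with a short sanity check (this assumes `torch` and `torchvision` are among the packages in `requirements.txt`):

```python
# Minimal environment check: confirms PyTorch and torchvision import
# correctly and reports whether a CUDA-capable GPU is available.
import torch
import torchvision

print("PyTorch version:     ", torch.__version__)
print("torchvision version: ", torchvision.__version__)
print("CUDA available:      ", torch.cuda.is_available())
```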
While most datasets are downloaded automatically via PyTorch, two datasets have to be downloaded manually and unpacked into the correct folder:
Dataset | Download Link | Data Folder |
---|---|---|
CUB-200-2011 | CUB_200_2011.tgz | data/CUB_200_2011 |
Tiny ImageNet | tiny-imagenet-200.zip | data/tiny-imagenet-200 |
Note that the files should be placed such that the "Data Folder" for CUB-200-2011 contains the `classes.txt` file, and the "Data Folder" for Tiny ImageNet contains the `words.txt` file.
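If you are unsure whether the archives were unpacked correctly, the following sketch checks for the two marker files mentioned above (the paths are taken from the table; run it from the project root):

```python
from pathlib import Path

# Marker files that should exist once the manually downloaded datasets
# have been unpacked into the expected folders (see the table above).
expected = {
    "CUB-200-2011": Path("data/CUB_200_2011/classes.txt"),
    "Tiny ImageNet": Path("data/tiny-imagenet-200/words.txt"),
}

for name, marker in expected.items():
    status = "OK" if marker.is_file() else "MISSING"
    print(f"{name:14s} {marker} ... {status}")
```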
If you intend to use the pre-trained models, this link provides the models we trained to verify the results of the paper. Simply download the `models` folder and place it directly in the project root folder.
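The evaluation cells in the notebooks load these models for you; if you want to inspect a checkpoint manually, a generic sketch might look like the following (the file name is hypothetical, and we assume the checkpoints are stored as PyTorch `state_dict`s; see the corresponding notebook for the exact loading code):

```python
import torch

# Hypothetical checkpoint path; actual file names depend on the contents
# of the downloaded models/ folder.
checkpoint_path = "models/example_checkpoint.pt"

# map_location="cpu" lets this run without a GPU. We assume the file
# stores a state_dict (the usual PyTorch convention).
state_dict = torch.load(checkpoint_path, map_location="cpu")
print(f"Loaded {len(state_dict)} parameter tensors from {checkpoint_path}")

# model.load_state_dict(state_dict)  # where `model` is the matching architecture
```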
The project consists of the following folder structure:
- `root`: This folder provides the notebooks for the corresponding experiments detailed in the replication paper.
- `datasets/`: This folder provides the wrapper classes for the used datasets, with custom DataLoaders for some of them.
- `architectures/`: This folder provides the custom implementations of the different neural networks.
- `util/`: This folder provides Python files for utility modules, as well as training methods for the different architectures.
- `figures/`: This folder provides the figures used in our replication paper, and some more.
- `models/`: This is where you should place the pre-trained models; your own trained models will also appear here.
- `report/`: This folder contains the LaTeX source code of the replication report.
The available notebooks correspond to the following architectures, datasets, and experiments in the replication paper:
File Name | Architecture | Dataset | Accuracy (Section 3) | Implicit Model Calibration (Section 5) | Knowledge Distillation (Section 6) |
---|---|---|---|---|---|
`FC_MNIST_Accuracy_IMC_Toy.ipynb` | Fully-Connected | MNIST | ✓ | ✓ | |
`FC_MNIST_KD.ipynb` | Fully-Connected | MNIST | | | ✓ |
`FC_EMNIST_Accuracy_IMC.ipynb` | Fully-Connected | EMNIST | ✓ | ✓ | |
`FC_FMNIST_Accuracy_IMC.ipynb` | Fully-Connected | FMNIST | ✓ | ✓ | |
`AlexNet_CIFAR10_Accuracy_IMC.ipynb` | AlexNet | CIFAR-10 | ✓ | ✓ | |
`ResNet34_CUB-200-2011_Accuracy_IMC.ipynb` | ResNet-34 | CUB-200-2011 | ✓ | ✓ | |
`ResNet50_TinyImageNet_Accuracy_IMC.ipynb` | ResNet-50 | Tiny ImageNet | ✓ | ✓ | |
`ResNet56_CIFAR10_Accuracy_IMC.ipynb` | ResNet-56 | CIFAR-10 | ✓ | ✓ | |
`ResNet56_CIFAR100_Accuracy_IMC.ipynb` | ResNet-56 | CIFAR-100 | ✓ | ✓ | |
`ResNet56_AlexNet_CIFAR10_KD.ipynb` | ResNet-56/AlexNet | CIFAR-10 | | | ✓ |
`Transformer_Multi30K_IMC.ipynb` | Transformer | Multi30k | | ✓ | |
Furthermore, the `PenultimateLayerRepresentation.ipynb` notebook combines the experiments for the Penultimate Layer Representation (Section 4).
Inside the notebooks, you can execute the cells in whichever order you want.
While a full training cycle might take several hours (or even days), with the pre-trained models you only need to execute the evaluation cells. As such, the models can be used to directly validate our findings. Note that some of the evaluations might still take several minutes.
Don't be afraid to try out our code and do your own experiments with it :)