Candidate number: NXLX9
Supervisors: Yihong Chen, Pasquale Minervini, Pontus Stenetorp
This repository contains all code corresponding to the stated thesis, submitted as part of the UCL Machine Learning MSc.
See the thesis PDF for the citations referenced in the Abstract.
Innovations in neural link prediction models have been plentiful of late, with representation learning approaches continuing to achieve state-of-the-art performance on benchmark datasets. Model performance has been shown to be highly dependent on hyperparameter choice, including the method and strength of regularisation. Sparsity in knowledge graph datasets, whose long-tailed entity and relation frequency distributions contain many infrequent entities and relations, presents a particular challenge for mitigating overfitting through regularisation. Work to date has focused on L2-norm or tensor N3-norm penalty regularisation, with regularisation strength controlled by a single scalar hyperparameter. This leaves significant scope for innovation in the regulariser, including heavily hyperparameterised regularisers, to improve model generalisation performance.
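For context, a minimal sketch of the kind of scalar-weighted N3-norm penalty referred to above (illustrative only; the function and tensor names are assumptions, not this repository's implementation):

```python
import torch

def n3_penalty(subj_emb: torch.Tensor,
               rel_emb: torch.Tensor,
               obj_emb: torch.Tensor,
               lam: float) -> torch.Tensor:
    """Nuclear 3-norm style penalty on the embedding factors of a batch of triples,
    scaled by a single scalar regularisation strength `lam`."""
    penalty = (subj_emb.abs() ** 3).sum(dim=-1) \
            + (rel_emb.abs() ** 3).sum(dim=-1) \
            + (obj_emb.abs() ** 3).sum(dim=-1)
    return lam * penalty.mean()
```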
Traditional approaches to hyperparameter optimisation have limited applicability beyond settings with small or medium hyperparameter dimensionality. We therefore explored gradient-based hyperparameter optimisation as an alternative better suited to scaling to high dimensions, and novel in the setting of neural link prediction. We empirically evaluated backward-mode iterative differentiation for hyper-gradient calculation, first in a toy bilevel optimisation setting, then in a single-regularisation-hyperparameter neural link prediction setting, and finally in a high-dimensional per-embedding hyperparameter setting. Multiple failure modes of the method were characterised and resolved through techniques including enforcement of positive-only regularisation hyperparameter values, gradient clipping, gradient-induced re-initialisation, and gradient accumulation and averaging. Optimisation of per-embedding regularisation hyperparameters, with full-batch training, was associated with improved validation loss and validation mean reciprocal rank compared to a grid-search baseline. This represents new evidence that the method may improve neural link prediction performance in combination with fine-grained regularisation hyperparameters. Preliminary results suggest evolutionary strategies provide a promising, more memory-efficient alternative to backpropagation for hyper-gradient estimation.
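The core idea can be illustrated with a minimal, self-contained sketch on a hypothetical toy quadratic problem (an assumption for illustration, not the thesis experiments). It shows backward-mode iterative differentiation through an unrolled inner training loop, together with two of the fixes mentioned above: optimising log(lambda) so the regularisation strength stays positive, and clipping the hyper-gradient before the outer update.

```python
import torch

torch.manual_seed(0)
x_train, y_train = torch.randn(64, 5), torch.randn(64)
x_valid, y_valid = torch.randn(64, 5), torch.randn(64)

log_lam = torch.zeros(1, requires_grad=True)   # outer (hyper) parameter: log(lambda)
outer_opt = torch.optim.Adam([log_lam], lr=1e-2)

for outer_step in range(100):
    w = torch.zeros(5, requires_grad=True)     # inner (model) parameter, re-initialised each outer step
    inner_lr = 0.1

    # Unrolled inner loop: create_graph=True keeps each SGD step in the graph so
    # the validation loss can later be differentiated with respect to log_lam.
    for _ in range(10):
        lam = log_lam.exp()                    # positivity enforced via the log-parameterisation
        train_loss = ((x_train @ w - y_train) ** 2).mean() + lam * (w ** 2).sum()
        grad_w, = torch.autograd.grad(train_loss, w, create_graph=True)
        w = w - inner_lr * grad_w

    valid_loss = ((x_valid @ w - y_valid) ** 2).mean()
    outer_opt.zero_grad()
    valid_loss.backward()                      # hyper-gradient via backprop through the unrolled steps
    torch.nn.utils.clip_grad_norm_([log_lam], max_norm=1.0)   # hyper-gradient clipping
    outer_opt.step()
```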
- toymeta/kbc-cli-toymeta2_higher_sqrdL2norm.py - used to perform toy experiments for the single-hyperparameter, noise-free setting (RQ1)
- toymeta/kbc-cli-toymeta2_higher_sqrdL2norm_multidim_noiseinject.py - used to perform toy experiments for the multidimensional hyperparameter and noisy settings (RQ1)
- kbc-cli-realmeta-grid.py - used to perform grid searches for baseline hyperparameter optimisation
- kbc-cli-realmeta-higher.py - used to perform all gradient-based hyperparameter optimisation with backpropagation in the single regularisation hyperparameter setting, except for gradient accumulation experiments (RQ2, RQ3)
- kbc-cli-realmeta-higher_gradaccum.py - used to perform gradient accumulation experiments (RQ2)
- kbc-cli-realmeta-higher_noexpon.py - used to perform gradient-based hyperparameter optimisation with backpropagation without enforcing a positive-only regularisation hyperparameter (i.e. before the log(lambda) parameterisation was introduced) (RQ2)
- kbc-cli-realmeta-higher_multireg.py - used to perform all gradient-based hyperparameter optimisation with backpropagation in the per-embedding regularisation hyperparameter setting (RQ4)
- kbc-cli-realmeta-higher-es.py - used to perform evolutionary strategy experiments (RQ5); see the conceptual sketch below
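Conceptually, the evolutionary-strategy variant replaces backpropagation through the unrolled training loop with a perturbation-based estimate of the hyper-gradient. The sketch below is illustrative only (the antithetic estimator shown and the `train_and_eval` callable are assumptions, not this repository's exact implementation); `log_lams` stands for a vector of per-embedding log regularisation strengths.

```python
import torch

def es_hypergrad(log_lams: torch.Tensor,
                 train_and_eval,            # hypothetical callable: log-lambda vector -> validation loss (float)
                 sigma: float = 0.05,
                 n_pairs: int = 8) -> torch.Tensor:
    """Antithetic ES estimate of d(validation loss)/d(log_lams).

    Each evaluation only needs a forward training run, so memory use does not
    grow with the number of inner optimisation steps, unlike backpropagation
    through an unrolled training loop."""
    grad = torch.zeros_like(log_lams)
    for _ in range(n_pairs):
        eps = torch.randn_like(log_lams)
        loss_plus = train_and_eval(log_lams + sigma * eps)
        loss_minus = train_and_eval(log_lams - sigma * eps)
        grad += (loss_plus - loss_minus) / (2.0 * sigma) * eps
    return grad / n_pairs
```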