- Subset Selection
- Best Subset Selection
- Stepwise Selection
- Ridge Regression
- Lagrange Multiplier
- Bayesian Interpretation (Maximum a Posteriori Estimation)
- LASSO Regression
- Bayesian Interpretation (Maximum a Posteriori Estimation)
- Shrinking parameters towards zero
- Principal Component Analysis (PCA)
- Eigenvalues Decomposition
- Variance Maximization
- K-dimension Representation
- Curse of dimension
- From Fourier Series to Fourier Transform
- Inner Product
- Fourier Series
- Fourier Transform
- Convolution Theorem
- Spectral Graph Theory: Graph Laplacian
- Graph Fourier Transform and Graph Convolution
- Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering (NIPS 2016)
- Semi-Supervised Classification with Graph Convolutional Networks (ICLR2017)
- Modeling Relational Data with Graph Convolutional Networks (ESWC 2018)
- Transformer
- Self/Global/Soft/Local/Hard Attention
- Locality-sensitive hashing