-
Explanatory interactive learning Can we, by interacting with models during training, encourage their explanations to line up with our priors on what parts of the input are relevant?
-
Mosaic makes it easier for ML practitioners to interact with high-dimensional, multi-modal data. It provides simple abstractions for data inspection, model evaluation and model training supported by efficient and robust IO under the hood. Mosaic's core contribution is the DataPanel, a simple columnar data abstraction. The Mosaic DataPanel can house columns of arbitrary type – from integers and strings to complex, high-dimensional objects like videos, images, medical volumes and graphs.
- Introducing Mosaic (blog post)
- Working with Images in Mosaic (Google Colab)
- Working with Medical Images in Mosaic (Google Colab)
With the ability to train models without needing labelled data through self-supervision, the focus became on scaling models up and training on more data.
- GPT-3 was the first 170B parameter model capable of few-shot in-context learning developed by OpenAI.
- Moore's Law for Everything is a post about scale and its effect on AI / society.
- Switch Transformers is a mixture of experts for training massive models beyond the scale of GPT-3.
The way experts interact with their data (e.g. a radiologist’s eye movements) contains rich information about the task (e.g. classification difficulty), and the expert (e.g. drowsiness level). With the current trend of wearable technology (e.g. AR with eye tracking capability), the hardware needed to collect such human-data interactions is expected to become more ubiquitous, affordable, and standardized. In observational supervision, we investigate how to extract the rich information embedded in the human-data interaction, to either supervise models from scratch, or to improve model robustness.
Interesting works have collected observational signals such as:
- Eye tracking data in medicine:
- Eye tracking plus brain activity in NLP (Zuco dataset)
Critical papers in observational supervision:
- Some of the pioneering work on using gaze data. N. Hollenstein and C. Zhang showed how to use gaze data to improve NLP models paper.
- Improving zero-shot learning with gaze by N. Karasseli et al. paper
- Weak supervision and multi-task learning with gaze by K. Saab et al. paper
Existing works:
- Oala, L., Aversa, M., Nobis, G., Willis, K., Neuenschwander, Y., Buck, M., Matek, C., Extermann, J., Pomarico, E., Samek, W. and Murray-Smith, R., 2022. Data Models for Dataset Drift Controls in Machine Learning With Images. arXiv preprint arXiv:2211.02578. paper | code | data
- Maier, A., Köstler, H., Heisig, M., Krauss, P. and Yang, S.H., 2022. Known operator learning and hybrid machine learning in medical imaging—a review of the past, the present, and the future. Progress in Biomedical Engineering. paper
- Raissi, M., Perdikaris, P. and Karniadakis, G.E., 2019. Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. Journal of Computational physics, 378, pp.686-707. paper
- Wright, L.G., Onodera, T., Stein, M.M. et al. Deep physical neural networks trained with backpropagation. Nature 601, 549–555 (2022). paper