Associate professor at the Institute of Computer Science, Polish Academy of Sciences.
I studied theoretical physics, work in computer science, prove theorems in mathematics, and research linguistics.
My interests revolve around:
- information theory: grammar-based codes, prediction by partial matching, Kolmogorov complexity
- stochastic processes: sigma-fields, ergodic decomposition, Santa Fe processes, algorithmic randomness
- quantitative linguistics: Zipf's law, Hilberg's law, hapax rate, urn model, maximal repetition, decay of correlations
- large language models: neural scaling law, overparametrization, embeddings, hallucinations, memory
- computational mechanics: universal prediction, epsilon-machine, causal states, excess entropy
I wrote one monograph and two textbooks:
- Information Theory Meets Power Laws: Stochastic Processes and Language Models
- A Short Course in Universal Coding
- Information Theory and Statistics
Here are my CV and publications, slides, teaching materials.
💬 My email address is [email protected]. Ping me if you want to collaborate.
My prospective PhD student should read this.
Profiles: IPI PAN, ORCID, arXiv, GitHub, DBLP, Google Scholar, ResearchGate, LinkedIn, Twitter.
🎉 A few lighter items to catch your attention:
- I have a strictly larger Erdős number than my first PhD student.
- Jak się wzbogacić prawie na pewno? (How to get rich almost surely?)
- Charty zostały... czyli o generowaniu wierszy sylabicznych. (On automatic generation of rhymed poems.)
- The Chaos by Gerard Nolst Trenité, transcribed into IPA symbols by me.
- The Making of My Mother's Book: From a Family Database to Family Trees.
- The Making of My Mother's Book: Named Entity Recognition for the Index of Persons.
- Wycieczki 1992-2022. (Trips 1992-2022.)