Muriz Serifovic
Hi! 👋, I'm a Machine Learning Engineer at
Paymira, where I train neural networks, bestowing low-rank updates into something useful. Besides work, my research interests include sparse autoencoders for model interpretability, and the mathematical foundations of learning theory (both in vivo and in silico). Previously ETH Zürich, Hesiod, ascarix, and Zühlke.
A question I keep gravitating around, is, why neural networks, fed by an insatiable appetite for compute, settle on some representations rather than others, which features emerge first, and which "solution" might be getting picked from the many that fit the data equally well, although Quine had this complaint long before anyone built a transformer:
Physical theories can be at odds with each other and yet compatible with all possible data even in the broadest sense.
research work.
from the blog archive.
Hi! 👋, I'm a Machine Learning Engineer at Paymira, where I train neural networks, bestowing low-rank updates into something useful. Besides work, my research interests include sparse autoencoders for model interpretability, and the mathematical foundations of learning theory (both in vivo and in silico). Previously ETH Zürich, Hesiod, ascarix, and Zühlke.
A question I keep gravitating around, is, why neural networks, fed by an insatiable appetite for compute, settle on some representations rather than others, which features emerge first, and which "solution" might be getting picked from the many that fit the data equally well, although Quine had this complaint long before anyone built a transformer:
Physical theories can be at odds with each other and yet compatible with all possible data even in the broadest sense.
Recent Projects: Hesiod.
My ha-index is well above 80 (which is considered excellent).