Publications
Publications in reversed chronological order. Please check my Google Scholar for an up-to-date list.
2025
- How language models learn facts? Dynamics, curricula and hallucinationsarXiv preprint arXiv:2503.21676, 2025
2024
2023
2022
- Beyond backpropagation: bilevel optimization through implicit differentiation and equilibrium propagationNeural Computation, 2022
2021
- Learning where to learn: Gradient sparsity in meta and continual learningIn Advances in Neural Information Processing Systems, 2021