publications

2024

  1. gmg_theory.png
    Mitigating Goal Misgeneralization via Minimax Regret
    𝐊𝐚𝐫𝐒𝐦 π€π›ππžπ₯ π’πšππžπ€ ,Β  Matthew Farrugia-Roberts ,Β  Hannah Erlebach , and 4 more authors
    Conference Paper under Review, 2024
  2. cachingmts.png
    Algorithms for Caching and MTS with reduced number of predictions
    𝐊𝐚𝐫𝐒𝐦 π€π›ππžπ₯ π’πšππžπ€ ,Β  andΒ  Marek Elias
    International Conference on Learning Representations (ICLR), 2024
  3. pruning.png
    Dynamic Vocabulary Pruning in Early-Exit LLMs
    Jort Vincenti ,Β  𝐊𝐚𝐫𝐒𝐦 π€π›ππžπ₯ π’πšππžπ€ ,Β  Joan Velja , and 2 more authors
    ENSLP Workshop, NeurIPS, 2024
  4. rl.png
    ’Explaining RL decisions with trajectories’: A reproducibility Study
    𝐊𝐚𝐫𝐒𝐦 π€π›ππžπ₯ π’πšππžπ€ ,Β  Matteo Nulli ,Β  Joan Velja , and 1 more author
    Transactions on Machine Learning Research (TMLR), 2024