![]() | Up a level |
Ghilardi, Davide, Belotti, Federico, Molinari, Marco and Lim, Jaehyuk (2024) Accelerating sparse autoencoder training via layer-wise transfer learning in large language models. In: Belinkov, Yonatan, Kim, Najoung, Jumelet, Jaap, Mohebbi, Hosein, Mueller, Aaron and Chen, Hanjie, (eds.) Proceedings of the 7th BlackboxNLP Workshop: Analyzing and Interpreting Neural Networks for NLP. Proceedings of BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP) (7). Association for Computational Linguistics (ACL), Miami, FL, 530 - 550. ISBN 9798891761704