Research
Charles O'Neill
Compute Optimal Inference and Provable Amortisation Gap in Sparse Autoencoders
Charles O'Neill
Sparse Autoencoders for Disentangling Dense Embeddings of Scientific Concepts
Charles O'Neill
Sparse autoencoders for dense text embeddings reveal hierarchical feature sub-structure
Charles O'Neill
Steering semantic search with interpretable features from sparse autoencoders
Charles O'Neill
Disentangling Dense Embeddings with Sparse Autoencoders
Charles O'Neill and Thang Bui
Sparse Autoencoders Enable Scalable and Reliable Circuit Identification in Language Models
Charles O'Neill, Yuan-Sen Ting, Ioana Ciuca, Roberta Raileanu, Jack Miller, and Thang Bui
Steering Language Generation: Harnessing Contrastive Expert Guidance and Negative Prompting for Coherent and Diverse Synthetic Data Generation
Tuan Dung Nguyen, Yuan-Sen Ting, Ioana Ciuca, Charles O'Neill, and others
AstroLLaMA: Towards specialised foundation models in astronomy
Jack Miller, Charles O'Neill, and Thang Bui
Grokking Beyond Neural Networks: An Empirical Exploration with Model Complexity
Ernest Perkowski, Rui Pan, Tuan Dung Nguyen, Yuan-Sen Ting, Sandor Kruk, Tong Zhang, Charles O'Neill, and others
AstroLLaMA-Chat: Scaling AstroLLaMA with Conversational and Diverse Datasets
Jack Miller, Patrick Gleeson, Charles O'Neill, Thang Bui, and Noam Levi
Measuring Sharpness in Grokking