LLM Activation Engineering: An Easy Foray
This is a recap of an old project from May 2024. Credit to Neel Nanda for llm lens and Mihaiiii for llm steer python modules. I've been playing around with steering LLM outputs by manipulating their
Mar 31, 20265 min read202