This blog covers my project for AI Safety Fundamentals Alignment Course, showing how steering vectors created with using Sparse Auto-Encoders can affect a models generated output.
Using an SAE as a Steering Vector
Using an SAE as a Steering Vector
Using an SAE as a Steering Vector
This blog covers my project for AI Safety Fundamentals Alignment Course, showing how steering vectors created with using Sparse Auto-Encoders can affect a models generated output.