Hello! I’m Jacob Merizian. I work on sandbagging and propensity at the UK AI Security Institute. In the past, I’ve done research in high-performance computing, language model pretraining, interpretability, and hardware-enabled governance.

writing

projects