An early warning system for novel AI risks
DeepMind Blog
MAY 24, 2023
AI researchers already use a range of evaluation benchmarks to identify unwanted behaviours in AI systems, such as making misleading statements, making biased decisions, or repeating copyrighted content.