Probability that a prompt or candidate matches a harm category.
Content has a high chance of being unsafe.
Content has a low chance of being unsafe.
Content has a medium chance of being unsafe.
Content has a negligible chance of being unsafe.
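
The four levels above form an ordered scale, so a caller can compare a reported probability against a cutoff. The following is a minimal sketch of that idea, not the API's own definition: the names HarmProbability and should_block, the member names, the integer values, and the default threshold are all assumptions chosen for illustration.

```python
from enum import IntEnum


class HarmProbability(IntEnum):
    """Hypothetical ordered scale; names and integer values are assumptions."""

    NEGLIGIBLE = 1  # Content has a negligible chance of being unsafe.
    LOW = 2         # Content has a low chance of being unsafe.
    MEDIUM = 3      # Content has a medium chance of being unsafe.
    HIGH = 4        # Content has a high chance of being unsafe.


def should_block(probability: HarmProbability,
                 threshold: HarmProbability = HarmProbability.MEDIUM) -> bool:
    """Return True when the probability is at or above the chosen threshold."""
    return probability >= threshold


if __name__ == "__main__":
    # Show the decision for every level under the default (assumed) threshold.
    for level in HarmProbability:
        print(level.name, should_block(level))
```

Using an integer-backed enum keeps the comparison explicit. A stricter or looser policy only needs a different threshold argument.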