1sec.ai

Tag

#safety-benchmarks

Every item tagged safety-benchmarks, newest first.

1 item

models · Oct 27

Addendum to GPT-5 System Card: Sensitive conversations

OpenAI published an addendum to the GPT-5 system card focused on sensitive conversations. It details improvements in how GPT-5 handles emotional reliance, mental health topics, and jailbreak attempts, and it provides new benchmarks and metrics for evaluating the model's performance in these areas. You can use these benchmarks to assess GPT-5's capabilities and limitations on sensitive topics. The update is part of OpenAI's ongoing work on GPT-5's safety and reliability.

Key takeaways
  • GPT-5 shows improved handling of emotional reliance and mental health conversations.
  • The addendum adds new benchmarks for jailbreak resistance.
  • OpenAI provides metrics for evaluating GPT-5's sensitive conversation capabilities.