After releasing a flurry of wave-making AI tools like ChatGPT -- the prevailing gold standard for chatbots -- and text-to-image generator DALL.E, OpenAI has finally turned its attention towards ...
Most AI safety research treats alignment as an all-or-nothing property. Our framework shows it's more complex. The same AI can be aligned with humans in one context but misaligned in another. Ideally, ...