News
Leading models take chilling tradeoffs in realistic scenarios, new research finds
Models that maximize business performance in realistic role-play scenarios are also more likely to inflict harms.
News
Models that maximize business performance in realistic role-play scenarios are also more likely to inflict harms.
News
Aspirations to adapt the principle of 'defense in depth' from nuclear engineering to AI appear to fall short on key requirements. In a preprint from October 13, two researchers from the Ruhr University Bochum and the University of Bonn in Germany found that while leading AI companies say
News
Results add to doubts about whether corporations can be expected to voluntarily align powerful incoming AI systems when they do not align existing algorithms. In a study from September 17, a group of researchers from the University of Michigan, Stanford University, and the Massachusetts Institute of Technology (MIT) showed that