Tag: FrontierScience benchmark
OpenAI Aims to Build AI Able to Perform Research Tasks: Deep Research and FrontierScience Breakthroughs
OpenAI aims to build AI that can perform research tasks through its Deep Research agent and the FrontierScience benchmark, tackling PhD-level science with 26.6% accuracy on expert exams. ...
