Close Menu
Earth & BeyondEarth & Beyond

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    Tyler, the Creator Releases New Album ‘Don’t Tap the Glass’

    Petitti letter calls for no new penalties in Michigan sign-stealing scandal

    All Of Superman’s Best, Coolest, Most Memorable Moments

    Facebook X (Twitter) Instagram
    Earth & BeyondEarth & Beyond
    YouTube
    Subscribe
    • Home
    • Business
    • Entertainment
    • Gaming
    • Health
    • Lifestyle
    • Sports
    • Technology
    • Trending & Viral News
    Earth & BeyondEarth & Beyond
    Subscribe
    You are at:Home»Technology»OpenAI’s o3 tops new AI league table for answering scientific questions
    Technology

    OpenAI’s o3 tops new AI league table for answering scientific questions

    Earth & BeyondBy Earth & BeyondJuly 10, 2025003 Mins Read
    Share Facebook Twitter Pinterest LinkedIn Tumblr Email
    OpenAI’s o3 tops new AI league table for answering scientific questions
    Share
    Facebook Twitter LinkedIn Pinterest Email

    A smartphone screen with a colourful background displays app icons for DeepSeek, ChatGPT and Google Gemini.

    The AI tools Gemini and DeepSeek came behind o3 in a league table of responses to scientific questions.Credit: Andrey Rudakov/Bloomberg via Getty

    o3, an artificial intelligence (AI) model developed by the creators of ChatGPT, has been ranked the best AI tool for answering science questions in multiple fields, according to a benchmarking platform launched last week.

    SciArena, developed by the Allen Institute for Artificial Intelligence (Ai2) in Seattle, Washington, ranked 23 large language models (LLMs) according to their answers to scientific questions. The quality of the answers was voted on by 102 researchers. o3, created by OpenAI in San Francisco, California, was ranked the best at answering questions on natural sciences, health care, engineering, and humanities and social science, after more than 13,000 votes.

    DeepSeek-R1, built by DeepSeek in Hangzhou, China, came second on natural-sciences questions and fourth on engineering. Google’s Gemini-2.5-Pro ranked third in natural sciences and fifth in engineering and health care.

    Users’ preference for o3 might stem from the model’s tendency to give a lot of detail on the literature it cites and to produce technically nuanced responses, says Arman Cohan, a research scientist at Ai2. But explaining why models’ performance varies is challenging because most are proprietary. Differences in training data and what the model has been optimized for, among other things, could partially explain it, he says.

    SciArena is the latest platform developed to evaluate how AI models perform on certain tasks — and one of the first to rank performance on scientific tasks using crowdsourced feedback. “SciArena is a positive effort that motivates a careful evaluation of LLM-assisted literature tasks,” says Rahul Shome, a robotics and AI researcher at the Australian National University in Canberra.

    Randomly selected

    To rank the 23 LLMs, SciArena asked researchers to submit scientific questions. They received answers from two randomly selected models, which supported their responses with references drawn from Semantic Scholar, an AI research tool also created by Ai2. Users then voted on whether one model provided the best answer, the two models were comparable or both performed badly.

    The platform is now publicly available and lets users ask research questions for free. All users get answers from two models and can vote on their performance, but only votes from verified users who consent to the terms are included in the leaderboard, which the company says will be updated frequently.

    The ability to question LLMs on science topics and have confidence in the answers will help researchers to keep up with the latest literature in their field, says AI researcher Jonathan Kummerfeld at the University of Sydney in Australia. “This will help researchers find work they may have otherwise missed.”

    answering league OpenAIs Questions scientific table tops
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Previous ArticleBank of Korea keeps rates steady as it assesses measures to cool Seoul’s housing market
    Next Article Australia news live: universities, cultural bodies and broadcasters to be targeted in antisemitism envoy plan | Australia news
    Earth & Beyond
    • Website

    Related Posts

    Former Tesla president discloses the secret to scaling a company

    July 21, 2025

    How to design an actually good flash flood alert system

    July 21, 2025

    How your research can survive a US federal grant termination

    July 21, 2025
    Leave A Reply Cancel Reply

    Latest Post

    If you do 5 things, you’re more indecisive than most—what to do instead

    UK ministers launch investigation into blaze that shut Heathrow

    The SEC Resets Its Crypto Relationship

    How MLB plans to grow Ohtani, Dodger fandom in Japan into billions for league

    Stay In Touch
    • YouTube
    Latest Reviews

    Former Tesla president discloses the secret to scaling a company

    By Earth & BeyondJuly 21, 2025

    How to design an actually good flash flood alert system

    By Earth & BeyondJuly 21, 2025

    How your research can survive a US federal grant termination

    By Earth & BeyondJuly 21, 2025

    Subscribe to Updates

    Get the latest tech news from FooBar about tech, design and biz.

    Most Popular

    Bitcoin in the bush – crypto mining brings power to rural areas

    March 25, 202513 Views

    Israeli Police Question Palestinian Director Hamdan Ballal After West Bank Incident

    March 25, 20258 Views

    How to print D&D’s new gold dragon at home

    March 25, 20257 Views
    Our Picks

    Tyler, the Creator Releases New Album ‘Don’t Tap the Glass’

    Petitti letter calls for no new penalties in Michigan sign-stealing scandal

    All Of Superman’s Best, Coolest, Most Memorable Moments

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    © 2025 Earth & Beyond.
    • About Us
    • Contact Us
    • Privacy Policy
    • Terms and Conditions
    • Disclaimer

    Type above and press Enter to search. Press Esc to cancel.

    Newsletter Signup

    Subscribe to our weekly newsletter below and never miss the latest product or an exclusive offer.

    Enter your email address

    Thanks, I’m not interested