Close Menu
Earth & BeyondEarth & Beyond

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    Second Mistrial Motion, ‘Freak-Off’ Audio

    Turkey loss gives Poch, USMNT more questions than answers

    5 Great Games To Kick Off Summer With

    Facebook X (Twitter) Instagram
    Earth & BeyondEarth & Beyond
    YouTube
    Subscribe
    • Home
    • Business
    • Entertainment
    • Gaming
    • Health
    • Lifestyle
    • Sports
    • Technology
    • Trending & Viral News
    Earth & BeyondEarth & Beyond
    Subscribe
    You are at:Home»Technology»OpenAI launches program to design new ‘domain-specific’ AI benchmarks
    Technology

    OpenAI launches program to design new ‘domain-specific’ AI benchmarks

    Earth & BeyondBy Earth & BeyondApril 9, 2025002 Mins Read
    Share Facebook Twitter Pinterest LinkedIn Tumblr Email
    OpenAI launches program to design new ‘domain-specific’ AI benchmarks
    Share
    Facebook Twitter LinkedIn Pinterest Email

    OpenAI, like many AI labs, thinks AI benchmarks are broken. It says it wants to fix them through a new program.

    Called the OpenAI Pioneers Program, the program will focus on creating evaluations for AI models that “set the bar for what good looks like,” as OpenAI phrased it in a blog post.

    “As the pace of AI adoption accelerates across industries, there is a need to understand and improve its impact in the world,” the company continued in its post. “Creating domain-specific evals are one way to better reflect real-world use cases, helping teams assess model performance in practical, high-stakes environments.”

    As the recent controversy with the crowdsourced benchmark LM Arena and Meta’s Maverick model illustrate, it’s tough to know, these days, precisely what differentiates one model from another. Many widely-used AI benchmarks measure performance on esoteric tasks, like solving doctorate-level math problems. Others can be gamed, or don’t align well with most people’s preferences.

    Through the Pioneers Program, OpenAI hopes to create benchmarks for specific domains like legal, finance, insurance, healthcare, and accounting. The lab says that, in the coming months, it’ll work with “multiple companies” to design tailored benchmarks and eventually share those benchmarks publicly, along with “industry-specific” evaluations.

    “The first cohort will focus on startups who will help lay the foundations of the OpenAI Pioneers Program,” OpenAI wrote in the blog post. “We’re selecting a handful of startups for this initial cohort, each working on high-value, applied use cases where AI can drive real-world impact.”

    Companies in the program will also have the opportunity to work with OpenAI’s team to create model improvements via reinforcement fine tuning, a technique that optimizes models for a narrow set of tasks, OpenAI says.

    The big question is whether the AI community will embrace benchmarks whose creation was funded by OpenAI. OpenAI has supported benchmarking efforts financially before, and designed its own evaluations. But partnering with customers to release AI tests may be seen as an ethical bridge too far.

    benchmarks design domainspecific launches OpenAI program
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Previous ArticlePort pileup coming as cash-short CEOs reject orders
    Next Article Scientists Map Miles of Wiring in a Speck of Mouse Brain
    Earth & Beyond
    • Website

    Related Posts

    here’s what it will do

    June 8, 2025

    Searching for Ancient Rocks in the ‘Forlandet’ Flats

    June 8, 2025

    Bill Atkinson, Macintosh Pioneer and Inventor of Hypercard, Dies at 74

    June 8, 2025
    Leave A Reply Cancel Reply

    Latest Post

    If you do 5 things, you’re more indecisive than most—what to do instead

    UK ministers launch investigation into blaze that shut Heathrow

    The SEC Resets Its Crypto Relationship

    How MLB plans to grow Ohtani, Dodger fandom in Japan into billions for league

    Stay In Touch
    • YouTube
    Latest Reviews

    here’s what it will do

    By Earth & BeyondJune 8, 2025

    Searching for Ancient Rocks in the ‘Forlandet’ Flats

    By Earth & BeyondJune 8, 2025

    Bill Atkinson, Macintosh Pioneer and Inventor of Hypercard, Dies at 74

    By Earth & BeyondJune 8, 2025

    Subscribe to Updates

    Get the latest tech news from FooBar about tech, design and biz.

    Most Popular

    Bitcoin in the bush – crypto mining brings power to rural areas

    March 25, 202513 Views

    Israeli Police Question Palestinian Director Hamdan Ballal After West Bank Incident

    March 25, 20258 Views

    How to print D&D’s new gold dragon at home

    March 25, 20257 Views
    Our Picks

    Second Mistrial Motion, ‘Freak-Off’ Audio

    Turkey loss gives Poch, USMNT more questions than answers

    5 Great Games To Kick Off Summer With

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    © 2025 Earth & Beyond.
    • About Us
    • Contact Us
    • Privacy Policy
    • Terms and Conditions
    • Disclaimer

    Type above and press Enter to search. Press Esc to cancel.

    Newsletter Signup

    Subscribe to our weekly newsletter below and never miss the latest product or an exclusive offer.

    Enter your email address

    Thanks, I’m not interested