Close Menu
Earth & BeyondEarth & Beyond

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    Jimmy Kimmel Suggests Sending ICE Agents in Minneapolis to Iran

    Football gossip: Anderson, Guehi, Abraham, Malen, Gallagher

    My Winter Car is a ‘dangerous, depressing and tiring’ life sim, and the developer doesn’t want you to play it unless you’ve mastered the infamously tricky My Summer Car

    Facebook X (Twitter) Instagram
    Earth & BeyondEarth & Beyond
    YouTube
    Subscribe
    • Home
    • Business
    • Entertainment
    • Gaming
    • Health
    • Lifestyle
    • Sports
    • Technology
    • Trending & Viral News
    Earth & BeyondEarth & Beyond
    Subscribe
    You are at:Home»Technology»OpenAI Is Asking Contractors to Upload Work From Past Jobs to Evaluate the Performance of AI Agents
    Technology

    OpenAI Is Asking Contractors to Upload Work From Past Jobs to Evaluate the Performance of AI Agents

    Earth & BeyondBy Earth & BeyondJanuary 10, 2026004 Mins Read
    Share Facebook Twitter Pinterest LinkedIn Tumblr Email
    OpenAI Is Asking Contractors to Upload Work From Past Jobs to Evaluate the Performance of AI Agents
    Share
    Facebook Twitter LinkedIn Pinterest Email

    OpenAI is asking third-party contractors to upload real assignments and tasks from their current or previous workplaces so that it can use the data to evaluate the performance of its next-generation AI models, according to records from OpenAI and the training data company Handshake AI obtained by WIRED.

    The project appears to be part of OpenAI’s efforts to establish a human baseline for different tasks that can then be compared with AI models. In September, the company launched a new evaluation process to measure the performance of its AI models against human professionals across a variety of industries. OpenAI says this is a key indicator of its progress towards achieving AGI, or an AI system that outperforms humans at most economically valuable tasks.

    “We’ve hired folks across occupations to help collect real-world tasks modeled off those you’ve done in your full-time jobs, so we can measure how well AI models perform on those tasks,” reads one confidential document from OpenAI. “Take existing pieces of long-term or complex work (hours or days+) that you’ve done in your occupation and turn each into a task.”

    OpenAI is asking contractors to describe tasks they’ve done in their current job or in the past and to upload real examples of work they did, according to an OpenAI presentation about the project viewed by WIRED. Each of the examples should be “a concrete output (not a summary of the file, but the actual file), e.g., Word doc, PDF, Powerpoint, Excel, image, repo,” the presentation notes. OpenAI says people can also share fabricated work examples created to demonstrate how they would realistically respond in specific scenarios.

    OpenAI and Handshake AI declined to comment.

    Real-world tasks have two components, according to the OpenAI presentation. There’s the task request (what a person’s manager or colleague told them to do) and the task deliverable (the actual work they produced in response to that request). The company emphasizes multiple times in instructions that the examples contractors share should reflect “real, on-the-job work” that the person has “actually done.”

    One example in the OpenAI presentation outlines a task from a “Senior Lifestyle Manager at a luxury concierge company for ultra-high-net-worth individuals.” The goal is to “prepare a short, 2-page PDF draft of a 7-day yacht trip overview to the Bahamas for a family who will be traveling there for the first time.” It includes additional details regarding the family’s interests and what the itinerary should look like. The “experienced human deliverable” then shows what the contractor in this case would upload: a real Bahamas itinerary created for a client.

    OpenAI instructs the contractors to delete corporate intellectual property and personally identifiable information from the work files they upload. Under a section labeled “Important reminders,” OpenAI tells the workers to “remove or anonymize any: personal information, proprietary or confidential data, material nonpublic information (e.g., internal strategy, unreleased product details).”

    One of the files viewed by WIRED document mentions a ChatGPT tool called “Superstar Scrubbing” that provides advice on how to delete confidential information.

    Evan Brown, an intellectual property lawyer with Neal & McDevitt, tells WIRED that AI labs that receive confidential information from contractors at this scale could be subject to trade secret misappropriation claims. Contractors who offer documents from their previous workplaces to an AI company, even scrubbed, could be at risk of violating their previous employers’ nondisclosure agreements or exposing trade secrets.

    “The AI lab is putting a lot of trust in its contractors to decide what is and isn’t confidential,” says Brown. “If they do let something slip through, are the AI labs really taking the time to determine what is and isn’t a trade secret? It seems to me that the AI lab is putting itself at great risk.”

    agents Contractors Evaluate jobs OpenAI performance Upload work
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Previous ArticleOwn Apple and Nvidia as broad rally expands before earnings
    Next Article Counter-Strike 2 brings Anubis back as Premier Season Four map pool confirmed
    Earth & Beyond
    • Website

    Related Posts

    Jimmy Kimmel Suggests Sending ICE Agents in Minneapolis to Iran

    January 13, 2026

    NASA’s Webb Delivers Unprecedented Look Into Heart of Circinus Galaxy

    January 13, 2026

    New Proposed Legislation Would Let Self-Driving Cars Operate in New York State

    January 13, 2026
    Leave A Reply Cancel Reply

    Latest Post

    If you do 5 things, you’re more indecisive than most—what to do instead

    UK ministers launch investigation into blaze that shut Heathrow

    The SEC Resets Its Crypto Relationship

    How MLB plans to grow Ohtani, Dodger fandom in Japan into billions for league

    Stay In Touch
    • YouTube
    Latest Reviews

    Jimmy Kimmel Suggests Sending ICE Agents in Minneapolis to Iran

    By Earth & BeyondJanuary 13, 2026

    NASA’s Webb Delivers Unprecedented Look Into Heart of Circinus Galaxy

    By Earth & BeyondJanuary 13, 2026

    New Proposed Legislation Would Let Self-Driving Cars Operate in New York State

    By Earth & BeyondJanuary 13, 2026

    Subscribe to Updates

    Get the latest tech news from FooBar about tech, design and biz.

    Most Popular

    Blackpink Share New Song “Jump” Amid Deadline World Tour: Watch the Video

    July 13, 202528 Views

    Bitcoin in the bush – crypto mining brings power to rural areas

    March 25, 202513 Views

    Honor of Kings breaks esports attendance Guinness World Record 

    November 10, 202511 Views
    Our Picks

    Jimmy Kimmel Suggests Sending ICE Agents in Minneapolis to Iran

    Football gossip: Anderson, Guehi, Abraham, Malen, Gallagher

    My Winter Car is a ‘dangerous, depressing and tiring’ life sim, and the developer doesn’t want you to play it unless you’ve mastered the infamously tricky My Summer Car

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    © 2026 Earth & Beyond.
    • About Us
    • Contact Us
    • Privacy Policy
    • Terms and Conditions
    • Disclaimer

    Type above and press Enter to search. Press Esc to cancel.

    Newsletter Signup

    Subscribe to our weekly newsletter below and never miss the latest product or an exclusive offer.

    Enter your email address

    Thanks, I’m not interested