OpenAI and training data firm Handshake AI are reportedly requesting contractors to submit authentic work samples from their previous and current employment, according to a Wired report. This initiative appears to be part of a broader trend among AI companies focused on acquiring high-quality training data, with the goal of enabling their models to automate more complex, white-collar tasks.
According to the report, OpenAI's instructions to contractors include detailing tasks performed at previous jobs and uploading tangible examples of work completed, such as Word documents, PDFs, PowerPoint presentations, Excel spreadsheets, images, or code repositories. The company reportedly advises contractors to remove proprietary information and personally identifiable data before uploading these files, offering access to a "ChatGPT Superstar Scrubbing tool" for this purpose.
The practice raises concerns regarding intellectual property rights. Evan Brown, an intellectual property lawyer, told Wired that AI labs adopting this approach face significant risks, as it relies heavily on the trustworthiness of contractors to properly scrub sensitive data.
The underlying principle behind this data collection is to improve the ability of AI models to understand and replicate real-world tasks. AI models, particularly large language models like those developed by OpenAI, require vast amounts of data to learn patterns and generate accurate outputs. By training these models on actual work samples, companies aim to enhance their performance in automating tasks that typically require human expertise.
The implications of this trend extend beyond individual companies, potentially impacting the future of work. As AI models become more adept at performing white-collar tasks, there is a possibility of increased automation across various industries. This could lead to shifts in employment patterns and require workers to adapt to new roles that complement AI technologies.
Currently, it remains unclear how widespread this practice is across the AI industry and what safeguards are in place to protect intellectual property and privacy. The legal and ethical considerations surrounding the use of real-world work samples for AI training are likely to be subject to ongoing scrutiny as the technology continues to evolve.
Discussion
Join the conversation
Be the first to comment