Share

The unseen human workforce behind AI, including ChatGPT

accreditation
0:00
play article
Subscribers can listen to this article
Many AI models are dependent on a large human workforce to be able to operate.
Many AI models are dependent on a large human workforce to be able to operate.
Pexels
  • AI often relies on large human workforces to be able to operate.
  • Workers often perform tasks which can include moderating harmful and explicit content, and can be psychologically damaging to them.
  • For more stories, visit the Tech and Trends homepage.

We are missing a big part of the AI story and that is the fact that it often relies on a large human workforce. 

This is what Sonam Jindal, the programme lead for AI, labour, and the economy at the Partnership on AI, a US-based artificial intelligence research institute, told NBC News earlier this month.

She said that a lot of the discussion around AI is very congratulatory, but a Partnership on AI web post said that the "data enrichment work" required to make machine learning tools function optimally was "often overlooked, [and] performed by faraway workers under precarious conditions".

Humans provide feedback to AI tools which is used to provide better and more accurate information for users of the service.

Humans perform tasks such as labeling what they see in an image or providing feedback on whether certain content is explicit or unsafe, which the AI can take on board and use to provide safer and more accurate responses in the future.

Without some human input, it is very difficult for AI tools to know whether the responses it is providing are accurate or whether the material is potentially harmful.

So big technology companies often reach out to firms that have a large army of workers available to train AI models. 

Training ChatGPT

One of the organisations involved in data enrichment work is Sama, a US-based machine learning training institution that hires workers from Kenya, Uganda and India to label data for clients like Google, Meta and Microsoft.

An investigation by Time that was published earlier this year found that OpenAI, the owners of ChatGPT, also contracted Sama to train ChatGPT, and that workers received between $1.32 and $2 per hour for the work. 

OpenAI does not disclose the names of its data enrichment partners so it is not known if it worked with other data enrichment firms to train ChatGPT.

OpenAI contracted Sama in 2021 to label textual descriptions of sexual abuse, hate speech, and violence. Time spoke to four Sama employees, who wished to remain anonymous, all of whom reported that they were "mentally scarred" by the work.

One Sama worker who was labeling text for OpenAI said that they had to read a graphic description that included descriptions of beastiality and a minor. They described it as being "torture" and said that it was common to read a number of statements like that through the week.

Mental health sessions were made available to the employees by Sama, but the employees that Time spoke to said they were "unhelpful" and "rare".

A Sama spokesperson told the publication that "professionally-trained and licenced mental health therapists" were available any time for group or individual sessions for the employees. 

Sama canceled its contract with ChatGPT in February 2022, eight months earlier than the contract stipulated, owing to the traumatic nature of the work.

An OpenAI spokesperson said in a statement to Time that OpenAI's mission was to "ensure artificial general intelligence benefits all of humanity".

READ MORE |  OpenAI chief goes before US Congress to propose licenses for building AI

The statement said that the company worked hard to ensure that AI systems were useful and safe and that bias and harmful content was limited.

"Classifying and filtering harmful [text and images] is a necessary step in minimising the amount of violent and sexual content included in training data and creating tools that can detect harmful content."

Time said that the labour, which is often sourced from the global south, required for AI to function safely could often be "damaging and exploitative". 

"These invisible workers remain on the margins even as their work contributes to billion-dollar industries."

Amazon Mechanical Turk

Another tool that can be used to train AI is Amazon’s Mechanical Turk.

The crowdsourcing marketplace allows businesses to capture data from an international workforce, which Amazon says is 500 000 people strong.

Companies post what are known as "human intelligence tasks" that freelance workers, known as turkers, compete to complete. These tasks include labeling photographs, completing surveys, or moderating content.

Some tasks only pay $0.01 or about R0.19.

A study cited by the New York Times found that the median "turkers" hourly wage was $1.77. Only 4% of turkers earned above minimum wage.

Mechanical Turk can be used for tasks that contain adult content, but that content must be labeled as such and cannot contain illegal material.

Amazon said that crowdsourcing workers to complete microtasks was better than the traditional ways of completing work like this, which required hiring a temporary workforce.

"While technology continues to improve, there are still many things that human beings can do much more effectively than computers, such as moderating content, performing data deduplication, or research," said the company.



We live in a world where facts and fiction get blurred
Who we choose to trust can have a profound impact on our lives. Join thousands of devoted South Africans who look to News24 to bring them news they can trust every day. As we celebrate 25 years, become a News24 subscriber as we strive to keep you informed, inspired and empowered.
Join News24 today
heading
description
username
Show Comments ()
Voting Booth
Should the Proteas pick Faf du Plessis for the T20 World Cup in West Indies and the United States in June?
Please select an option Oops! Something went wrong, please try again later.
Results
Yes! Faf still has a lot to give ...
68% - 2306 votes
No! It's time to move on ...
32% - 1103 votes
Vote
Rand - Dollar
18.51
+0.3%
Rand - Pound
23.23
-0.0%
Rand - Euro
19.94
-0.1%
Rand - Aus dollar
12.22
+0.1%
Rand - Yen
0.12
+0.1%
Platinum
966.10
-0.0%
Palladium
950.00
-0.1%
Gold
0.00
0.0%
Silver
0.00
0.0%
Brent Crude
82.96
-0.9%
Top 40
70,300
+0.5%
All Share
76,428
+0.5%
Resource 10
60,246
-0.2%
Industrial 25
107,200
+1.3%
Financial 15
16,554
-0.2%
All JSE data delayed by at least 15 minutes Iress logo
Editorial feedback and complaints

Contact the public editor with feedback for our journalists, complaints, queries or suggestions about articles on News24.

LEARN MORE