by Srravya Chandhiramowuli, PhD Researcher, University of Edinburgh, UK
4 min read
Data workers in the AI industry are expected to spend no more than 6 seconds on drawing boxes around objects—an essential task for training computer vision models used widely in self-driving cars, X-ray analyses and facial recognition systems. A senior data annotation manager in India, that I interviewed during my fieldwork in outsourcing data annotation centres, referred to this 6 second target as the ‘industry standard’ for this type of task. Such stringent targets form the basis of how data work, the indispensable work of compiling, cleaning and annotating datasets with crucial information for training AI systems, is described, discussed and decided upon in the AI industry today. And this approach adversely impacts data workers and the AI systems they help build.
In the wake of burgeoning interest in AI, the global data work industry is rapidly growing. Millions of workers in regions of low income, high inflation, growing unemployment, as well as in rural areas, migrant communities, prisons, and refugee camps produce datasets that are foundational to AI systems. Yet, a model-centric outlook towards AI innovation devalues data work as nothing more than a set of repetitive, mundane tasks. It is this characterisation of data work as mundane that is evident in the targets (like the one mentioned above) and other reductive metrics that make up data work.
One of the most widely used metrics in data work is the average handling time (AHT). It indicates the time required to complete a given task. For each type of task, the AHT varies. It determines deliverable timelines and deadlines. The AHT value is agreed upon by the client (AI company providing the dataset) and vendor (the annotation centre annotating the dataset) at the start of the project.
And this value is deeply contested - AI companies negotiate for a lower AHT, to push for faster turnarounds; annotation vendors, on the other hand, seek to push back unrealistic expectations without jeopardising the client relationship. In the global supply chain of AI, counting aids the assertion of the client’s power and authority over annotation processes. Underlying these AHT values and negotiations is a presumption of total countability - the notion that everything, from tasks, datasets and deliverables, to workers, work time, quality and performance, can be managed by applying the logic of counting.
This spawns more counting such as computing and tracking hourly targets, accuracy rates and productivity scores. In annotation centres, these metrics are closely monitored by managers, placing stressful demands on workers to meet the target count every hour. Instead of valuing people’s judgement and discretion and allowing machines to learn from people, counting regimes drive human workers to behave more like machines. Their work 1 and expertise is devalued, resulting in difficult working conditions, meagre wages, limited autonomy and scope for skill development.
Efforts to promote safety and responsibility in AI would be incomplete without acknowledging and responding to the ground realities of data production for AI. Empowering worker representation and autonomy, and enforcing better regulatory scrutiny and accountability in the AI supply chain are crucial to envisioning more just futures of data work and AI.
Further reading:
US lawmakers demand answers from AI companies on use of underpaid, overworked data workers
The Exploited Labor Behind Artificial Intelligence
Fairwork Cloudwork Ratings 2023: Work in the Planetary Labour Market
Srravya Chandhiramowuli is a PhD researcher in Design Informatics at the University of Edinburgh. Her PhD research examines the work of data annotation for AI, paying particular attention to systemic challenges and frictions, to envision and inform just, equitable futures of AI.
Technology, Employment and Wellbeing is a new FES blog that offers original insights on the ways new technologies impact the world of work. The blog focuses on bringing different views from tech practitioners, academic researchers, trade union representatives and policy makers.
Friedrich-Ebert-Stiftung Future of Work
Cours Saint Michel 30e 1040 Brussels Belgium
+32 2 329 30 32
futureofwork(at)fes.de
Team
This site uses third-party website tracking technologies to provide and continually improve our services, and to display advertisements according to users' interests. I agree and may revoke or change my consent at any time with effect for the future.
These technologies are required to activate the core functionality of the website.
This is an self hosted web analytics platform.
Data Purposes
This list represents the purposes of the data collection and processing.
Technologies Used
Data Collected
This list represents all (personal) data that is collected by or through the use of this service.
Legal Basis
In the following the required legal basis for the processing of data is listed.
Retention Period
The retention period is the time span the collected data is saved for the processing purposes. The data needs to be deleted as soon as it is no longer needed for the stated processing purposes.
The data will be deleted as soon as they are no longer needed for the processing purposes.
These technologies enable us to analyse the use of the website in order to measure and improve performance.
This is a video player service.
Processing Company
Google Ireland Limited
Google Building Gordon House, 4 Barrow St, Dublin, D04 E5W5, Ireland
Location of Processing
European Union
Data Recipients
Data Protection Officer of Processing Company
Below you can find the email address of the data protection officer of the processing company.
https://support.google.com/policies/contact/general_privacy_form
Transfer to Third Countries
This service may forward the collected data to a different country. Please note that this service might transfer the data to a country without the required data protection standards. If the data is transferred to the USA, there is a risk that your data can be processed by US authorities, for control and surveillance measures, possibly without legal remedies. Below you can find a list of countries to which the data is being transferred. For more information regarding safeguards please refer to the website provider’s privacy policy or contact the website provider directly.
Worldwide
Click here to read the privacy policy of the data processor
https://policies.google.com/privacy?hl=en
Click here to opt out from this processor across all domains
https://safety.google/privacy/privacy-controls/
Click here to read the cookie policy of the data processor
https://policies.google.com/technologies/cookies?hl=en
Storage Information
Below you can see the longest potential duration for storage on a device, as set when using the cookie method of storage and if there are any other methods used.
This service uses different means of storing information on a user’s device as listed below.
This cookie stores your preferences and other information, in particular preferred language, how many search results you wish to be shown on your page, and whether or not you wish to have Google’s SafeSearch filter turned on.
This cookie measures your bandwidth to determine whether you get the new player interface or the old.
This cookie increments the views counter on the YouTube video.
This is set on pages with embedded YouTube video.
This is a service for displaying video content.
Vimeo LLC
555 West 18th Street, New York, New York 10011, United States of America
United States of America
Privacy(at)vimeo.com
https://vimeo.com/privacy
https://vimeo.com/cookie_policy
This cookie is used in conjunction with a video player. If the visitor is interrupted while viewing video content, the cookie remembers where to start the video when the visitor reloads the video.
An indicator of if the visitor has ever logged in.
Registers a unique ID that is used by Vimeo.
Saves the user's preferences when playing embedded videos from Vimeo.
Set after a user's first upload.
This is an integrated map service.
Gordon House, 4 Barrow St, Dublin 4, Ireland
https://support.google.com/policies/troubleshooter/7575787?hl=en
United States of America,Singapore,Taiwan,Chile
http://www.google.com/intl/de/policies/privacy/