Research shows AI agents fail most remote tasks, with top performer automating just 2.5% of freelance work.
The study, called the Remote Labor Index (RLI), represents one of the most detailed attempts so far to measure AI’s performance on practical digital work.
Researchers collected 240 completed projects from professional freelancers working through platforms such as Upwork.
Six advanced AI agents were then tested on the same projects.
AI agents fail most remote tasks, with top performer automating just 2.5% of freelance work.
Author’s summary: AI fails most remote tasks with top performer automating just 2.5%.