Tech2 hrs ago

AI Training on Global South Works Sparks Copyright Debate

Foreign AI firms train models on unpaid cultural works from Africa, Asia and Latin America, igniting a global debate over copyright, fair use and creator compensation.

Alex Mercer/3 min/NG

Senior Tech Correspondent

TweetLinkedIn
Cars on multiple flyovers

Cars on multiple flyovers

Source: AzureOriginal source

AI companies are using unpaid cultural works from the Global South to train models, relying on fair‑use claims that creators say violate copyright.

Generative AI systems depend on massive datasets that include books, articles, music and images created by writers, artists, journalists, and academics. Many of these works originate in countries across Africa, Asia and Latin America, where creators often lack the legal resources to challenge foreign corporations. The data is harvested through web crawlers and uploaded to training pipelines without explicit permission or compensation.

AI systems built by foreign companies are using cultural works from the Global South without adequate compensation or legal input from those creators. Modern AI models rely on vast amounts of human-created content from writers, artists, journalists, and academics for their training data. AI companies argue that using large amounts of copyrighted material is necessary for innovation and are relying on fair use and text‑data‑mining exceptions to justify it.

This practice creates a structural imbalance where the value of local creativity flows to overseas tech firms while originators receive little return. Critics warn that unchecked extraction can distort markets, undercut local cultural industries and erode the social compact that copyright represents. Supporters of the AI firms say that broad access to data fuels breakthroughs in health, education and climate modeling, and that overly strict rules could choke innovation. The debate hinges on whether current legal exceptions can accommodate the scale of today’s models or whether new licensing frameworks are needed.

Policymakers in Nigeria, Kenya and India are drafting proposals that would require AI developers to disclose training‑data sources and pay royalties for copyrighted material; upcoming legislative sessions will test whether such measures can balance innovation with creator rights.

TweetLinkedIn

More in this thread

Reader notes

Loading comments...