
Facial recognition systems trained on millions of photos of people without their consent

'This is the dirty little secret of AI training sets,' a legal expert warns

Anthony Cuthbertson
Wednesday 13 March 2019 14:36 GMT
Artificial intelligence algorithms have typically struggled to identify women and people with darker skin through facial recognition (Getty/iStock)

Facial recognition algorithms are being trained using photos of people who have not given their consent, legal experts have warned.

Companies including IBM are scraping millions of publicly available images from Flickr and other sites to improve the technology, without the knowledge of the people in the photos.

Civil rights activists warn that this technology could one day be used to track and spy on the same people whose faces have been used to train it.

"This is the dirty little secret of AI training sets. Researchers often just grab whatever images are available in the wild," NYU School of Law professor Jason Schultz told NBC, who first reported on the issue.

Yahoo's YFCC100M dataset gives artificial intelligence researchers around 100 million Creative Commons-licensed images to draw on when training facial recognition systems.

IBM used around one million images from the dataset in its 'Diversity in Faces' research, which aimed to address AI's historical difficulty in identifying women and people with darker skin.

"We are harnessing the power of science to create AI systems that are more fair and accurate," IBM researcher John Smith wrote in a blog that detailed the research.

"The AI systems learn what they're taught, and if they are not taught with robust and diverse datasets, accuracy and fairness could be at risk. For that reason, IBM, along with AI developers and the research community, need to be thoughtful about what data we use for training."

Mr Smith claims publicly available images are the best way of ensuring training data is large and diverse enough to reflect the distribution of face types around the world.

People who have since discovered that their pictures appear in the dataset used by IBM have taken to Twitter to question the ethics of using such images.

"IBM is using 14 of my photos," said Flickr co-founder Caterina Fake. "IBM says people can opt out, but is making it impossible to do so."

The Independent has contacted IBM for comment.
