AI re-writing historical image data

 Like many people, we have a collection of old photos of various ancestors.  Typical of the time, they were probably taken on box-brownie cameras or something equivalent that produced slightly blurry photos that are typical of candid photos of the time.

My brother has been using an AI image tool to clean them up and colourise the black and white ones.

Generally, the tool has done a remarkable job at cleaning up some of the images.  It's also done a great job of colourising some of them, so they look a lot more interesting.

However, there are two important issues from a data management perspective that point to AI changing or making up history that may not be correct.

Firstly, one of the photos shows a young version of one of my uncles at the seaside for a day-outing around 1934.  The original black and white image shows that he has two bottles of what I know was home-made lemonade in those old glass drink bottles that were re-sealable with a marble and a lever-action clip.  The AI has replaced those bottles with Coke bottles, featuring the classic Coca-Cola script and bottle caps.



Anyone looking at the AI-enhanced photo in the photo will assume that my uncle had 2 bottles of Coke which is not true.  It's likely that Coke wasn't even commonly available in England in the 1930s.  In this way, AI can inadvertently and subtly change historical data.

A second issue is with the colourisation.   I note that the AI tool has regularly colourised people's clothing with dull blues, reds, and browns, and there's rarely anyone dressed in anything bright unless it assumed a bold black and white pattered fabric.   There's probably a reason for this - maybe feedback has suggested colourising images using colours that aren't strong and so don't appear obviously wrong.   In some cases, the AI was spot-on with its colour selections - for example Boy Scout unforms are the correct colour - maybe because the AI can match it to historical colour images.  However, in one case a camel-coloured coat has ended up as a mid-green colour as a result of colourisation.  

Once again, the colourisation could be considered re-writing history.  People might look back on old photos and infer that people at the turn of the century only wore muted colours when in reality they liked bright colours as much as people today.

From a data management perspective, this is like changing a NULL value to a not-NULL.  It's the equivalent of seeing a person record with no sex recorded and deciding that because their first name is recorded as Jo, they must be a female.   The data was clear that we didn't know their sex, but some process has come along and invented the missing data.  Maybe to hedge its bets, if sex is unknown it might decide to make them non-binary!

The implications for data management are pretty worrying as organisations enthusiastically embrace AI.  Who knows what "facts" are going to the invented by an AI engine without anyone knowing.



Comments

Popular posts from this blog

Do we have a right to anonymous cash transactions?

The cyclic nature of data management