Skip to main content
The Daily Wollongong

Wollongong news, every day

News

How Wollongong's Digital Archives Fell Into the Duplicate Image Problem — and What It Cost to Fix It

Years of fragmented record-keeping across council, university and industrial bodies left the Illawarra region's digital photo libraries riddled with duplicates, wasted storage and lost provenance — here's how that happened.

By Wollongong News Desk · Published 5 July 2026, 4:48 am · Updated

3 min read

Wollongong City Council's digital asset library holds tens of thousands of images — aerial shots of Port Kembla's coke ovens, planning renders for the Crown Street Mall precinct, candid photographs from Viva! Energia and Multicultural Wollongong festivals stretching back to the early 2000s. A significant portion of those files, however, are exact or near-exact copies of each other, stored under different filenames, in different folders, by different staff who had no reliable way to know a version already existed.

The duplication problem didn't emerge overnight. It accumulated over roughly two decades of digital growth without a unified content management policy, and it matters now because several large public-interest projects — including the University of Wollongong's digitisation of its Illawarra Historical Society collection and BlueScope Steel's internal documentation of the green steel transition at Port Kembla — are forcing organisations to confront the inherited mess before they add more material to it.

How the Illawarra Got Here

The roots run back to the early 2000s, when government bodies and institutions began digitising at pace without coordinating with each other. Wollongong City Council, the Illawarra Shoalhaven Joint Organisation and the then-Illawarra Regional Development Board each built separate repositories. Staff leaving one agency and joining another routinely brought USB drives or emailed files, seeding copies across networks. Image metadata — the embedded date, author and rights information that makes a library searchable — was frequently stripped when files were re-saved or converted from TIFF to JPEG to PNG across different software versions.

The University of Wollongong's Library in Northfields Avenue has been grappling with this since at least 2019, when it began a formal audit of its Special Collections holdings. Digitised photographs of the Illawarra escarpment, the historic Bulli Pass and early-twentieth-century North Wollongong beachfront scenes appeared in some cases four or five times in separate catalogue entries, each attributed differently or not at all. The audit found that meaningful cataloguing had to restart essentially from scratch for several sub-collections.

BlueScope's situation is distinct but parallel. As the company documents its transition away from coal-based steelmaking at Port Kembla — a process tied to federal and state green hydrogen investment commitments — its internal communications teams have found legacy image sets from the 1990s and 2000s with no clear rights chain. Images shot by contracted photographers, reused in annual reports, then uploaded to intranet systems by different departments, now exist in versions that cannot be reliably traced to a single original file.

The Cost of Duplication

Storage is the most legible cost. Cloud storage pricing across Australian government-tier services typically sits between $0.02 and $0.05 per gigabyte per month, and while that sounds trivial per image, a library running 800,000 duplicate files at an average of 8 megabytes each represents roughly 6.4 terabytes of redundant data — an ongoing annual cost that compounds with each year a cleanup is deferred.

The less visible cost is editorial. When Destination Wollongong produces tourism marketing material, or when the Illawarra Mercury archives department tries to source a historical image of, say, the old BHP blast furnaces on Springhill Road, a fractured image library means staff spend hours searching rather than minutes. Rights clearances for images with lost provenance can take weeks and sometimes require legal advice.

The industry response has been the gradual adoption of digital asset management platforms — software that uses perceptual hashing and metadata matching to flag duplicate or near-duplicate files before they are ingested. Several councils in the Greater Sydney region moved to centralised DAM systems between 2022 and 2025. Wollongong City Council's own ICT strategy, which covers the period through to 2027, identifies digital asset governance as a priority area, though implementation timelines for specific tools have not been publicly confirmed.

For organisations in the Illawarra starting or mid-way through digitisation projects right now, the practical advice from library and records management professionals is consistent: establish a single ingestion point, assign persistent unique identifiers to every file at the moment of creation, and audit existing holdings in batches rather than waiting for a crisis to force a full-library review. The alternative — another decade of duplicates — is a problem that only gets more expensive to unwind.

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Wollongong

This article was produced by the The Daily Wollongong editorial desk and covers news in Wollongong. See our editorial standards for how we use AI.

The Daily Wollongong brief

The day's Wollongong news in a 2-minute read, every weekday morning. Free.

Join 2,847 locals getting The Daily Wollongong every morning in Wollongong.

By subscribing you agree to receive emails from The Daily Wollongong and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Wollongong news every morning.

Free, in your inbox before 7am. Weekdays.

Join 2,847 locals getting The Daily Wollongong every morning in Wollongong.

By subscribing you agree to receive emails from The Daily Wollongong and accept our Privacy Policy. Unsubscribe anytime.

Stay in the loop

Enjoyed this story? Get tomorrow's briefing free.