Wollongong City Council's digital asset library holds tens of thousands of images — aerial shots of Port Kembla's coke ovens, planning renders for the Crown Street Mall precinct, candid photographs from Viva! Energia and Multicultural Wollongong festivals stretching back to the early 2000s. A significant portion of those files, however, are exact or near-exact copies of each other, stored under different filenames, in different folders, by different staff who had no reliable way to know a version already existed.
The duplication problem didn't emerge overnight. It accumulated over roughly two decades of digital growth without a unified content management policy, and it matters now because several large public-interest projects — including the University of Wollongong's digitisation of its Illawarra Historical Society collection and BlueScope Steel's internal documentation of the green steel transition at Port Kembla — are forcing organisations to confront the inherited mess before they add more material to it.
How the Illawarra Got Here
The roots run back to the early 2000s, when government bodies and institutions began digitising at pace without coordinating with each other. Wollongong City Council, the Illawarra Shoalhaven Joint Organisation and the then-Illawarra Regional Development Board each built separate repositories. Staff leaving one agency and joining another routinely brought USB drives or emailed files, seeding copies across networks. Image metadata — the embedded date, author and rights information that makes a library searchable — was frequently stripped when files were re-saved or converted from TIFF to JPEG to PNG across different software versions.
The University of Wollongong's Library in Northfields Avenue has been grappling with this since at least 2019, when it began a formal audit of its Special Collections holdings. Digitised photographs of the Illawarra escarpment, the historic Bulli Pass and early-twentieth-century North Wollongong beachfront scenes appeared in some cases four or five times in separate catalogue entries, each attributed differently or not at all. The audit found that meaningful cataloguing had to restart essentially from scratch for several sub-collections.
BlueScope's situation is distinct but parallel. As the company documents its transition away from coal-based steelmaking at Port Kembla — a process tied to federal and state green hydrogen investment commitments — its internal communications teams have found legacy image sets from the 1990s and 2000s with no clear rights chain. Images shot by contracted photographers, reused in annual reports, then uploaded to intranet systems by different departments, now exist in versions that cannot be reliably traced to a single original file.
The Cost of Duplication
Storage is the most legible cost. Cloud storage pricing across Australian government-tier services typically sits between $0.02 and $0.05 per gigabyte per month, and while that sounds trivial per image, a library running 800,000 duplicate files at an average of 8 megabytes each represents roughly 6.4 terabytes of redundant data — an ongoing annual cost that compounds with each year a cleanup is deferred.
The less visible cost is editorial. When Destination Wollongong produces tourism marketing material, or when the Illawarra Mercury archives department tries to source a historical image of, say, the old BHP blast furnaces on Springhill Road, a fractured image library means staff spend hours searching rather than minutes. Rights clearances for images with lost provenance can take weeks and sometimes require legal advice.
The industry response has been the gradual adoption of digital asset management platforms — software that uses perceptual hashing and metadata matching to flag duplicate or near-duplicate files before they are ingested. Several councils in the Greater Sydney region moved to centralised DAM systems between 2022 and 2025. Wollongong City Council's own ICT strategy, which covers the period through to 2027, identifies digital asset governance as a priority area, though implementation timelines for specific tools have not been publicly confirmed.
For organisations in the Illawarra starting or mid-way through digitisation projects right now, the practical advice from library and records management professionals is consistent: establish a single ingestion point, assign persistent unique identifiers to every file at the moment of creation, and audit existing holdings in batches rather than waiting for a crisis to force a full-library review. The alternative — another decade of duplicates — is a problem that only gets more expensive to unwind.