Bendigo Health holds more than 340,000 patient image files across its radiology and clinical photography systems, according to figures the organisation presented to a Victorian regional health forum in March 2026. A growing share of those files are duplicates — the same scan or photograph stored two, three, or more times under slightly different file names or patient identifiers. The cost of storing, retrieving and managing that redundancy is no longer trivial.
The problem matters right now because Bendigo Health is mid-way through a capital expansion that includes a new imaging wing at the Lucan Street campus. That construction, part of a project the state government has publicly committed funding to, is forcing IT teams to migrate legacy data into new infrastructure. When you migrate a messy dataset, you inherit every duplicate — and potentially pay to store it on more expensive modern hardware.
A Local Audit Reveals the Scale
City of Greater Bendigo's digital services team ran an internal file audit across shared network drives in late 2025, covering departments from planning to community services. The audit, referenced in a council agenda paper published on the council's website in February 2026, found that approximately 28 percent of stored documents were exact or near-exact duplicates. Across a total storage footprint of roughly 14 terabytes, that translated to nearly four terabytes of redundant data — storage the council was paying to back up every night.
The Bendigo Goldfields Library consortium, which manages collections across the Hargreaves Street branch and two satellite locations, faces a version of the same problem in its digitisation program. The consortium has been scanning historical photographs from the Goldfields region since 2019. Without a deduplication policy locked in from the start, the same glass-plate negatives were sometimes scanned by different volunteers on different days, generating multiple high-resolution TIFFs of identical images. Each uncompressed TIFF can run to 80 megabytes. Multiply that across hundreds of items and the storage bill climbs fast.
La Trobe University's Bendigo campus is dealing with the issue from a different direction. The campus library system links to the broader La Trobe network, which in a 2024 infrastructure review identified duplicate digital assets — particularly lecture recordings and course materials — as one of the top three avoidable costs in its storage budget. Regional campuses like Bendigo absorb a proportional share of those overheads.
What Deduplication Actually Costs — and Saves
Commercial deduplication software for mid-sized institutional datasets typically ranges from $8,000 to $35,000 for a one-time licensing and implementation package, depending on dataset size and integration complexity — figures drawn from publicly available vendor pricing sheets from suppliers including Veritas and Commvault as of mid-2026. Cloud-native solutions offered through providers like AWS can reduce that upfront cost but introduce per-gigabyte retrieval fees that compound over time.
For a regional organisation like Bendigo Health, the calculus is not just financial. Duplicate patient images carry a genuine clinical risk: a radiologist working from an incomplete or duplicated file set could theoretically miss a comparison scan. The Victorian Managed Insurance Authority, which covers public health services in the state, has flagged data integrity as a category of operational risk in its guidance documents, though it has not publicly cited Bendigo Health specifically.
The City of Greater Bendigo's February 2026 agenda paper noted that resolving its duplication problem would require a dedicated project officer for approximately six months, at an estimated cost of $52,000 in staff time. The council had not, as of the paper's publication date, approved that expenditure.
For organisations in Bendigo wrestling with the same issue, the practical starting point is an audit before any migration or new hardware purchase. Mapping what you have — and how many times you have it — before committing to infrastructure costs is the step most institutions skip. The Goldfields Library consortium, for its part, has introduced a mandatory checksum verification step into its scanning workflow as of January 2026, ensuring new duplicates are flagged at the point of creation rather than discovered years later during a budget crisis.