mrpijey wrote:This topic was created from this topic as it merits its own discussion and not part of the errata topic.
Thanks!
It's a good idea, but are the trailing zeroes the ISOs original? Ie, was it added by Microsoft in some of their ISOs or did some disc dumpers just add them? The reason I ask is because if they are original (by Microsoft) they should be kept as-is, and hashed as-is. If they are not original then all discovered ISOs with trailing zeroes should be trimmed and repacked.
I had a look at .isos I've downloaded directly from Microsoft and they end in zeros. So I wouldn't say that the hash of the file with the zeros removed is the "correct hash" of the "actual image" or anything, it's just a tool for helping to detect images which aren't actually identical but are effectively identical.
Certainly if you gave me two .iso files, this technique would tell me that they're both effectively identical, but it wouldn't tell me which one was the original one from Microsoft. I don't know that there's any way to figure that out.
So I definitely wouldn't suggest that the "hash of the file with trailing zeros removed" be the only hash you record, just an extra one that is sometimes useful.
Thanks for the interesting information, claunia! I can certainly see that the size of the .iso image from MS I'm looking at is a multiple of 2KiB.
I wonder if various tools would complain if you were to trim all the zeros off the end of an .iso file? I certainly wouldn't try trimming the zeros off myself, it doesn't seem like a useful thing to do, particularly if it's only going to save me up to 2KB per file