|
|||||||||||||||||||||||||
|
Last updated
June 29th 2008, Version 0.999
|
|||||||||||||||||||||||||
|
|
|||||||||||||||||||||||||
| Introduction |
Most harddisks contain quite a lot of completely identical files,
which consume a lot of disk space. This waste of space can be drastically
reduced by using the NTFS filesystem hardlinks functionality, if identical
files aka dupes are hardlinked together. Dupemerge searches for identical files on a logical drive and creates hardlink among those file thus saving lots of harddisk space. |
||||||||||||||||||||||||
| Installation | Dupemerge.exe is a command line utility, so copy dupemerge.exe to some directory referenced by your PATH environment variable. %systemroot% is a good place. e.g.: c:\winnt | ||||||||||||||||||||||||
|
|
|||||||||||||||||||||||||
| Using dupemerge | dupemerge.exe can be controlled by a few command line arguments, and its highlights are as follows: | ||||||||||||||||||||||||
|
|||||||||||||||||||||||||
|
|
|||||||||||||||||||||||||
| Backgrounders |
Dupemerge creates a cryptological hashsum for each file found below
the given pathes
and compares those hashes to each other find the dupes. There is no file date comparison
involved in detecting dupes, which might cause troubles. To speed up comparison only files with same size get compared to each other. Furthermore the hashsums for equal sized files get calculated incrementally, which means, that during the first pass only the first 4 kilobyte are hashed and compared and during the next rounds more and more data are hashed and compared. Due to long runtime on large disks some files, which have already been hashsumed, might change before all dupes to that file are found. To prevent false hardlink creation due to intermediate changes, dupemerge saves the file write time of a file when it hashsums the file and checks back if this time changed when it tries to hardlink dupes. If dupemerge is run once, hardlinks among indentical files are created. To save time during a second run on the same locations, dupemerge checks if a file is already a hardlink, and tries to find the other hardlinks by comparing the unique NTFS file-id. This saves a lot of time, because especially checksums for large files need not to to be created twice. Dupemerge has a dupe find algorithm, which is extremly tuned to especially perform well on large server disks, where it has been tested in depth to guarantee data integrity. |
||||||||||||||||||||||||
|
|
|||||||||||||||||||||||||
| Limitations |
|
||||||||||||||||||||||||
|
|
|||||||||||||||||||||||||
| Frequently Asked questions |
Hello, this may seem a basic question, but how do I know how much space dupemerge has saved by using hard links?
If I have two identical directories A & B and run dupemerge.exe C:\A C:\B, I’d imagine that the resulting size of the two directories would be halved. However windows explorer still thinks the size on disk of the two directories combined is double rather than half. Can you not see the saved space via explorer? A: You can't see the saved space via explorer, because explorer simply adds the size of files found below a given location, and because hardlinks are very transparent, explorer does not know, that a summed up file is a hardlink, so it thinks it is a file. To see the saved space open a command prompt and type 'dir', run dupemerge, and once again run 'dir'. Or via Explorer: Open the drive and take a look at the drive properties, before and after running dupemerge. |
||||||||||||||||||||||||
|
|
|||||||||||||||||||||||||
| History |
|
||||||||||||||||||||||||
|
|
|||||||||||||||||||||||||
| Status | The 0.998 version
is stable enough to satisfy most needs. A bugfixing release
is scheduled for February 2008, which should contain a fix for the junction
problem. |
||||||||||||||||||||||||
| Acknowledgements | I wish to thank those who have contributed significantly to the development of dupemerge. | ||||||||||||||||||||||||
|
|
|||||||||||||||||||||||||
| Open Issues |
|
||||||||||||||||||||||||
|
|
|||||||||||||||||||||||||
| Disclaimer |
This program is provided
as is. |
||||||||||||||||||||||||
|
|
|||||||||||||||||||||||||
| Contact / Donations |
Bug reports, or feature requests send to
Hermann Schinagl.. Dupemerge.exe is and will be freeware, but if Dupemerge.exe was really helpful for you and saved lots of your time please think of donations either via PayPal or by sending me a gift certificate from |
||||||||||||||||||||||||
|
|
|||||||||||||||||||||||||
|
|
|||||||||||||||||||||||||
| Download |
| ||||||||||||||||||||||||
|
|
|||||||||||||||||||||||||