Fdupes
fdupes -r jellyfin /mnt/terrahdd1/jellyfin
ChatGPT says: "If you need a faster alternative to fdupes for identifying duplicate files, here are some tools and methods that are optimized for speed:"
rmlint
rmlint is a high-performance tool specifically designed to find duplicate files (and other "lint" like empty files) quickly.
sudo apt install rmlint
rmlint /path/to/directory1 /path/to/directory2
jdupes
jdupes is a fork of fdupes with significant performance improvements.
sudo apt install jdupes
jdupes -r /path/to/directory1 /path/to/directory2
jdupes is generally 2–10 times faster than fdupes, especially for large datasets.
dupeGuru
dupeGuru is a GUI and CLI-based duplicate finder optimized for speed and flexibility.
sudo apt install dupeguru
dupeguru -d /path/to/directory1 /path/to/directory2
hashsum Script
For absolute control and minimal overhead, you can write a custom script to hash files and compare results.
find /path/to/directory1 /path/to/directory2 -type f -exec sha256sum {} + | sort | uniq -D -w 64
sha256sum: Calculates hashes for all files.
sort: Sorts the hash results for efficient comparison.
uniq -D -w 64: Shows only duplicate hashes (first 64 characters, i.e., the hash).
This method avoids scanning metadata and directly works on content. Use md5sum for slightly faster hashing at the cost of weaker checksum robustness; in that case change -w 64 to -w 32, since an MD5 hash is 32 hex characters long.
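The pipeline above can be wrapped in a small shell function so it is easier to reuse. This is only a sketch: the name list_dupes is made up for illustration, and it assumes GNU find, sort, and uniq (the --all-repeated=separate option is a GNU extension that prints a blank line between groups of duplicates). File names containing newlines or backslashes are not handled.

```shell
# list_dupes: hypothetical helper, not part of any tool mentioned in these notes.
# Usage: list_dupes DIR [DIR...]
# Prints every file whose SHA-256 hash matches at least one other file,
# with a blank line between groups of identical files.
list_dupes() {
    # Hash every regular file under the given directories.
    find "$@" -type f -exec sha256sum {} + \
        | sort \
        | uniq -w 64 --all-repeated=separate   # compare only the 64-char hash
}
```

It would be called the same way as the one-liner, for example: list_dupes /path/to/directory1 /path/to/directory2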
czkawka (Rust-based, blazingly fast)
czkawka is a Rust-based tool designed for fast file deduplication and cleaning.
Download the binary from the Czkawka GitHub releases page.
czkawka dup /path/to/directory1 /path/to/directory2
Tool | Speed | Features |
---|---|---|
rmlint | Fastest | Advanced deduplication and cleanup |
jdupes | Very Fast | CLI-only, simple and effective |
dupeGuru | Fast | GUI + CLI, flexible matching |
czkawka | Blazingly Fast | Rust-based, modern |
fdupes | Moderate | Simple, widely available |
rmlint or jdupes.
czkawka.