Use Case
Many users have duplicate files in their system which consume unnecessary storage.
This tool will help identify files with identical content even if the filenames are different.
Proposed Solution
Create a Python script that:
- Scans a user-specified folder
- Calculates the hash (MD5/SHA256) of each file
- Compares hashes to detect duplicates
- Prints a list of duplicate files
Example Output
Duplicate files found:
file1.txt == file_copy.txt
image.png == image_backup.png
Additional Information
This project will demonstrate:
- File handling in Python
- Hashing using hashlib
- Dictionary-based comparison for efficient duplicate detection