Skip to content

Add a Python project that scans a directory and detects duplicate files using hashing (MD5 or SHA256). #8

@paltannistha

Description

@paltannistha

Use Case

Many users have duplicate files in their system which consume unnecessary storage.
This tool will help identify files with identical content even if the filenames are different.

Proposed Solution

Create a Python script that:

  • Scans a user-specified folder
  • Calculates the hash (MD5/SHA256) of each file
  • Compares hashes to detect duplicates
  • Prints a list of duplicate files

Example Output

Duplicate files found:
file1.txt == file_copy.txt
image.png == image_backup.png

Additional Information

This project will demonstrate:

  • File handling in Python
  • Hashing using hashlib
  • Dictionary-based comparison for efficient duplicate detection

Metadata

Metadata

Assignees

Labels

enhancementNew feature or request

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions