Module content_hash

Module content_hash 

Source
Expand description

Byte-level duplicate detection: SHA-256 over file contents.

Groups paths by stored size first (from SQLite), then hashes only size buckets with more than one path.

Structs§

ContentDupGroup
ContentDupPath
ContentDupScanResult

Constants§

READ_CHUNK 🔒

Functions§

find_byte_duplicate_groups
entries: (path, size_bytes, kind) for the whole library.
hash_file_sha256
Hex-encoded SHA-256 of file bytes, or None if unreadable.