# qarmin/czkawka

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/qarmin-czkawka).**

29,337 stars · 969 forks · Rust · other

## Links

- GitHub: https://github.com/qarmin/czkawka
- awesome-repositories: https://awesome-repositories.com/repository/qarmin-czkawka.md

## Topics

`cleaner` `duplicates` `multiplatform` `optimization` `optimizer` `rust` `similar-images` `similar-music` `similar-videos`

## Description

Czkawka is a cross-platform utility for storage optimization and filesystem maintenance. It works as a file analysis engine that identifies redundant data, including duplicate files, empty directories, broken symbolic links, and temporary files. Because duplicates are confirmed by hashing file contents, matches do not depend on file names or metadata.
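The hash-based duplicate check described above can be sketched as follows. This is a minimal illustration, not the project's implementation: the function names `content_hash` and `find_duplicates` are hypothetical, files are modeled as in-memory byte slices, and the standard library's `DefaultHasher` is a stand-in chosen only to keep the sketch free of dependencies.

```rust
use std::collections::hash_map::DefaultHasher;
use std::collections::HashMap;
use std::hash::{Hash, Hasher};

/// Hash the bytes of one file.
/// Assumption: DefaultHasher is a dependency-free stand-in, not the hash
/// the project actually uses.
fn content_hash(bytes: &[u8]) -> u64 {
    let mut hasher = DefaultHasher::new();
    bytes.hash(&mut hasher);
    hasher.finish()
}

/// Group files whose contents match, ignoring their names.
/// Each input item is a hypothetical (name, contents) pair.
fn find_duplicates(files: &[(&str, &[u8])]) -> Vec<Vec<String>> {
    // Stage 1: bucket by size. Files of different sizes cannot be identical,
    // so only buckets with more than one file need hashing.
    let mut by_size: HashMap<usize, Vec<(&str, &[u8])>> = HashMap::new();
    for &(name, bytes) in files {
        by_size.entry(bytes.len()).or_default().push((name, bytes));
    }

    // Stage 2: within each size bucket, group by content hash.
    let mut groups: Vec<Vec<String>> = Vec::new();
    for candidates in by_size.into_values().filter(|c| c.len() > 1) {
        let mut by_hash: HashMap<u64, Vec<String>> = HashMap::new();
        for (name, bytes) in candidates {
            by_hash
                .entry(content_hash(bytes))
                .or_default()
                .push(name.to_string());
        }
        groups.extend(by_hash.into_values().filter(|g| g.len() > 1));
    }

    // Sort for a deterministic result order.
    for group in &mut groups {
        group.sort();
    }
    groups.sort();
    groups
}
```

Calling `find_duplicates` with two differently named files that hold the same bytes returns one group containing both names, while files that merely share a size are kept apart by the hashing stage.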

The project offers both a native graphical user interface and a command-line interface, so it supports interactive management as well as automated, headless system maintenance. Its multimedia analysis goes beyond exact file matching: it can detect visually or audibly similar media, remove metadata tags, and optimize video files to reclaim disk space.
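To give a feel for how similar (rather than identical) images can be matched, here is a small sketch of one common approach: an average hash compared by Hamming distance. This technique is named here as an illustration and is not confirmed by the text above as the project's method. The function names, the 8x8 grayscale input, and the `max_distance` threshold are all assumptions.

```rust
/// Average hash of an 8x8 grayscale thumbnail: one bit per pixel,
/// set when the pixel is brighter than the mean brightness.
/// Assumption: the caller has already downscaled the image to 64 pixels.
fn average_hash(pixels: &[u8; 64]) -> u64 {
    let sum: u32 = pixels.iter().map(|&p| p as u32).sum();
    let mean = sum / 64;
    let mut bits = 0u64;
    for (i, &p) in pixels.iter().enumerate() {
        if p as u32 > mean {
            bits |= 1u64 << i;
        }
    }
    bits
}

/// Number of bit positions in which two hashes differ.
fn hamming_distance(a: u64, b: u64) -> u32 {
    (a ^ b).count_ones()
}

/// Two thumbnails count as similar when their hashes differ in at most
/// `max_distance` bits (a hypothetical, user-chosen tolerance).
fn is_similar(a: &[u8; 64], b: &[u8; 64], max_distance: u32) -> bool {
    hamming_distance(average_hash(a), average_hash(b)) <= max_distance
}
```

A uniformly brightened copy of an image keeps the same bright/dark pattern and therefore the same hash, which is why this kind of comparison tolerates changes that defeat byte-for-byte hashing.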

Beyond basic deduplication, the tool provides extensive filesystem auditing to detect corrupted files and invalid references. It supports advanced workflows through features like incremental scan caching to accelerate repeated operations, automated file selection logic, and the ability to export scan results for external auditing. The core logic is also exposed as a library, enabling integration into custom scripts or external applications.
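The idea behind incremental scan caching can be sketched as a lookup keyed by path that reuses a stored result while the file looks unchanged. This is a simplified model under stated assumptions: `ScanCache`, `CacheEntry`, and `hash_for` are hypothetical names, the cache lives only in memory here, and treating "same size and same modification time" as "unchanged" is an assumption rather than something the description specifies.

```rust
use std::collections::HashMap;

/// What the sketch remembers about a file from a previous scan.
struct CacheEntry {
    size: u64,
    modified_secs: u64,
    hash: u64,
}

/// Hypothetical cache mapping a path to its last known entry.
struct ScanCache {
    entries: HashMap<String, CacheEntry>,
}

impl ScanCache {
    fn new() -> Self {
        ScanCache {
            entries: HashMap::new(),
        }
    }

    /// Return the hash for `path` and whether it came from the cache.
    /// `compute` runs only when the file is new or its size or
    /// modification time differs from the cached entry.
    fn hash_for<F: FnOnce() -> u64>(
        &mut self,
        path: &str,
        size: u64,
        modified_secs: u64,
        compute: F,
    ) -> (u64, bool) {
        if let Some(entry) = self.entries.get(path) {
            if entry.size == size && entry.modified_secs == modified_secs {
                return (entry.hash, true);
            }
        }
        // Cache miss: do the expensive work once and remember the result.
        let hash = compute();
        self.entries.insert(
            path.to_string(),
            CacheEntry {
                size,
                modified_secs,
                hash,
            },
        );
        (hash, false)
    }
}
```

On a repeated scan, only files whose size or modification time changed pay the cost of `compute`, which is where the speed-up for repeated operations comes from. A real cache would also need to be saved between runs and pruned of entries for deleted files.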

## Tags

### System Administration & Monitoring

- [Duplicate File Finders](https://awesome-repositories.com/f/system-administration-monitoring/duplicate-file-finders.md) — Identifies and removes redundant files, empty directories, and broken symbolic links to reclaim disk space. ([source](https://github.com/qarmin/czkawka/blob/master/Changelog.md))
- [Disk Cleanup Utilities](https://awesome-repositories.com/f/system-administration-monitoring/disk-cleanup-utilities.md) — Integrates storage scanning and cleanup tasks into automated workflows to maintain disk space without manual intervention. ([source](https://github.com/qarmin/czkawka#readme))
- [Content-Based Video Deduplication](https://awesome-repositories.com/f/system-administration-monitoring/duplicate-file-finders/content-based-video-deduplication.md) — Utilizes multimedia analysis frameworks to identify and remove redundant video files by comparing content. ([source](https://github.com/qarmin/czkawka/tree/master/krokiet))
- [File Integrity Scanners](https://awesome-repositories.com/f/system-administration-monitoring/file-integrity-scanners.md) — Audits storage for corrupted files and invalid references to ensure data health and system consistency. ([source](https://github.com/qarmin/czkawka#readme))
- [Temporary File Cleaners](https://awesome-repositories.com/f/system-administration-monitoring/temporary-file-cleaners.md) — Identifies and removes temporary or cache files based on predefined patterns to free up disk space. ([source](https://github.com/qarmin/czkawka/blob/master/czkawka_cli/README.md))
- [Empty Directory Scanners](https://awesome-repositories.com/f/system-administration-monitoring/empty-directory-scanners.md) — Locates and removes empty directories to clean up filesystem structures and improve organization. ([source](https://github.com/qarmin/czkawka/blob/master/czkawka_cli/README.md))
- [Empty File Scanners](https://awesome-repositories.com/f/system-administration-monitoring/empty-file-scanners.md) — Detects and removes zero-byte files to maintain a clean directory structure. ([source](https://github.com/qarmin/czkawka/blob/master/czkawka_cli/README.md))
- [Filesystem Integrity Auditors](https://awesome-repositories.com/f/system-administration-monitoring/filesystem-integrity-auditors.md) — Identifies corrupted or invalid files by verifying internal headers or performing full decoding to ensure data validity. ([source](https://github.com/qarmin/czkawka/blob/master/Changelog.md))

### Data & Databases

- [Command-Line Storage Managers](https://awesome-repositories.com/f/data-databases/storage-abstraction/command-line-storage-managers.md) — Provides a high-performance file analysis engine with both graphical and command-line interfaces for storage management.
- [Media Deduplication](https://awesome-repositories.com/f/data-databases/data-management/media-deduplication.md) — Identifies and removes redundant media files by analyzing actual content and metadata.
- [Media Similarity Scanners](https://awesome-repositories.com/f/data-databases/similarity-search/media-similarity-scanners.md) — Detects visually comparable images and duplicate audio or video files by analyzing internal content.
- [Local File Storage](https://awesome-repositories.com/f/data-databases/data-engineering-infrastructure/data-persistence-storage/data-storage/file-based-storage/local-file-storage.md) — Scans local directories to detect and remove redundant files, empty folders, and temporary data. ([source](https://github.com/qarmin/czkawka/blob/master/Changelog.md))
- [Incremental Scan Caches](https://awesome-repositories.com/f/data-databases/scan-caching-systems/incremental-scan-caches.md) — Stores previous file analysis results in a local cache so that unchanged files are skipped and repeat scans run faster. ([source](https://github.com/qarmin/czkawka#readme))

### Development Tools & Productivity

- [Command Line Interfaces](https://awesome-repositories.com/f/development-tools-productivity/command-line-interfaces.md) — Executes file analysis tasks through a command-line interface with progress tracking to support automated or headless environments. ([source](https://github.com/qarmin/czkawka/blob/master/AGENTS.md))
- [Symlink Integrity Checkers](https://awesome-repositories.com/f/development-tools-productivity/symbolic-link-managers/symlink-integrity-checkers.md) — Identifies and reports broken symbolic links to maintain filesystem health and remove invalid references. ([source](https://github.com/qarmin/czkawka/blob/master/czkawka_cli/README.md))
- [Terminal Maintenance Suites](https://awesome-repositories.com/f/development-tools-productivity/terminal-maintenance-suites.md) — Executes file cleanup operations through command-line arguments to enable scripted maintenance of disk space. ([source](https://github.com/qarmin/czkawka/tree/master/czkawka_cli))
- [Large File Scanners](https://awesome-repositories.com/f/development-tools-productivity/local-file-syncing/large-file-scanners.md) — Pinpoints the largest files on disk to help users identify and remove data consuming significant storage. ([source](https://github.com/qarmin/czkawka/blob/master/czkawka_cli/README.md))

### Operating Systems & Systems Programming

- [Filesystem Utilities](https://awesome-repositories.com/f/operating-systems-systems-programming/system-administration-maintenance/file-system-management/file-system-utilities/filesystem-utilities.md) — Audits file integrity, identifies corrupted data, and manages temporary files across local storage.
- [Automated File Analysis](https://awesome-repositories.com/f/operating-systems-systems-programming/system-administration-maintenance/file-system-management/file-system-operations/automated-file-analysis.md) — Integrates file analysis and cleanup tasks into automated scripts and headless workflows.
- [File Organization Tools](https://awesome-repositories.com/f/operating-systems-systems-programming/system-administration-maintenance/file-system-management/file-organization-tools.md) — Optimizes file organization by stripping metadata and converting media to more efficient formats. ([source](https://github.com/qarmin/czkawka#readme))

### Content Management & Publishing

- [Media Similarity Analyzers](https://awesome-repositories.com/f/content-management-publishing/media-management/library-cleaning/media-similarity-analyzers.md) — Analyzes images, videos, and music to identify visually or audibly similar content for deduplication. ([source](https://github.com/qarmin/czkawka#readme))
- [Metadata Stripping Tools](https://awesome-repositories.com/f/content-management-publishing/metadata-stripping-tools.md) — Strips specific EXIF tags from image files to protect user privacy or reduce overall file size. ([source](https://github.com/qarmin/czkawka/blob/master/Changelog.md))
- [Metadata Tagging](https://awesome-repositories.com/f/content-management-publishing/metadata-tagging.md) — Compares media files using internal metadata tags to identify duplicates across storage. ([source](https://github.com/qarmin/czkawka/blob/master/czkawka_cli/README.md))

### Software Engineering & Architecture

- [Deduplication Algorithms](https://awesome-repositories.com/f/software-engineering-architecture/hash-tables/deduplication-algorithms.md) — Identifies redundant files by hashing file contents, so detection is accurate regardless of file names.
- [Headless Logic Decoupling](https://awesome-repositories.com/f/software-engineering-architecture/headless-logic-decoupling.md) — Separates the computational engine from user interfaces to allow both automated command-line execution and interactive graphical management.
- [Library Interfaces](https://awesome-repositories.com/f/software-engineering-architecture/library-interfaces.md) — Exposes core file analysis capabilities as a reusable library to allow external applications to integrate storage management features.

### Graphics & Multimedia

- [Media Optimization Utilities](https://awesome-repositories.com/f/graphics-multimedia/media-processing-analysis/media-manipulation/media-processing/codec-encoding-utilities/media-codec-libraries/media-optimization-utilities.md) — Re-encodes video files using efficient codecs and crops static bars to reduce total disk usage. ([source](https://github.com/qarmin/czkawka/blob/master/Changelog.md))
- [Multimedia Analysis Integrations](https://awesome-repositories.com/f/graphics-multimedia/media-processing-analysis/media-manipulation/media-processing/command-line-toolkits/multimedia-format-converters/multimedia-analysis-integrations.md) — Interfaces with external libraries to perform deep content inspection and similarity analysis on complex audio and video formats.
