Czkawka is a cross-platform utility for storage optimization and filesystem maintenance. It analyzes files to identify redundant data, including duplicate files, empty directories, broken symbolic links, and temporary files. Duplicates are confirmed by hashing file contents, so they are identified regardless of file name or metadata.
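To make the idea of content-based duplicate detection concrete, here is a minimal Python sketch. It is not Czkawka's implementation: the function names `file_digest` and `find_duplicates` are hypothetical, and SHA-256 is used only because it ships with Python's standard library, not because the project is known to use it. The sketch assumes a common two-pass approach: bucket files by size first, then hash only the files that share a size.

```python
import hashlib
import os
from collections import defaultdict


def file_digest(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, read in fixed-size chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def find_duplicates(root):
    """Group regular files under root whose contents are identical.

    Hypothetical helper for illustration; names and layout are assumptions.
    """
    # Pass 1: bucket by size, since files of different sizes cannot match.
    by_size = defaultdict(list)
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            # Skip symlinks so a link and its target are not reported as duplicates.
            if os.path.isfile(path) and not os.path.islink(path):
                by_size[os.path.getsize(path)].append(path)

    # Pass 2: hash only files that share a size with at least one other file.
    by_hash = defaultdict(list)
    for size, paths in by_size.items():
        if len(paths) < 2:
            continue
        for path in paths:
            by_hash[(size, file_digest(path))].append(path)

    # Keep only groups with two or more members; file names play no role.
    return [sorted(group) for group in by_hash.values() if len(group) > 1]
```

Calling `find_duplicates("/some/directory")` returns a list of groups, each a sorted list of paths with identical contents. The size pre-filter is a design choice that avoids reading files that cannot possibly have a duplicate.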
The project offers both a native graphical user interface and a command-line interface, supporting interactive management as well as automated, headless system maintenance. Its multimedia analysis goes beyond exact file matching: it can detect visually or audibly similar media, remove metadata tags, and optimize video files to reclaim disk space.
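Detecting visually similar media requires a comparison that tolerates small differences, which exact hashing cannot do. The sketch below illustrates one well-known approach, an "average hash" compared by Hamming distance. It is an assumption for illustration only and makes no claim about the algorithm Czkawka uses. The names `average_hash`, `hamming_distance`, and `is_similar` are hypothetical, and the input is assumed to be a small grid of grayscale values that has already been decoded and downscaled.

```python
def average_hash(pixels):
    """Return a list of bits: 1 where a pixel is brighter than the grid mean."""
    flat = [value for row in pixels for value in row]
    mean = sum(flat) / len(flat)
    return [1 if value > mean else 0 for value in flat]


def hamming_distance(hash_a, hash_b):
    """Count the positions at which two equal-length bit lists differ."""
    if len(hash_a) != len(hash_b):
        raise ValueError("hashes must have the same length")
    return sum(a != b for a, b in zip(hash_a, hash_b))


def is_similar(pixels_a, pixels_b, max_distance=2):
    """Treat two grids as similar if their hashes differ in few bits.

    max_distance is an illustrative threshold, not a recommended value.
    """
    distance = hamming_distance(average_hash(pixels_a), average_hash(pixels_b))
    return distance <= max_distance
```

Because each bit only records whether a pixel is above or below the mean, a uniformly brightened copy of an image produces the same hash, while an inverted copy differs in every bit.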
Beyond basic deduplication, the tool audits the filesystem for corrupted files and invalid references. It supports incremental scan caching to speed up repeated scans, automated file selection logic, and export of scan results for external auditing. The core logic is also exposed as a library, enabling integration into custom scripts or external applications.
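Incremental scan caching can be pictured as remembering each file's hash together with the metadata that would change if the file were modified. The following Python sketch shows that idea under stated assumptions: the cache format (a JSON file keyed by path), the validity check (size plus modification time), and the names `load_cache`, `save_cache`, and `cached_digest` are all hypothetical and are not taken from Czkawka.

```python
import hashlib
import json
import os


def load_cache(cache_path):
    """Load a cache from disk, or start empty if it is missing or unreadable."""
    try:
        with open(cache_path, "r", encoding="utf-8") as handle:
            return json.load(handle)
    except (FileNotFoundError, json.JSONDecodeError):
        return {}


def save_cache(cache, cache_path):
    """Persist the cache as JSON so a later scan can reuse it."""
    with open(cache_path, "w", encoding="utf-8") as handle:
        json.dump(cache, handle)


def cached_digest(path, cache):
    """Return (digest, was_cache_hit) for path, updating cache on a miss."""
    info = os.stat(path)
    entry = cache.get(path)
    # Reuse the stored digest only if size and modification time are unchanged.
    if entry and entry["size"] == info.st_size and entry["mtime_ns"] == info.st_mtime_ns:
        return entry["digest"], True
    # Cache miss: read the whole file (fine for a sketch) and record the result.
    with open(path, "rb") as handle:
        digest = hashlib.sha256(handle.read()).hexdigest()
    cache[path] = {"size": info.st_size, "mtime_ns": info.st_mtime_ns, "digest": digest}
    return digest, False
```

On a repeated scan, unchanged files are answered from the cache without being read again, which is where the speedup comes from. The trade-off in this sketch is that a file rewritten with the same size and timestamp would not be detected as changed.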