CapsWriter-Offline is a suite of desktop tools that runs without an internet connection, combining local media browsing, voice dictation, audio and video transcription, and 360-degree media viewing in a single application. Its focus is offline operation for both media handling and speech-to-text workflows.
What distinguishes it is the integration of voice dictation with a persistent local storage layer that saves every audio recording and keeps daily transcript logs, along with a rule-based text normalization engine that converts spoken number phrases and applies user-defined substitutions using phonetic matching and regular expressions. Recognized speech can be routed to a language model for polishing or for role-specific processing keyed to predefined names. For media, the tool offers a transactional file operation manager for moving, renaming, and deleting files with undo support, and a panoramic rendering engine that displays equirectangular 360-degree video and images with a draggable viewport and device-tilt interaction.
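One way to picture the undo support described above is a journal that records, for each operation, how to reverse it. The following is a minimal sketch under stated assumptions, not the project's actual implementation: the class name `FileOpJournal`, its methods, and the idea of treating deletion as a move into a trash directory are all hypothetical and chosen only for illustration.

```python
import shutil
from pathlib import Path


class FileOpJournal:
    """Hypothetical undo journal: each operation records how to reverse it."""

    def __init__(self, trash_dir: Path) -> None:
        # Assumption: deleted files are parked here instead of being removed.
        self.trash_dir = trash_dir
        self.trash_dir.mkdir(parents=True, exist_ok=True)
        # Stack of (current_location, original_location) pairs.
        self._undo_stack = []

    def move(self, src: Path, dst: Path) -> Path:
        """Move or rename src to dst and remember where it came from."""
        if dst.exists():
            # Refuse to overwrite, since an overwrite could not be undone.
            raise FileExistsError(dst)
        dst.parent.mkdir(parents=True, exist_ok=True)
        shutil.move(str(src), str(dst))
        self._undo_stack.append((dst, src))
        return dst

    def delete(self, src: Path) -> Path:
        """'Delete' by moving the file into the trash so it stays undoable."""
        target = self.trash_dir / f"{len(self._undo_stack)}_{src.name}"
        return self.move(src, target)

    def undo(self) -> bool:
        """Reverse the most recent operation; return False if none is left."""
        if not self._undo_stack:
            return False
        current, original = self._undo_stack.pop()
        original.parent.mkdir(parents=True, exist_ok=True)
        shutil.move(str(current), str(original))
        return True
```

In this sketch a rename is simply a move within the same directory, and calling `undo()` repeatedly walks back through the operations in reverse order. A real manager would likely also need to handle directories, cross-device moves, and purging the trash.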
Additional capabilities include thumbnail generation with caching and manual refresh or purge, a customizable grid display for browsing images with adjustable sorting and column count, and EXIF metadata display. For audio and video files, speech can be extracted to produce subtitles, plain text, and timestamps for offline analysis.
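To illustrate how timestamped recognition results could become subtitles and plain text, here is a small sketch that assumes the recognizer yields segments with start and end times in seconds. The `Segment` type and the helper names are hypothetical, and SubRip (SRT) is used only as a common subtitle format; the source does not say which formats the tool actually writes.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """Hypothetical recognized segment: start/end in seconds plus its text."""
    start: float
    end: float
    text: str


def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as the SRT timestamp HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments) -> str:
    """Render segments as numbered SRT cues separated by blank lines."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n"
            f"{srt_timestamp(seg.start)} --> {srt_timestamp(seg.end)}\n"
            f"{seg.text}\n"
        )
    return "\n".join(blocks)


def to_plain_text(segments) -> str:
    """Drop the timing information and keep one line of text per segment."""
    return "\n".join(seg.text for seg in segments)
```

Keeping the segments as structured data and rendering them at the end means the same recognition pass can feed all three outputs (subtitles, plain text, and raw timestamps) without re-running transcription.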