Dot is a deep learning face swap tool used to replace faces in live video streams, recorded media, and static images. It functions as a deepfake media processor and real-time video manipulator that applies facial transformations through neural network mapping.
The system includes a virtual camera video injector that routes processed output into a system-level virtual device to simulate a physical hardware webcam. This allows generated video to be used within third-party video conferencing software.
The tool supports real-time source switching via keyboard inputs to toggle between different source images during active sessions. It utilizes a unified media pipeline to handle both live camera streams and pre-recorded files, processing frames in a continuous loop to minimize latency.