Co-tracker is a PyTorch point tracking framework and dense point tracking model designed to map the motion of individual pixels throughout a video. It functions as a video pixel tracker that predicts point trajectories and visibility masks across sequences of video frames.
The project includes a computer vision training pipeline that utilizes teacher-student knowledge distillation. This allows for the generation of pseudo-labels from unannotated real video data to fine-tune pre-trained models and reduce the gap between synthetic and real data environments.
The framework provides capabilities for video motion analysis and visual tracking evaluation, including tools for rendering trajectories and visibility masks to inspect tracking accuracy. It supports both offline context and online streaming processing for video sequence analysis.