1 Repo
Analyzing the binary structure of files to ensure they match their claimed format for security purposes.
Distinct from Attachment Security Validators: Focused on detecting the true format of unknown files for security scanning, which is distinct from web content security policies.
Explore 1 awesome GitHub repository matching security & cryptography · Format Validation. Refine with filters or upvote what's useful.
Magika is an AI content type classifier and MIME type prediction engine that uses deep learning to identify file formats based on binary data. It analyzes byte sequences through a neural network to predict the content type of a file and provide associated confidence scores. The system features a foreign function interface that allows the core detection logic to be integrated across different programming languages. It includes a mechanism for configuring detection sensitivity and per-type thresholds to balance precision and recall. The project provides capabilities for bulk file analysis via
Analyzes unknown files to detect their true format for security scanning and data validation workflows.