LimiX is a tabular foundation model and a suite of tools for structured data, providing a transformer-based system for classification, regression, and data generation. It includes a causal inference engine to determine cause-and-effect relationships, a synthetic data generator, and a framework for filling missing dataset values through feature context prediction.
The project optimizes tabular inference through a high-performance system that uses ensemble-based sample retrieval to increase prediction speed and accuracy on high-specification hardware. It further distinguishes itself by using transformer-based encoding and masked-feature pretraining to learn data distributions.
The system covers a broad range of analytical capabilities, including high-dimensional vector embedding for categorical separation and the creation of synthetic samples via causal-graph data generation. Its predictive surface extends to specific applications such as electricity market price forecasting and the analysis of molecular properties in organic molecules.