Fauxpilot is a self-hosted AI coding assistant and local inference server. It functions as a proxy and API gateway that redirects traffic from IDE plugins to a local large language model, allowing for AI-assisted programming without external cloud dependencies.
The project provides two interfaces: a specialized API emulation layer that mimics the protocols coding assistant plugins expect, and a standardized OpenAI-compatible interface. Supported code editors can therefore use local models for completions and suggestions once their default proxy URLs are overridden to point at the local server.
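As a rough illustration of what an OpenAI-compatible interface means for a client, the sketch below builds a completion request for a local server using only the Python standard library. The base URL `http://localhost:5000/v1`, the `/completions` path, the `codegen` model name, and the helper names `build_completion_request` and `complete` are assumptions made for illustration, not values confirmed by the project; check your own deployment for the actual address and model identifier.

```python
import json
import urllib.request

# Assumed local address; replace with wherever your server actually listens.
BASE_URL = "http://localhost:5000/v1"


def build_completion_request(prompt, base_url=BASE_URL, model="codegen", max_tokens=32):
    """Build (but do not send) an OpenAI-style completion request."""
    payload = {
        "model": model,  # hypothetical model name
        "prompt": prompt,
        "max_tokens": max_tokens,
        "temperature": 0.1,
    }
    return urllib.request.Request(
        url=f"{base_url}/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def complete(prompt, **kwargs):
    """Send the request and return the text of the first completion."""
    request = build_completion_request(prompt, **kwargs)
    with urllib.request.urlopen(request, timeout=30) as response:
        body = json.load(response)
    # OpenAI-style responses carry completions in a "choices" list.
    return body["choices"][0]["text"]
```

Calling `complete("def fib(n):")` only works while a server is running at the assumed address. Editor plugins reach the same endpoint by way of their proxy or base URL setting rather than through code like this.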
The system can download and deploy local models, and it includes a format-conversion pipeline that transforms model files into versions optimized for a specific inference engine. A model-agnostic backend allows switching between inference engines while the API interfaces stay the same.
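One way to picture a model-agnostic backend is a small interface that every engine adapter implements, with the API handler depending only on that interface. The sketch below is a hypothetical design, not the project's actual code: `InferenceBackend`, `EchoBackend`, `ConstantBackend`, `BACKENDS`, and `handle_completion` are all invented names, and the two backends are trivial stand-ins for real inference engines.

```python
from typing import Dict, Protocol


class InferenceBackend(Protocol):
    """Minimal interface every (hypothetical) engine adapter must satisfy."""

    def generate(self, prompt: str, max_tokens: int) -> str: ...


class EchoBackend:
    """Stand-in engine that repeats the last line of the prompt."""

    def generate(self, prompt: str, max_tokens: int) -> str:
        lines = prompt.splitlines()
        last = lines[-1] if lines else ""
        # Truncates by characters as a crude stand-in for a token limit.
        return last[:max_tokens]


class ConstantBackend:
    """Stand-in engine that always returns the same string."""

    def generate(self, prompt: str, max_tokens: int) -> str:
        return "pass"[:max_tokens]


# Registry of available engines; swapping engines means picking another key.
BACKENDS: Dict[str, InferenceBackend] = {
    "echo": EchoBackend(),
    "constant": ConstantBackend(),
}


def handle_completion(request: dict, backend_name: str) -> dict:
    """Serve an OpenAI-style request with whichever backend is selected."""
    backend = BACKENDS[backend_name]
    text = backend.generate(request["prompt"], request.get("max_tokens", 16))
    # The response shape is identical regardless of the engine used.
    return {"choices": [{"text": text, "index": 0}]}
```

Because `handle_completion` never touches engine-specific details, changing `backend_name` alters which engine produces the text while clients see the same request and response format.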