Seldon Core is a Kubernetes-based machine learning model server and MLOps inference framework. It functions as a multi-model serving engine and pipeline orchestrator, packaging models as scalable microservices that are exposed via standardized REST and gRPC APIs. The project distinguishes itself through graph-based inference pipelines that chain models and data transformers into sequential workflows. It optimizes hardware utilization via multi-model shared serving and dynamic memory overcommit strategies, while supporting production experimentation through weighted traffic routing, A/B testin
This project is an educational curriculum and set of technical guides for building production-ready large language model and retrieval augmented generation systems. It provides instructional materials and hands-on lessons focused on model specialization, LLMOps, and the implementation of vector databases. The course covers the development of retrieval augmented generation systems, including tutorials on creating data pipelines that crawl, chunk, and embed content into vector stores. It includes training guides for the deployment, monitoring, and maintenance of language models in production en
Yellowbrick is a machine learning visualization library and model diagnostic tool designed to analyze feature importance, target distributions, and model error metrics. It serves as a visual toolkit for diagnosing underfitting and overfitting through the use of validation and learning curves. The project provides specialized suites for evaluating predictive models and unsupervised learning. It enables the determination of optimal cluster counts via elbow methods and silhouette coefficients, and assesses classifier and regressor quality through ROC curves, confusion matrices, and residual plot
Opik is an observability and evaluation platform designed for generative AI applications and agentic workflows. It provides a centralized environment for tracing execution flows, managing prompt templates, and monitoring production performance, allowing teams to gain visibility into complex model interactions and tool usage without requiring manual application code changes. The platform distinguishes itself through its integrated approach to the AI development lifecycle, combining distributed trace instrumentation with automated evaluation frameworks. It supports model-as-a-judge scoring, syn