3 repository-uri
Techniques for identifying training errors through gradient checking and failure visualization.
Distinct from Machine Learning Systems: Candidates focus on low-level OS debugging or network failures, not ML-specific numerical debugging.
Explore 3 awesome GitHub repositories matching artificial intelligence & ml · Machine Learning Debugging. Refine with filters or upvote what's useful.
This project is a comprehensive Chinese translation of a technical deep learning textbook, providing an educational resource on the theory and implementation of neural networks. It functions as a collaborative technical translation project designed to make complex academic AI literature accessible to non-English speakers. The project utilizes a community-driven translation model that integrates external suggestions and pull requests to refine linguistic accuracy and reduce bias. It employs standardized terminology mapping to ensure a uniform vocabulary throughout the translated content. To i
Details methods for identifying errors by visualizing failures and comparing analytical and numerical gradients.
Lit is a machine learning interpretability framework and model debugging tool designed to analyze model behavior and performance. It serves as an interpretability dashboard for large language models and a general performance analyzer for text, image, and tabular datasets. The project distinguishes itself through a comprehensive suite of interpretability tools, including salience map generation for feature attribution, the creation of synthetic and counterfactual examples to test robustness, and the projection of high-dimensional embeddings into visual spaces via UMAP or PCA. It further enable
Provides interactive visualizations to analyze model predictions and errors and identify failure modes.
This toolkit serves as a framework for interpreting the decision-making processes of graph neural networks. It functions as a library for analyzing how these models process complex network data, providing methods to identify the specific node attributes and structural patterns that influence predictive outcomes. The project distinguishes itself by employing mask-optimized subgraph extraction and gradient-based attribution mapping to isolate the minimal components of a graph that preserve a model's original prediction. By separating graph processing layers from explanation logic, the architect
Investigates the internal logic of complex neural networks to identify biases or errors in graph data processing.