# nielsrogge/transformers-tutorials

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/nielsrogge-transformers-tutorials).**

11,641 stars · 1,721 forks · Jupyter Notebook · MIT

## Links

- GitHub: https://github.com/NielsRogge/Transformers-Tutorials
- awesome-repositories: https://awesome-repositories.com/repository/nielsrogge-transformers-tutorials.md

## Topics

`bert` `gpt-2` `layoutlm` `pytorch` `transformers` `vision-transformer`

## Description

This is a collection of tutorials and practical demonstrations for implementing machine learning tasks using the HuggingFace Transformers library. It serves as a guide for applying transformer architectures across computer vision, natural language processing, and audio analysis.

The repository provides implementation examples for multimodal model deployment, including the combination of text, image, and audio inputs. It includes resources for optimizing pre-trained models through fine-tuning on custom datasets and provides examples for preparing PyTorch datasets by converting raw files into tensors and batches.

The covered capabilities span various machine learning domains, including object detection, image segmentation, and depth estimation in computer vision, as well as audio signal classification and text categorization. It also covers the generation of visual content and the extraction of information from document images.

## Tags

### Part of an Awesome List

- [Transformer Implementations](https://awesome-repositories.com/f/awesome-lists/ai/transformer-implementations.md) — Serves as a comprehensive guide for implementing and deploying transformer architectures across various ML domains. ([source](https://github.com/nielsrogge/transformers-tutorials#readme))
- [Document Parsing and Extraction](https://awesome-repositories.com/f/awesome-lists/data/document-parsing-and-extraction.md) — Demonstrates how to parse document images to classify layouts and extract structured information. ([source](https://github.com/nielsrogge/transformers-tutorials#readme))
- [Audio Event Classification](https://awesome-repositories.com/f/awesome-lists/media/audio-and-sounds/audio-event-classification.md) — Implements models to categorize audio files and signals into specific labels using transformer architectures.

### Education & Learning Resources

- [Transformer Tutorials](https://awesome-repositories.com/f/education-learning-resources/technical-domain-education/ai-machine-learning-education/neural-network-architectures/recurrent-neural-network-tutorials/transformer-tutorials.md) — Offers a comprehensive collection of tutorials and notebooks for implementing transformer-based architectures across various modalities.
- [Dataset Preparation Tutorials](https://awesome-repositories.com/f/education-learning-resources/dataset-preparation-tutorials.md) — Includes guides for converting raw files into tensors and batches for PyTorch machine learning pipelines.

### Artificial Intelligence & ML

- [Computer Vision](https://awesome-repositories.com/f/artificial-intelligence-ml/computer-vision-systems/computer-vision.md) — Provides a wide range of computer vision implementations including object detection, segmentation, and depth estimation.
- [Implementation Guides](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/architectures/multimodal-perception-models/multimodal-vision-models/implementation-guides.md) — Provides implementation examples for deploying models that combine text, image, and audio inputs.
- [Model Fine-Tuning](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/model-training-and-tuning/fine-tuning-and-customization/model-fine-tuning.md) — Provides practical demonstrations for adapting pre-trained transformer models to custom datasets using training loops. ([source](https://github.com/nielsrogge/transformers-tutorials#readme))
- [Model Fine-tuning Resources](https://awesome-repositories.com/f/artificial-intelligence-ml/model-fine-tuning-resources.md) — Provides code samples and educational materials for optimizing pre-trained models using training loops and accelerators.
- [Multimodal Processing](https://awesome-repositories.com/f/artificial-intelligence-ml/multimodal-processing.md) — Implements techniques for combining text, image, and audio data into shared latent spaces for cross-modal analysis.
- [Natural Language Processing](https://awesome-repositories.com/f/artificial-intelligence-ml/natural-language-processing.md) — Implements natural language processing tasks such as text classification, document information extraction, and image captioning.
- [Pre-trained Model Application](https://awesome-repositories.com/f/artificial-intelligence-ml/pre-trained-model-application.md) — Demonstrates how to use weights from pre-trained transformer networks to perform predictions on new data.
- [Supervised Fine-Tuning](https://awesome-repositories.com/f/artificial-intelligence-ml/supervised-fine-tuning.md) — Includes resources for optimizing pre-trained models through supervised fine-tuning on custom labeled datasets.
- [Object Detection](https://awesome-repositories.com/f/artificial-intelligence-ml/computer-vision-systems/computer-vision/object-detection-tracking/object-detection.md) — Provides implementation examples for identifying and locating specific objects within images using bounding boxes. ([source](https://github.com/nielsrogge/transformers-tutorials#readme))
- [Depth Estimation](https://awesome-repositories.com/f/artificial-intelligence-ml/computer-vision-systems/computer-vision/object-pose-estimations/monocular-depth-estimators/multi-view-depth-estimators/depth-estimation.md) — Implements algorithms to calculate the distance of objects from the camera using single-image depth estimation. ([source](https://github.com/nielsrogge/transformers-tutorials#readme))
- [Image Segmentation](https://awesome-repositories.com/f/artificial-intelligence-ml/computer-vision-systems/image-segmentation.md) — Provides guides for isolating specific objects or semantic regions within images using segmentation techniques. ([source](https://github.com/nielsrogge/transformers-tutorials#readme))
- [Synthetic Media Generators](https://awesome-repositories.com/f/artificial-intelligence-ml/generative-ai-resources/diffusion-visual-models/synthetic-content-generators/synthetic-media-generators.md) — Includes examples for generating synthetic images from text prompts and creating descriptive captions for media. ([source](https://github.com/nielsrogge/transformers-tutorials#readme))
- [Hardware Acceleration](https://awesome-repositories.com/f/artificial-intelligence-ml/hardware-acceleration.md) — Provides examples for utilizing GPUs to accelerate the training and inference of transformer models.
- [Data Preparation Tools](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/data-ingestion-preparation/data-preparation-tools.md) — Offers utilities and examples for cleaning and transforming raw data into formats suitable for transformer model ingestion.
- [Tensor Conversion Utilities](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/machine-learning-datasets/tensor-conversion-utilities.md) — Provides examples for preparing PyTorch datasets by converting raw files into tensors and batches. ([source](https://github.com/nielsrogge/transformers-tutorials#readme))
- [Text Classifiers](https://awesome-repositories.com/f/artificial-intelligence-ml/text-classifiers.md) — Demonstrates the categorization of text sequences and audio signals into predefined labels. ([source](https://github.com/nielsrogge/transformers-tutorials#readme))
- [Zero-Shot Inference](https://awesome-repositories.com/f/artificial-intelligence-ml/zero-shot-inference.md) — Covers methods for performing classification or segmentation on new categories without specific training data.

### Data & Databases

- [Dataset Tensor Mappings](https://awesome-repositories.com/f/data-databases/data-processing-pipelines/data-transformation/array-tensor-manipulation/tensor-transformations/dataset-tensor-mappings.md) — Provides examples for converting raw input files into tensors and batches for efficient model processing.
