# unsplash/datasets

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/unsplash-datasets).**

2,671 stars · 134 forks · Jupyter Notebook

## Links

- GitHub: https://github.com/unsplash/datasets
- Homepage: https://unsplash.com/data
- awesome-repositories: https://awesome-repositories.com/repository/unsplash-datasets.md

## Topics

`data` `dataset` `images` `keywords` `machine-learning` `photos` `research` `search-engine` `semantics` `unsplash`

## Description

This repository hosts an open-source image dataset for machine learning. It provides large-scale collections of high-quality photos and their metadata for training computer vision models and for research into image categorization and retrieval.

The repository offers semantic search datasets that pair images with AI-generated and human-supplied keywords, so that search intent and visual metaphors can be analyzed. It also serves as an image metadata archive, providing structured EXIF data and camera specifications for technical analysis.

The data supports image semantic research, visual metadata analysis, and the study of user curation and interaction statistics. The datasets can be loaded into a programming environment to analyze image properties, color profiles, and the relationship between search queries and photo downloads.
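
The loading step described above can be sketched in a few lines of Python. This is a minimal illustration, not the repository's own loader: it assumes the dataset has been downloaded as tab-separated files split into numbered parts (for example `photos.tsv000`), and the helper name `load_table` is hypothetical.

```python
import csv
import glob
import os


def load_table(directory, name):
    """Read every `<name>.tsv*` part in `directory` into a list of dict rows.

    Assumption: each part is a tab-separated file with its own header row.
    """
    rows = []
    pattern = os.path.join(directory, name + ".tsv*")
    # Sort so that parts are read in a stable order (tsv000, tsv001, ...).
    for path in sorted(glob.glob(pattern)):
        with open(path, newline="", encoding="utf-8") as handle:
            rows.extend(csv.DictReader(handle, delimiter="\t"))
    return rows
```

Calling `load_table("./unsplash", "photos")` would return one dictionary per photo, keyed by the column names in the file header. All values arrive as strings, so numeric columns need explicit conversion before analysis.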

## Tags

### Artificial Intelligence & ML

- [Machine Learning Datasets](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/machine-learning-datasets.md) — Provides large-scale structured collections of high-quality images and metadata for training and validating computer vision models.
- [General Purpose Image Datasets](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/machine-learning-datasets/image-classification-datasets/general-purpose-image-datasets.md) — Provides large-scale collections of high-quality photos and metadata designed for training computer vision models. ([source](https://unsplash.com/blog/the-unsplash-dataset/))
- [Semantic Content Analysis](https://awesome-repositories.com/f/artificial-intelligence-ml/image-content-analyzers/semantic-content-analysis.md) — Pairs images with AI and human keywords to study user intent and image categorization. ([source](https://unsplash.com/blog/the-unsplash-dataset/))
- [Open-Source Vision Datasets](https://awesome-repositories.com/f/artificial-intelligence-ml/open-source-vision-datasets.md) — Offers a publicly available set of image assets and labels for research into image categorization and retrieval.
- [Datasets](https://awesome-repositories.com/f/artificial-intelligence-ml/semantic-search/datasets.md) — Pairs photos with AI and human keywords to analyze search intent and visual metaphors.
- [Visual Semantic Research](https://awesome-repositories.com/f/artificial-intelligence-ml/visual-semantic-research.md) — Studies how users search for visuals by analyzing pairs of images and keywords to understand search intent.

### Data & Databases

- [Image Search Datasets](https://awesome-repositories.com/f/data-databases/image-search-datasets.md) — A collection of open-source image and search data used for training machine learning models and studying image semantics. ([source](https://github.com/unsplash/datasets/blob/master/how-to/README.md))
- [Static Dataset Distributions](https://awesome-repositories.com/f/data-databases/static-dataset-distributions.md) — Provides large-scale image and metadata collections as downloadable open-source files for machine learning.
- [Semantic Image Mappings](https://awesome-repositories.com/f/data-databases/curated-datasets/semantic-image-mappings.md) — Implements semantic mapping by linking images to conceptual keywords via AI tags and human metadata.
- [Visual Content Searches](https://awesome-repositories.com/f/data-databases/search-indexing-technologies/search-indexing/search-and-indexing/content-search-filters/visual-content-searches.md) — Enables finding images using keywords and community tags while applying content safety filters. ([source](https://unsplash.com/documentation))
- [Visual Search Intent Analysis](https://awesome-repositories.com/f/data-databases/search-indexing-technologies/search-indexing/search-information-retrieval/search-engine-platforms/search-and-analytics-engines/search-query-analyzers/visual-search-intent-analysis.md) — Examines billions of visual queries to identify high-level scene concepts and complex visual metaphors. ([source](https://unsplash.com/data))
- [Visual Semantic Mapping](https://awesome-repositories.com/f/data-databases/search-indexing-technologies/search-indexing/search-information-retrieval/semantic-search-engines/visual-semantic-search/visual-semantic-mapping.md) — Maps photos to keywords using confidence scores derived from AI services and human suggestions. ([source](https://github.com/unsplash/datasets/blob/master/DOCS.md))
- [Image Search Aggregators](https://awesome-repositories.com/f/data-databases/search-result-aggregators/image-search-aggregators.md) — Provides a specialized mechanism for fetching and filtering high-quality photos based on search queries. ([source](https://unsplash.com/developers))
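
The confidence-score mapping mentioned in the Visual Semantic Mapping entry above could be used roughly as follows. This is a sketch under stated assumptions: the column names (`photo_id`, `keyword`, `ai_service_1_confidence`, `ai_service_2_confidence`, `suggested_by_user`), the `t`/`f` boolean encoding, and the threshold of 50 are illustrative and should be checked against the dataset documentation.

```python
# Assumed column names for the two AI confidence scores.
CONFIDENCE_COLUMNS = ("ai_service_1_confidence", "ai_service_2_confidence")


def confident_keywords(rows, min_confidence=50.0):
    """Group keywords by photo, keeping user-suggested or high-confidence ones.

    `rows` is an iterable of dicts, one per photo/keyword pair.
    """
    by_photo = {}
    for row in rows:
        # Empty cells mean the service produced no score for this pair.
        scores = [float(row[c]) for c in CONFIDENCE_COLUMNS if row.get(c)]
        # Assumption: booleans are exported as "t"/"f" or "true"/"false".
        from_user = row.get("suggested_by_user") in ("t", "true", "True")
        if from_user or (scores and max(scores) >= min_confidence):
            by_photo.setdefault(row["photo_id"], []).append(row["keyword"])
    return by_photo
```

Keeping a keyword when either source vouches for it is one possible design choice; a stricter analysis might require agreement between both AI scores.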

### Part of an Awesome List

- [Machine Learning Datasets](https://awesome-repositories.com/f/awesome-lists/data/machine-learning-datasets.md) — Serves as a comprehensive library of high-quality images and metadata for developing AI applications.
- [Semantic Image Datasets](https://awesome-repositories.com/f/awesome-lists/data/semantic-image-datasets.md) — Provides sets of photos and search data specifically for training models and studying image categorization. ([source](https://cdn.jsdelivr.net/gh/unsplash/datasets@master/README.md))

### Graphics & Multimedia

- [Metadata Archives](https://awesome-repositories.com/f/graphics-multimedia/exif-metadata-handling/metadata-archives.md) — Provides a structured repository of EXIF data, camera specifications, and user interaction statistics for technical analysis.
- [Technical Image Analysis](https://awesome-repositories.com/f/graphics-multimedia/exif-metadata-handling/technical-image-analysis.md) — Processes EXIF data, color profiles, and camera settings to study technical image properties and photography trends.
- [Technical Metadata Analysis](https://awesome-repositories.com/f/graphics-multimedia/exif-metadata-handling/technical-metadata-analysis.md) — Analyzes raw EXIF data to study the impact of camera settings and lens models on photo popularity.
- [Image Metadata Retrieval](https://awesome-repositories.com/f/graphics-multimedia/image-metadata-retrieval.md) — Fetches detailed image metadata including view counts, like totals, and download statistics. ([source](https://unsplash.com/documentation))
- [Blur Hash Placeholders](https://awesome-repositories.com/f/graphics-multimedia/blur-hash-placeholders.md) — Generates compact hash strings to render blurred preview images while high-resolution assets load.
- [Dominant Color Extraction](https://awesome-repositories.com/f/graphics-multimedia/dominant-color-extraction.md) — Identifies dominant colors in images by calculating hex codes, RGB values, and pixel coverage. ([source](https://github.com/unsplash/datasets/blob/master/DOCS.md))
- [EXIF Metadata Handling](https://awesome-repositories.com/f/graphics-multimedia/exif-metadata-handling.md) — Includes structured EXIF data and camera specifications for technical analysis of image properties. ([source](https://github.com/unsplash/datasets/blob/master/DOCS.md))
- [Dimension Resizing](https://awesome-repositories.com/f/graphics-multimedia/image-editing-processing/image-processing/dimension-resizing.md) — Adjusts image dimensions, cropping, and quality in real-time based on request parameters. ([source](https://unsplash.com/developers))
- [Image Metadata Extraction](https://awesome-repositories.com/f/graphics-multimedia/image-metadata-extraction.md) — Enables the retrieval of detailed technical specifications, EXIF data, and geographic markers from images. ([source](https://unsplash.com/blog/the-unsplash-dataset/))
- [Technical Photo Metadata Analysis](https://awesome-repositories.com/f/graphics-multimedia/technical-photo-metadata-analysis.md) — Provides datasets enabling the analysis of how camera brands and focal lengths correlate with photo popularity. ([source](https://unsplash.com/data))
- [URL-Driven Image Transformations](https://awesome-repositories.com/f/graphics-multimedia/url-driven-image-transformations.md) — Modifies image dimensions and quality in real-time by parsing parameters embedded within the request URL.
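
The URL-driven transformations listed above amount to adding query parameters to an image URL. The helper below is a hypothetical sketch of that idea using only the Python standard library; the parameter names `w` (width) and `q` (quality) and the example host are assumptions, so consult the image service documentation for the supported set.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def with_image_params(url, **params):
    """Return `url` with `params` merged into its query string.

    Existing query parameters are kept; a repeated key is overwritten.
    """
    parts = urlsplit(url)
    query = dict(parse_qsl(parts.query, keep_blank_values=True))
    query.update({key: str(value) for key, value in params.items()})
    return urlunsplit(parts._replace(query=urlencode(query)))


# Placeholder URL, not a real photo.
resized = with_image_params("https://images.example.com/photo-1?ixid=abc", w=400, q=80)
```

Merging into the existing query string, rather than replacing it, preserves any tracking or signing parameters that the original URL already carries.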

### Software Engineering & Architecture

- [Visual Popularity Metrics](https://awesome-repositories.com/f/software-engineering-architecture/project-management-governance/repository-maintenance/repository-metadata/popularity-metrics/community-popularity-sorting/visual-popularity-metrics.md) — Determines the quality and appeal of images by analyzing millions of keyword-associated user interactions. ([source](https://unsplash.com/data))

### Web Development

- [Dynamic Image Generation](https://awesome-repositories.com/f/web-development/dynamic-image-generation.md) — Allows embedding a searchable photo library into applications with real-time resizing via URL parameters.
- [Dynamic Image Services](https://awesome-repositories.com/f/web-development/dynamic-image-services.md) — Provides a service to transform image assets in real-time based on request parameters in the URL. ([source](https://unsplash.com/documentation))
