What are the best open-source alternatives to Amundsen?

30 open-source projects similar to amundsen-io/amundsen, ranked by shared features. Top picks: datahub-project/datahub, linkedin/datahub, open-metadata/openmetadata, awslabs/aws-data-wrangler, apache/gravitino, ckan/ckan, lancedb/lancedb, eventual-inc/daft, dbt-labs/dbt-core, gojek/feast.

Is datahub-project/datahub a good alternative to Amundsen?

DataHub is a metadata management platform designed to unify technical, operational, and business context across diverse data ecosystems. By utilizing a graph-based metadata model and an event-driven ingestion architecture, it creates a centralized source of truth that maps complex data relationship…

Is linkedin/datahub a good alternative to Amundsen?

DataHub is a metadata management system and data catalog platform designed to provide a centralized directory for discovering, managing, and documenting datasets across a diverse data stack. It serves as a comprehensive framework for metadata management, incorporating a data governance framework to…

Is open-metadata/openmetadata a good alternative to Amundsen?

OpenMetadata is an enterprise data catalog, metadata platform, and governance suite that functions as a knowledge graph for data assets. It serves as an AI-ready metadata layer, providing governed context and organizational memory to large language model agents via the Model Context Protocol. The…

Is awslabs/aws-data-wrangler a good alternative to Amundsen?

This project is an AWS pandas integration library and data pipeline framework designed to simplify the movement and transformation of data between local memory and AWS storage and analytics services. It functions as a cloud data lake toolkit and storage file manager, allowing users to read, write,…

Is apache/gravitino a good alternative to Amundsen?

Gravitino is a federated metadata lake and unified data catalog designed to manage tables, files, and AI models across diverse data sources and cloud storage. It serves as a centralized interface for governing schemas, access controls, and tagging across relational databases, messaging queues, and…

Is ckan/ckan a good alternative to Amundsen?

CKAN is an open-source data management platform that provides the foundation for building data portals. It supports the full lifecycle of datasets—from creation and organization to publishing, cataloging with faceted search, and interactive data visualization—all through a web interface. The platf…

Is lancedb/lancedb a good alternative to Amundsen?

LanceDB is a vector database and columnar data store designed to function as a versioned dataset manager and vector search engine. It serves as a high-performance backend for indexing and retrieving high-dimensional embeddings, providing the foundation for machine learning data pipelines. The syst…

Is eventual-inc/daft a good alternative to Amundsen?

Daft is a distributed dataframe library and multimodal data processor designed to handle large-scale structured and unstructured data. It functions as a vectorized execution engine that processes tables alongside images, audio, and video, utilizing a unified schema to manage diverse data types. Th…

Is dbt-labs/dbt-core a good alternative to Amundsen?

dbt-core is a command-line framework for transforming data within a warehouse using modular SQL and version control. It functions as a data transformation engine that enables users to define data structures and business logic through declarative configuration files, which the system then compiles i…

Is gojek/feast a good alternative to Amundsen?

Feast is a machine learning feature store and MLOps data infrastructure layer. It provides a centralized system for managing and serving features across offline training and online production environments, utilizing an online feature serving layer for low-latency retrieval. The project centers on…


Open-source alternatives to Amundsen

30 open-source projects similar to amundsen-io/amundsen, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Amundsen alternative.

datahub-project/datahub
12,141 stars
DataHub is a metadata management platform designed to unify technical, operational, and business context across diverse data ecosystems. By utilizing a graph-based metadata model and an event-driven ingestion architecture, it creates a centralized source of truth that maps complex data relationships, lineage, and ownership. This foundational framework enables organizations to maintain a synchronized view of their data landscape, supporting both human-led discovery and automated data operations. The platform distinguishes itself through its focus on grounding artificial intelligence and autono
Python, data-catalog, data-discovery, data-governance
linkedin/datahub
12,106 stars
DataHub is a metadata management system and data catalog platform designed to provide a centralized directory for discovering, managing, and documenting datasets across a diverse data stack. It serves as a comprehensive framework for metadata management, incorporating a data governance framework to classify sensitive information and assign ownership for organizational accountability. The platform distinguishes itself through AI-enabled data discovery, which connects large language models to a metadata graph to allow for natural language search and exploration of data assets. It also provides
Python
open-metadata/OpenMetadata
14,213 stars
OpenMetadata is an enterprise data catalog, metadata platform, and governance suite that functions as a knowledge graph for data assets. It serves as an AI-ready metadata layer, providing governed context and organizational memory to large language model agents via the Model Context Protocol. The platform distinguishes itself by capturing institutional knowledge, linking conversations, decisions, and remediation notes directly to data assets to preserve tribal knowledge. It integrates AI agents to automate metadata governance, such as suggesting descriptions and identifying sensitive data thr
TypeScript, context, context-layer, data-catalog

Open-source alternatives to Amundsen

datahub-project/datahub

linkedin/datahub

open-metadata/OpenMetadata

awslabs/aws-data-wrangler

apache/gravitino

ckan/ckan

lancedb/lancedb

Eventual-Inc/Daft

dbt-labs/dbt-core

gojek/feast

MarquezProject/marquez

apache/atlas

kedro-org/kedro

lin-ycv/EverythingPowerToys

hanc00l/wooyun_public

fish2018/pansou

opensearch-project/OpenSearch

mckinsey/vizro

mksglu/context-mode

koel/koel

osm-search/Nominatim

quantumblacklabs/kedro

RediSearch/RediSearch

algolia/autocomplete

liangliangyy/DjangoBlog

hect0x7/JMComic-Crawler-Python

olivernn/lunr.js

GoogleTrends/data

alibaba/zvec

rudderlabs/rudder-server