# Self-Hosted Data Integration Platforms

> Search results for `self-hosted Fivetran alternative for syncing data sources` on awesome-repositories.com. 117 total matches; showing the first 50.

Explore on the web: https://awesome-repositories.com/q/self-hosted-fivetran-alternative-for-syncing-data-sources

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [this search on awesome-repositories.com](https://awesome-repositories.com/q/self-hosted-fivetran-alternative-for-syncing-data-sources).**

## Results

- [appwrite/appwrite](https://awesome-repositories.com/repository/appwrite-appwrite.md) (56,318 ⭐) — Appwrite is a backend-as-a-service platform that provides a unified development environment for building full-stack applications. It integrates essential infrastructure components—including authentication, databases, storage, and serverless functions—into a single, centralized interface to simplify application development and resource management.

The platform distinguishes itself through a container-based microservices architecture that ensures consistent execution across diverse infrastructure. It features a versatile connectivity layer that links frontend applications with third-party servi
- [ankitects/anki](https://awesome-repositories.com/repository/ankitects-anki.md) (28,571 ⭐) — Anki is a cross-platform flashcard management system designed to optimize long-term memory retention through spaced-repetition learning. It functions as a digital learning assistant that uses active recall practice and automated scheduling algorithms to determine the ideal timing for card reviews based on individual performance history. The core system relies on a local relational database to ensure data persistence and portability, while supporting complex study workflows through flexible note-type schema modeling and template-driven content rendering.

The platform distinguishes itself throu
- [asciinema/asciinema](https://awesome-repositories.com/repository/asciinema-asciinema.md) (16,852 ⭐) — Asciinema is a platform for capturing, replaying, and sharing command-line sessions. It provides a comprehensive suite of tools to record terminal activity into lightweight, text-based files that preserve ANSI escape sequences, allowing users to document technical workflows, troubleshooting steps, and software demonstrations with high fidelity.

The project distinguishes itself through its versatile playback and distribution capabilities. It features a web-based player that renders interactive terminal sessions directly in the browser, supporting features like seeking, playback speed control,
- [owncloud/core](https://awesome-repositories.com/repository/owncloud-core.md) (8,793 ⭐) — This project is a self-hosted cloud storage server and content collaboration platform implemented as a PHP web application framework. It functions as a centralized server for storing, synchronizing, and sharing files and data across multiple devices.

The system operates as a WebDAV file server and a synchronization server for scheduling and contact information using CalDAV and CardDAV protocols. It features a plugin architecture that allows the server to be extended with additional applications and custom features.

The platform provides capabilities for user and group access management, exte
- [stoatchat/self-hosted](https://awesome-repositories.com/repository/stoatchat-self-hosted.md) (2,497 ⭐) — This project is a self-hosted communication suite and private messaging infrastructure. It is a containerized chat platform designed for deployment on independent hardware to maintain full control over user data and server dependencies.

The system features a modular plugin framework that allows custom features and behaviors to be loaded into the client at runtime via manifest files. It is designed as a proxy-compatible service, supporting configurable network port routing to operate behind external reverse proxy servers.

The platform covers capabilities for containerized service orchestratio
- [airbytehq/airbyte](https://awesome-repositories.com/repository/airbytehq-airbyte.md) (21,472 ⭐) — Airbyte is a data integration platform designed to synchronize information between diverse applications, databases, and data warehouses. It functions as an extract, transform, and load orchestrator that manages automated data movement workflows across cloud, on-premise, and hybrid environments. The platform provides a standardized interface for connectors, enabling the movement of structured and unstructured data while maintaining stateful checkpoints for reliable incremental syncing.

The platform distinguishes itself through a containerized architecture that isolates connectors to prevent de
- [clickhouse/clickhouse](https://awesome-repositories.com/repository/clickhouse-clickhouse.md) (48,229 ⭐) — ClickHouse is a high-performance, columnar analytical database designed for real-time query execution and large-scale data aggregation. It functions as a distributed data warehouse capable of processing petabytes of information, while also providing an embedded engine that integrates directly into applications for native query capabilities without external dependencies. The system is built to handle high-throughput ingestion and complex analytical workloads, delivering millisecond-level latency for interactive dashboards and operational monitoring.

The platform distinguishes itself through ad
- [getsentry/self-hosted](https://awesome-repositories.com/repository/getsentry-self-hosted.md) (9,426 ⭐) — This project is a containerized error tracking platform and monitoring suite designed for self-hosted deployment on private infrastructure. It provides a collection of services for capturing and analyzing software crashes and exceptions, ensuring that sensitive application data remains within a controlled environment.

The system includes specialized tooling for air-gapped deployment, allowing the software to be installed and operated on servers without internet access through the manual transfer of container images. It also supports corporate network integration via proxy configurations to ma
- [coollabsio/coolify](https://awesome-repositories.com/repository/coollabsio-coolify.md) (57,055 ⭐) — This project is a self-hosted platform-as-a-service that provides a centralized management interface for deploying, configuring, and monitoring containerized applications and databases on private infrastructure. It functions as a visual control plane, automating the end-to-end lifecycle of services from source code to production. By managing container orchestration, networking, and resource allocation, it allows users to maintain full control over their own hardware while streamlining the delivery of software.

The platform distinguishes itself through its agentless architecture, which uses se
- [datawranglerai/self-host-n8n-on-gcr](https://awesome-repositories.com/repository/datawranglerai-self-host-n8n-on-gcr.md) (608 ⭐) — Self-host n8n on Google Cloud without the subscription fees or server headaches - because your automation workflows shouldn't cost more than your coffee budget
- [laurent22/joplin](https://awesome-repositories.com/repository/laurent22-joplin.md) (55,233 ⭐) — Joplin is an open-source, cross-platform note-taking application designed for secure, private knowledge management. It functions as a local-first productivity platform, maintaining a complete relational database on the user's device to ensure offline availability and high-performance data retrieval. The application prioritizes data sovereignty by implementing an end-to-end encryption layer, which secures all information locally with a master key before any synchronization occurs.

The platform distinguishes itself through a delta-based synchronization engine that transmits only specific file c
- [marmelab/react-admin](https://awesome-repositories.com/repository/marmelab-react-admin.md) (26,780 ⭐) — React-admin is a framework for building data-driven administrative interfaces that connect to REST or GraphQL backends. It provides a comprehensive suite of tools for managing the full lifecycle of administrative applications, including resource-oriented routing, declarative form scaffolding, and context-driven state management. By utilizing a modular adapter-based architecture, the framework abstracts backend communication, allowing developers to build consistent CRUD interfaces that handle data fetching, authentication, and synchronization automatically.

The project distinguishes itself thr
- [heyform/heyform](https://awesome-repositories.com/repository/heyform-heyform.md) (8,852 ⭐) — Heyform is an open-source form builder and self-hosted data collection platform. It provides a no-code designer for creating dynamic web-based surveys and input forms, supported by an extensible backend for managing submissions and storing results in a private database.

The system distinguishes itself through advanced form logic and branding controls. It includes a conditional logic engine to show or hide sections based on user responses and allows for precise visual identity customization through configurable themes and custom CSS injection.

The platform covers a broad range of operational
- [n8n-io/self-hosted-ai-starter-kit](https://awesome-repositories.com/repository/n8n-io-self-hosted-ai-starter-kit.md) (14,997 ⭐) — This project provides a dockerized AI workflow stack and orchestration templates for deploying a self-hosted AI environment. It establishes a localized infrastructure for building autonomous agents and model chains that process private data on-premises without external cloud dependencies.

The environment is designed to support autonomous agent development, allowing models to dynamically select tools, execute shell commands, and interact with local file systems. It includes integrated vector database support to enable retrieval augmented generation and private document analysis.

The stack cov
- [formbricks/formbricks](https://awesome-repositories.com/repository/formbricks-formbricks.md) (12,391 ⭐) — Formbricks is an open-source survey and feedback platform designed to help teams capture and analyze user insights through targeted, in-app, and website-based interactions. It functions as a comprehensive customer experience analytics system that allows organizations to maintain full control over their data, user attributes, and survey workflows.

The platform distinguishes itself through its event-driven architecture, which enables precise behavioral targeting by triggering surveys based on specific user actions or application events. It supports deep integration with external ecosystems by a
- [3lvis/sync](https://awesome-repositories.com/repository/3lvis-sync.md) (2,543 ⭐) — JSON to SwiftData and back. SwiftData Sync.
- [keen/dashboards](https://awesome-repositories.com/repository/keen-dashboards.md) (11,038 ⭐) — This project is a collection of responsive CSS Grid dashboard templates and a data visualization UI kit. It provides a set of HTML layouts designed for building analytics interfaces and monitoring views for KPIs and business metrics that adapt to different screen sizes.

The toolkit is library-agnostic, allowing the connection of static HTML templates to any external data source or third-party charting library without requiring custom adapter code. It uses a template-driven approach to separate the visual structure of the dashboard from the underlying data.

The capabilities cover the assembly
- [amruthpillai/reactive-resume](https://awesome-repositories.com/repository/amruthpillai-reactive-resume.md) (38,613 ⭐) — This project is a web-based platform designed for creating, managing, and sharing professional resumes. It functions as a structured document builder that integrates artificial intelligence to assist with content generation, editing, and analysis. Users can maintain a collection of resumes, customize their visual presentation through various templates, and export them into multiple formats for job applications.

The platform distinguishes itself through its autonomous AI agent capabilities, which can perform research, suggest incremental edits, and apply data patches directly to documents. It
- [simonschllng/rm-sync](https://awesome-repositories.com/repository/simonschllng-rm-sync.md) (87 ⭐) — Sync script for reMarkable paper tablet.
- [datahub-project/datahub](https://awesome-repositories.com/repository/datahub-project-datahub.md) (12,141 ⭐) — DataHub is a metadata management platform designed to unify technical, operational, and business context across diverse data ecosystems. By utilizing a graph-based metadata model and an event-driven ingestion architecture, it creates a centralized source of truth that maps complex data relationships, lineage, and ownership. This foundational framework enables organizations to maintain a synchronized view of their data landscape, supporting both human-led discovery and automated data operations.

The platform distinguishes itself through its focus on grounding artificial intelligence and autono
- [googlehosts/hosts](https://awesome-repositories.com/repository/googlehosts-hosts.md) (20,619 ⭐) — This project is a curated collection of domain-to-IP mappings designed to bypass network censorship and restore access to restricted web services. It provides a set of host file entries and static domain mapping files that allow users to redirect blocked traffic to accessible mirrors or proxy servers.

The system functions by prioritizing local host entries over external DNS responses. It utilizes plain-text configuration files that are intended for manual injection into a system's hosts file to override default name resolution.

The project covers network connectivity and DNS override managem
- [github/docs](https://awesome-repositories.com/repository/github-docs.md) (18,951 ⭐) — GitHub Copilot is an AI-powered development platform designed to integrate large language models directly into coding environments. It functions as an interactive assistant and an agentic workflow orchestrator, enabling developers to automate code generation, perform automated code reviews, and execute complex, multi-step development tasks through natural language prompts.

The platform distinguishes itself through its autonomous agent capabilities, which allow for repository-level research, implementation planning, and code modifications across multiple files. It supports a modular architectu
- [langfuse/langfuse](https://awesome-repositories.com/repository/langfuse-langfuse.md) (29,190 ⭐) — Langfuse is an open-source observability and evaluation platform designed for language model applications. It provides a centralized system for tracking execution traces, monitoring performance metrics, and managing prompt templates. By capturing hierarchical units of work and telemetry data, the platform enables developers to debug complex application lifecycles and analyze token usage, latency, and model interactions in production environments.

The platform distinguishes itself through an integrated evaluation framework that allows for systematic benchmarking and automated scoring of model
- [specialunderwear/hosts.prefpane](https://awesome-repositories.com/repository/specialunderwear-hosts-prefpane.md) (1,632 ⭐) — a Cocoa GUI for /etc/hosts
- [dubinc/dub](https://awesome-repositories.com/repository/dubinc-dub.md) (23,722 ⭐) — This project is a comprehensive link management and marketing attribution platform designed for creating, tracking, and analyzing shortened URLs. It functions as a centralized hub for marketing analytics, providing tools to monitor link performance, visualize conversion funnels, and manage affiliate programs through a unified dashboard.

The platform distinguishes itself by integrating advanced attribution modeling and partner management directly into the link infrastructure. It supports complex marketing workflows, including automated commission calculations, fraud detection, and payout distr
- [sfu-db/connector-x](https://awesome-repositories.com/repository/sfu-db-connector-x.md) (2,561 ⭐) — Connector-X is a high-performance SQL data extraction library and bridge for transferring relational database records into memory-efficient data structures. It functions as a parallel database connector and federated query engine capable of executing and joining queries across multiple remote database connections to aggregate data locally.

The project distinguishes itself through a zero-copy approach to data loading, which transfers SQL query results into memory structures without duplicating data. It maximizes throughput by partitioning SQL queries into threads, employing parallel columnar a
- [chartdb/chartdb](https://awesome-repositories.com/repository/chartdb-chartdb.md) (21,286 ⭐) — ChartDB is a database schema visualizer and entity-relationship diagramming platform designed to help developers understand, design, and document complex data architectures. It functions as a visual workspace where users can create and modify database schemas, define table attributes, and map foreign key relationships. By parsing database metadata or SQL scripts, the tool generates interactive diagrams that provide a clear overview of structural interdependencies and data associations.

The platform distinguishes itself through its focus on automated documentation and schema synchronization. I
- [microsoft/data-science-for-beginners](https://awesome-repositories.com/repository/microsoft-data-science-for-beginners.md) (35,657 ⭐) — This project is a comprehensive educational curriculum designed to teach the fundamental concepts, workflows, and tools of data science. It provides a structured learning path that covers the end-to-end data science lifecycle, including data acquisition, maintenance, processing, and pattern discovery, while grounding theoretical knowledge in practical, real-world applications.

The curriculum distinguishes itself through a data-driven pedagogical design that utilizes interactive, notebook-based lessons. By combining narrative text with live code blocks, the platform allows learners to experime
- [mementum/backtrader](https://awesome-repositories.com/repository/mementum-backtrader.md) (20,462 ⭐) — Backtrader is a Python framework designed for the development, backtesting, and live execution of algorithmic trading strategies. It provides a comprehensive environment for quantitative finance, allowing users to simulate trading logic against historical market data or connect directly to brokerage platforms for automated real-time trading.

The project distinguishes itself through a unified event-driven architecture that treats backtesting and live trading with the same API. This consistency is supported by a flexible data-feed abstraction layer that normalizes diverse financial sources, ena
- [langchain-ai/langchainjs](https://awesome-repositories.com/repository/langchain-ai-langchainjs.md) (17,818 ⭐) — LangChain.js is a framework for building, executing, and monitoring stateful agentic applications. It provides an orchestration engine that models workflows as directed graphs, allowing developers to connect language models, data sources, and external tools into modular, multi-step processes.

The platform distinguishes itself through its focus on stateful execution and human-in-the-loop control. It manages agent lifecycles by persisting execution state across threads, enabling fault tolerance and the ability to pause workflows at designated breakpoints for manual review or modification. This
- [yizhongw/self-instruct](https://awesome-repositories.com/repository/yizhongw-self-instruct.md) (4,602 ⭐) — This repository contains code and data for the Self-Instruct paper, a method for aligning pretrained language models with instructions.
- [openbb-finance/openbb](https://awesome-repositories.com/repository/openbb-finance-openbb.md) (69,583 ⭐) — OpenBB is a financial data platform and investment research terminal designed to aggregate, normalize, and distribute market data across analytical workflows. It functions as a comprehensive ecosystem that bridges disparate financial data providers with custom applications, spreadsheets, and internal modeling infrastructure.

The platform distinguishes itself through a provider-based data abstraction layer that normalizes heterogeneous financial APIs into a consistent, schema-driven format. This architecture supports quantitative research automation and the construction of interactive, widget-
- [gravitl/netmaker](https://awesome-repositories.com/repository/gravitl-netmaker.md) (11,630 ⭐) — Netmaker is a platform for automating and managing virtual mesh networks built on WireGuard. It functions as a centralized control plane that orchestrates encrypted, peer-to-peer tunnels across distributed infrastructure, including cloud environments, on-premise data centers, and containerized clusters. By automating the configuration of routing tables and access policies, the system enables secure, private connectivity between diverse devices and services without requiring manual network administration.

The platform distinguishes itself through its focus on zero-trust network access and soft
- [edtechre/pybroker](https://awesome-repositories.com/repository/edtechre-pybroker.md) (3,191 ⭐) — pybroker is a Python algorithmic trading framework and quantitative technical analysis library designed for developing, testing, and optimizing trading strategies using historical market data. It functions as a trading strategy backtester and a financial performance evaluator, providing a structured environment to simulate trading rules and analyze their statistical reliability.

The framework distinguishes itself through a market data integration layer that handles the fetching and caching of historical price data from external providers. It incorporates an event-driven backtesting engine and
- [diyago/gan-for-tabular-data](https://awesome-repositories.com/repository/diyago-gan-for-tabular-data.md) (0 ⭐) — Generative Networks are well-known for their success in realistic image generation. However, they can also be applied to generate tabular data. We introduce major improvements for generating high-fidelity tabular data giving oppotunity to try GANS, TimeGANs, Diffusions and LLM for tabular data…
- [jorgencr/alternative-and-responsible-investments](https://awesome-repositories.com/repository/jorgencr-alternative-and-responsible-investments.md) (9 ⭐) — Code and support files for ICM296 - Alternative and Responsible Investments
- [maplibre/maplibre-gl-js](https://awesome-repositories.com/repository/maplibre-maplibre-gl-js.md) (9,931 ⭐) — Maplibre GL JS is a WebGL map rendering engine and vector tile map library used to create interactive web maps. It serves as a web-based GIS visualization tool and an interactive map interface framework for rendering geographic data and vector tiles on web pages.

The library provides capabilities for 3D terrain rendering and the integration of custom 3D content. It supports complex geospatial data visualization through the use of heatmaps, clusters, and 3D extrusions, while allowing for custom map styling and environmental effect customization.

The system covers a broad range of functional a
- [jeric-x/syncclipboard](https://awesome-repositories.com/repository/jeric-x-syncclipboard.md) (3,940 ⭐) — SyncClipboard is a cross-platform synchronization service and server designed to share clipboard content and history across multiple devices. It utilizes a central server and a REST-based API to facilitate the exchange of data between custom client implementations.

The system features a self-hosted synchronization server and a command-line interface for controlling clipboard data and executing application commands via the terminal. It includes an image format converter that downloads original source files and transforms modern image formats into compatible types to ensure successful pasting a
- [dfsramos/wezterm-sync](https://awesome-repositories.com/repository/dfsramos-wezterm-sync.md) (2 ⭐) — A WezTerm plugin that syncs your config to a private GitHub Gist — keeping it in sync across multiple machines with no dotfiles setup required.
- [healthchecks/healthchecks](https://awesome-repositories.com/repository/healthchecks-healthchecks.md) (9,891 ⭐) — Healthchecks is a heartbeat monitoring service and cron job monitoring tool designed to track the execution and success of scheduled tasks and systemd timers. It functions as a dead man switch, alerting users when expected periodic signals from remote processes fail to arrive.

The system accepts health signals via HTTP and SMTP, allowing it to track infrastructure heartbeats from sources ranging from CI/CD workflows to network routers. It distinguishes itself by supporting the capture of diagnostic data, including exit codes and execution logs, and by calculating the duration between start an
- [theoxo/self-repair](https://awesome-repositories.com/repository/theoxo-self-repair.md) (0 ⭐) — This is is the accompanying repository for the paper Is Self-Repair a Silver Bullet for Code Generation?, presented at the Twelfth International Conference on Learning Representations (Vienna, May 2024). It contains source code used to run the experiments; the resulting data; as well as scripts…
- [gitroomhq/postiz-app](https://awesome-repositories.com/repository/gitroomhq-postiz-app.md) (32,271 ⭐) — Postiz is an open-source social media management platform designed to centralize the scheduling, publishing, and analysis of content across diverse social networks, community forums, and blogging platforms. It functions as a unified hub where users can coordinate, review, and distribute content through a shared team workspace, while leveraging integrated artificial intelligence to assist in drafting text and generating multimedia assets.

The platform distinguishes itself through a modular architecture that utilizes a provider-specific adapter pattern to ensure consistent content distribution
- [bluesky-social/social-app](https://awesome-repositories.com/repository/bluesky-social-social-app.md) (18,063 ⭐) — This project provides a comprehensive implementation of the AT Protocol, serving as a framework for building decentralized social networking applications. It enables the creation of distributed data repositories where users maintain cryptographic ownership of their identity and content, allowing for portable accounts that can be migrated between independent servers without central authority intervention.

The platform distinguishes itself by decoupling content hosting from discovery through modular algorithmic curation. Users can select third-party services to filter and organize their feeds,
- [nylas/sync-engine](https://awesome-repositories.com/repository/nylas-sync-engine.md) (3,498 ⭐) — :incoming_envelope: IMAP/SMTP sync system with modern APIs
- [vanna-ai/vanna](https://awesome-repositories.com/repository/vanna-ai-vanna.md) (22,707 ⭐) — Vanna is a Python framework designed to build conversational interfaces that translate natural language into executable database queries. It functions as an enterprise-grade toolkit that connects language models to relational databases, allowing users to retrieve information through conversational prompts rather than manual code. The system maintains context across interactions by utilizing vector databases to store historical query patterns and schema metadata.

The framework distinguishes itself through a focus on security and schema-aware generation. It incorporates granular access control,
- [otavioschwanck/telescope-alternate.nvim](https://awesome-repositories.com/repository/otavioschwanck-telescope-alternate-nvim.md) (107 ⭐) — Alternate between common files using pre-defined regexp.  Just map the patterns and starting navigating between files that are related.
- [docmost/docmost](https://awesome-repositories.com/repository/docmost-docmost.md) (19,049 ⭐) — Docmost is an open-source knowledge management system designed as a collaborative documentation platform for teams. It functions as an enterprise wiki that centralizes organizational information into structured, searchable workspaces, enabling users to create, organize, and share content through a hierarchical system of spaces and pages.

The platform distinguishes itself by integrating artificial intelligence directly into the documentation lifecycle. It utilizes vector-based semantic search to allow for natural language queries across stored content and provides AI-assisted tools for draftin
- [standardnotes/app](https://awesome-repositories.com/repository/standardnotes-app.md) (6,519 ⭐)
- [stevenblack/hosts](https://awesome-repositories.com/repository/stevenblack-hosts.md) (30,553 ⭐) — This project provides a system-wide content filtering utility that controls network traffic by redirecting domain resolution requests to local null addresses. By mapping unwanted hostnames to these addresses at the operating system level, it effectively blocks connections to advertising, tracking, and malicious domains across all applications on a machine.

The core of the system is a data-driven build pipeline that aggregates multiple curated source lists into a single, unified configuration file. This process is highly customizable, allowing users to employ declarative filtering logic throug
- [electric-sql/electric](https://awesome-repositories.com/repository/electric-sql-electric.md) (9,909 ⭐) — Electric is a Postgres data synchronization engine and replication proxy designed to enable local-first software. It replicates data from Postgres databases to client-side stores in real time using logical replication, allowing applications to maintain a local embedded database for offline access and low-latency updates.

The system distinguishes itself by using shapes to filter and authorize specific subsets of database rows and columns before streaming them to clients or edge workers. It further supports multi-user collaboration by integrating a conflict-free replicated data type framework t
