30 open-source projects similar to dataexpert-io/data-engineer-handbook, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Data Engineer Handbook alternative.
This project is a community-driven directory that aggregates and categorizes high-quality technical resources, tools, and learning materials. It functions as a centralized knowledge management repository, designed to help developers navigate the software development landscape by providing structured access to curated lists and external project references. The directory relies on a collaborative, peer-reviewed workflow where external contributors submit and maintain links through a version-controlled system. This community-maintained approach ensures that the information remains current and re
This project is a curated technical resource directory and software engineering learning roadmap. It serves as a computer science study curriculum and professional development framework, providing staged progressions for mastering programming languages, data structures, and full-stack development. The repository functions as a career preparation guide, offering strategic frameworks for resume building, technical interview practice, and internship application targeting. It includes a system for identifying income opportunities and managing a professional social presence to increase visibility.
This project is a curated knowledge repository that serves as a comprehensive reference guide for web development. It organizes high-quality resources, documentation, and technical links into a structured collection designed to assist developers in navigating the frontend ecosystem and staying updated with evolving industry standards. The repository utilizes a directory-based information architecture, categorizing complex technical topics into nested folders to ensure predictable navigation. By leveraging a git-versioned knowledge base, the project maintains a complete historical audit trail
This project is a comprehensive, community-driven knowledge repository that serves as a centralized hub for data science resources. It provides a structured index of educational materials, software packages, and professional development tools designed to support both students and practitioners in navigating the data science landscape. The repository distinguishes itself through a hierarchical taxonomy that organizes a vast collection of external links into a human-readable, markdown-based document. By relying on distributed contributions, the project maintains an up-to-date snapshot of the fi
Developer Roadmap is a community-driven platform that provides structured, graph-based learning paths for software engineering. It serves as a comprehensive knowledge repository where technical domains are organized into visual sequences to guide professional skill acquisition and career growth. The project distinguishes itself through a collaborative ecosystem that enables users to contribute roadmaps, curate industry best practices, and maintain professional profiles. It integrates diagnostic assessment frameworks to evaluate technical proficiency, helping developers identify knowledge gaps
This project is a curated knowledge repository designed to support the professional development of software engineers. It functions as a comprehensive index of industry best practices, methodologies, and design principles, providing a structured roadmap for those seeking to improve their technical skills, architectural decision-making, and career trajectory. The repository distinguishes itself through a community-driven approach, relying on peer-reviewed contributions to maintain an up-to-date collection of resources. It organizes vast amounts of technical information into a hierarchical taxo
This project is a community-maintained directory of resources for building desktop applications with Electron. It serves as a centralized knowledge base, aggregating high-quality tools, learning materials, and software examples to assist developers in mastering the framework and improving their development workflows. The repository functions as a curated ecosystem index, relying on peer review and community contributions to verify and organize information. By maintaining a structured collection of articles, books, boilerplates, and third-party components, it provides a comprehensive reference
This project is a comprehensive cybersecurity knowledge repository that provides a structured collection of technical documentation, training materials, and professional development roadmaps. It serves as a centralized resource for practitioners to navigate complex security disciplines, ranging from offensive and defensive fundamentals to specialized infrastructure and application security. The repository distinguishes itself through a modular resource framework that enables users to construct isolated cyber range environments for hands-on practice. It also features a specialized reference gu
This project is a community-driven repository that serves as a comprehensive reference guide for mastering the command line interface on macOS. It functions as a curated index of high-quality tools, documentation, and best practices designed to assist users in navigating terminal environments and optimizing their development workflows. The directory distinguishes itself through a decentralized, peer-reviewed curation model. By leveraging a structured submission workflow, the content is continuously updated and vetted by contributors to ensure the accuracy and relevance of the listed resources
This project is an open-source educational curriculum designed to provide comprehensive training in data engineering. It focuses on building scalable data pipelines and managing cloud-native infrastructure through a structured, self-paced program that combines technical explanations with hands-on practical exercises. The curriculum distinguishes itself by emphasizing industry-standard methodologies, specifically teaching students how to implement infrastructure as code and manage data workflows through orchestration tools. By utilizing container-based environment isolation and declarative con
GoBooks is a curated directory and index of technical literature designed to support software engineering development and professional growth. The project functions as a structured repository that organizes educational materials into learning paths, allowing developers to discover resources tailored to their specific proficiency levels and career goals. The collection is centered on the Go programming language, providing a progression of books and guides that range from fundamental syntax to advanced system design patterns. By utilizing a structured classification system, the repository enabl
Everything that makes working with databases easier
A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin
Cookbook is a comprehensive knowledge base and reference repository for data engineering. It serves as a centralized directory for data architecture patterns, professional career roadmaps, and a curated collection of public datasets. The project provides a structured guide for transitioning into specialized data engineering roles through skill-matrix mapping and technical interview preparation. It further distinguishes itself by documenting real-world industry case studies and decomposing large-scale industrial implementations into repeatable architectural patterns. The repository covers a b
This project is a curated knowledge repository that serves as a comprehensive index for software architecture and design patterns. It functions as a community-driven learning resource, providing developers with structured access to high-quality documentation, books, and articles focused on mastering complex design principles and industry-standard best practices. The directory distinguishes itself through a hierarchical taxonomy that organizes technical concepts into logical domains, ranging from cloud architecture and distributed systems to front-end development and machine learning. By relyi
This project is a community-driven knowledge base that serves as a comprehensive guide for mastering version control operations and platform-specific workflows. It functions as a developer productivity resource, consolidating essential information on command-line operations, repository management, and advanced interface techniques into a single, version-controlled document. The guide distinguishes itself by providing actionable insights into platform-specific automation and navigation. It covers the use of keyboard shortcuts to accelerate daily tasks, the application of advanced search syntax
This repository is a curated study guide and knowledge base designed to assist developers in preparing for software engineering job interviews within the Android ecosystem. It organizes essential programming topics, language-specific features, and mobile architecture patterns into a structured format for professional review and skill assessment. The collection covers a broad range of technical domains, including system design principles, performance optimization, and core development concepts. By categorizing these topics, the resource provides a systematic way for users to practice and impro
V is a statically typed, compiled programming language designed for high-performance systems development. It prioritizes memory safety and execution speed by enforcing strict type checking and immutable defaults, while generating native machine code for multiple hardware architectures. The language is built around an integrated toolchain that includes a compiler, package manager, formatter, and testing utilities within a single executable, facilitating rapid development cycles. What distinguishes V is its focus on developer productivity and interoperability. It provides a direct interface for
This project serves as a comprehensive directory of open-source libraries, tools, and technical documentation designed for building and managing infrastructure on the Amazon Web Services platform. It functions as a centralized knowledge base, aggregating software development kits, command-line utilities, and editor extensions to assist developers in cloud-native application development. The directory distinguishes itself through a categorical taxonomy that organizes disparate technical resources into a structured hierarchy. It incorporates community-driven metadata aggregation and automated m
InterviewGuide is a comprehensive technical interview preparation platform that covers the full spectrum of software engineering recruitment, from foundational computer science concepts through to offer negotiation. It provides structured learning paths across algorithms, operating systems, databases, networking, and programming languages, with a particular emphasis on C++ and Go. The platform aggregates real interview experiences and company-specific questions from major tech employers, offering candidates a searchable database of past written exam problems and detailed accounts of actual int
This project provides a comprehensive, modular framework for auditing and hardening personal digital and physical security. It functions as a structured, platform-agnostic knowledge base that breaks down complex security standards into granular, actionable tasks. By utilizing a static documentation architecture, the project ensures that its guidance remains accessible and transparent, allowing users to track their security posture incrementally through a persistent, manual progress-tracking system. The project distinguishes itself by bridging the gap between digital cybersecurity and physical
This project is a curated knowledge repository that aggregates high-quality technical blogs and engineering insights from industry leaders. It serves as a comprehensive technical learning resource, providing a centralized index of companies, individual experts, and technologies to help professionals discover reliable sources of software development knowledge. The repository distinguishes itself through a community-driven approach, relying on external contributions to maintain and expand its knowledge base. By utilizing markdown-based content curation, the project ensures that all information
This project is a community-driven knowledge repository and software resource directory focused on artificial intelligence and professional productivity tools. It functions as a markdown-based knowledge base that organizes information into a hierarchical taxonomy, allowing users to discover, compare, and evaluate software solutions based on specific business and technical requirements. The platform distinguishes itself through a decentralized peer-review model, where the directory is maintained and updated by the community via a pull-request workflow. This collaborative approach ensures that
This project is a community-curated directory serving as a central hub for resources related to the Next.js framework. It provides a structured collection of learning materials, starter templates, and third-party extensions designed to assist developers in building and maintaining web applications. The repository distinguishes itself through a community-driven contribution model, where developers maintain and update the collection via version-controlled pull requests. This collaborative approach ensures the directory remains an organized taxonomy of real-world examples, architectural patterns
This project is an open-source knowledge repository that serves as a comprehensive technical interview question bank for backend engineering roles. It provides a structured resource for hiring managers and candidates to evaluate proficiency in software design, architectural patterns, and core engineering principles through a curated collection of discussion topics and coding challenges. The repository functions as a programming paradigm evaluation tool, enabling the assessment of a candidate's understanding of object-oriented, functional, and procedural techniques. It distinguishes itself by
This project is a community-driven knowledge base that archives historical statements and insights from influential technology leaders. It functions as a reference library designed to preserve the intellectual history of the information technology field and the evolution of computing philosophy. The repository utilizes a static, flat-file data structure formatted in markdown, which eliminates the need for complex database management or server-side runtime environments. By leveraging a version control system, the project maintains a complete audit trail of all additions and editorial changes,
30-seconds-of-code is a comprehensive knowledge base and programming snippet library designed to support software engineering education and professional development. It provides a curated collection of reusable code units and technical guides that help developers master core language mechanics, design patterns, and architectural philosophies. The project distinguishes itself by offering a wide-ranging library of algorithmic solutions and web development patterns that are organized into modular, independently testable units. It emphasizes functional programming paradigms and declarative logic,
This project is a community-maintained open source directory that serves as a comprehensive index of React components and libraries. It functions as a technical knowledge base, mapping common development challenges to vetted third-party solutions to help developers accelerate their frontend workflows and avoid reinventing standard interface elements. The directory distinguishes itself through a decentralized, hyperlink-centric architecture that avoids hosting code locally, instead pointing users directly to external repositories. This content is curated through a collaborative model where com
This project is a curated directory of reusable components and integration scripts designed to extend the functionality of continuous integration and deployment pipelines. It serves as a comprehensive knowledge base for developers, providing a structured index of community-vetted tools that assist in implementing best practices for software workflows and automation. The directory distinguishes itself through a community-driven approach, relying on external contributions to maintain an up-to-date catalog of resources. It organizes these tools into a hierarchical taxonomy, allowing users to nav