30 open-source projects similar to datastacktv/data-engineer-roadmap, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Data Engineer Roadmap alternative.
This project is a curated educational curriculum and technical skill roadmap designed to guide learners through the core competencies required for professional data science roles. It provides a structured sequence of educational materials and tutorials, arranging prerequisite skills and advanced topics into a dependency-based learning path. The curriculum covers specific training tracks for data science fundamentals, machine learning study plans, and data engineering guides. These tracks focus on the theoretical knowledge and practical skills needed to manage data pipelines, apply statistics
CS-Xmind-Note is a collection of structured mind maps and conceptual diagrams serving as a comprehensive knowledge base for computer science fundamentals. It functions as an academic reference and study guide, organizing core subjects into a visual mapping of interdependent technical concepts. The project utilizes an XMind-compatible schema to model complex domains through hierarchical nodes and relational concept mapping. This approach allows for the visual representation of technical layers, linking hardware specifications to software abstractions. The knowledge base covers several primary
This project is a specialized collection of study guides, curriculum references, and implementation libraries designed for computer science academic certifications. It serves as a comprehensive study guide and preparation kit for the Chinese postgraduate computer science entrance examination, providing a curated reference for the core curriculum. The repository distinguishes itself by combining theoretical study materials with a practical data structure implementation library. This includes standard C code templates for fundamental structures such as linked lists, stacks, queues, and binary t
CS-Notes is an AI-powered note organizer and computer science knowledge base. It serves as a technical learning curriculum and a structured collection of study materials covering fundamentals such as algorithms, operating systems, and machine learning. The project utilizes large language models to automatically categorize and arrange technical documentation through AI-driven workflows. It organizes these topics into a hierarchical knowledge graph of linked nodes to support self-directed study. Content is stored as markdown flat files for version control and text editing. These files are conv
Psyduck is a technical knowledge repository and study guide designed for mastering core computer science principles. It serves as a curated collection of learning materials, technical notes, and conceptual diagrams focused on software engineering and system design. The project utilizes XMind to organize complex technical information into hierarchical visual nodes. This structured approach represents relationships between computer science concepts through mind maps and visual-hierarchical data modeling. The knowledge base covers fundamental domains including operating systems, networking, and
This project is a community-curated knowledge base and directory designed to support software developers and designers. It functions as a centralized hub for technical resources, offering structured learning paths, development utilities, and professional networking opportunities through a version-controlled repository. The platform distinguishes itself through a decentralized curation model, where the entire knowledge base is maintained via peer-reviewed contributions. By utilizing a git-based workflow, the community collectively verifies and updates technical information, ensuring the direct
This project is an interactive programming curriculum and educational system designed to teach computer science and software engineering. It provides a structured set of courses and professional roadmaps focused on backend engineering, DevOps, and systems fundamentals. The platform is distinguished by an AI-powered coding tutor that provides Socratic guidance and contextual hints to help students find solutions independently. It features a browser-based code sandbox using WebAssembly to eliminate local environment setup, alongside automated test-based grading and spaced-repetition logic to re
This project is a comprehensive study kit and resource archive for the Structure and Interpretation of Computer Programs (SICP) course. It serves as a curated learning path for studying functional programming and the fundamentals of program construction, providing a centralized directory of textbooks, tutorials, and instructional materials. A primary focus of the repository is multilingual accessibility, specifically providing Chinese translations of English lecture subtitles and transcripts. These translated resources are mirrored across multiple video hosting platforms and cloud storage pro
Skill-map is a static site generator designed to document and visualize engineering competency frameworks. It provides a structured system for defining technical skills, learning milestones, and career progression paths, allowing organizations to map professional growth across specialized domains and technology stacks. The project utilizes a dependency-graph data model to represent the logical progression of technical mastery, which is defined through human-readable configuration files. This approach enables teams to audit collective technical capabilities, identify knowledge gaps, and provid
This project is a comprehensive educational curriculum designed to build proficiency across modern infrastructure, cloud-native technologies, and systems administration. It functions as a reference library and interview preparation resource, offering a structured collection of conceptual questions, practical coding challenges, and hands-on scenarios that cover the full spectrum of software delivery and operational workflows. The repository distinguishes itself through a modular, domain-specific structure that links instructional problem statements with verified implementation examples. By emp
This is an interactive notebook-based course that teaches machine learning from Python fundamentals through deep learning and natural language processing. It uses real datasets and multiple frameworks within a structured, hands-on curriculum that combines concise explanations with executable code cells, built-in datasets, and embedded exercise checkpoints. Learning progresses through data preparation and exploration, classical machine learning workflows, computer vision with convolutional neural networks, and natural language processing with deep learning, all delivered as a cohesive progressi
This project is a comprehensive computer science exam study guide and curriculum resource designed for postgraduate entrance examination preparation. It functions as a digital knowledge base and resource repository, providing curated notes, textbooks, and study materials covering core subjects such as operating systems, computer networking, data structures, and computer organization. The system features a coding pattern library that standardizes common data structure solutions and implementation techniques. It includes a framework for exam trend analysis, which evaluates historical test data
This project is a centralized repository and academic resource aggregator designed to guide students through a structured computer science curriculum. It provides a comprehensive roadmap of foundational courses and technical materials, helping learners navigate the transition from introductory programming to advanced software engineering proficiency. The repository distinguishes itself through a community-driven approach, where study paths and resource collections are refined and expanded via peer feedback and collaborative contributions. By organizing high-quality lecture notes, assignments,
This project is a comprehensive study guide and knowledge base for deep learning, machine learning, and the associated mathematics required for artificial intelligence. It functions as a curated collection of technical questions and answers designed to help users study fundamental theories and practical applications. The repository serves as a technical interview preparation resource by aggregating industry-standard questions and core knowledge points. It provides a structured reference for reviewing neural network architectures and specific techniques used in computer vision, such as object
This project serves as a comprehensive, community-driven directory for the single-board computer ecosystem. It functions as a centralized index of software, hardware projects, operating systems, and technical documentation designed to assist enthusiasts and system administrators in managing and developing custom applications. The repository distinguishes itself through a structured approach to resource curation, utilizing markdown-based content management to organize technical guides, tutorials, and hardware interfacing references. It incorporates automated validation tools to ensure the inte
This repository is a comprehensive collection of fully worked solutions to exercises and problems from the standard algorithms textbook by Cormen, Leiserson, Rivest, and Stein (CLRS). It serves as an educational reference for algorithm design and analysis, providing step-by-step reasoning, pseudocode, and mathematical proofs for a wide range of topics. The content spans core computer science areas: algorithm analysis with asymptotic notation, recurrence solving, and amortized cost analysis; data structure implementation and operations for binary search trees, red-black trees, B-trees, Fibonac
Gentelella is a collection of pre-configured interface templates and a component library designed for building administration panels, data dashboards, and internal management consoles. It provides a Bootstrap 5 based framework that includes accessible web interface templates and PWA-ready dashboard shells. The project features specialized templates for data visualization, utilizing modular chart factories to render line, bar, radar, and heatmap visualizations. It includes a set of ready-to-use interface elements for enterprise prototyping, such as kanban boards, file managers, and interactive
This project is a static site generator platform and hosting system that transforms structured content into live websites. It utilizes a version control website deployment workflow to turn a repository into a public site by hosting static files directly from a branch. The system implements git-based content management, where website articles and settings are handled through commit histories. It supports a collaborative site workflow that enforces pull request reviews to ensure content quality before changes are merged into the production environment. The platform covers the provisioning and
Grav is a flat-file content management system that eliminates the need for a traditional database by storing site content and configuration in human-readable Markdown and YAML files. Built as a modular PHP web framework, it uses a hierarchical page routing system where the physical directory structure directly determines the site's URL paths. The platform is distinguished by its event-driven plugin architecture and a command-line interface that prioritizes system administration, deployment, and maintenance tasks. It utilizes a blueprint-driven system to generate administrative forms from stru
Developer Roadmap is a community-driven platform that provides structured, graph-based learning paths for software engineering. It serves as a comprehensive knowledge repository where technical domains are organized into visual sequences to guide professional skill acquisition and career growth. The project distinguishes itself through a collaborative ecosystem that enables users to contribute roadmaps, curate industry best practices, and maintain professional profiles. It integrates diagnostic assessment frameworks to evaluate technical proficiency, helping developers identify knowledge gaps
This repository is a comprehensive educational resource for mastering fundamental computer science concepts through Python. It provides a structured collection of source code implementations for classic data structures and algorithms, serving as a practical guide for building technical proficiency and preparing for coding interviews. The project distinguishes itself by integrating visual aids and diagrams that map complex execution steps to clarify how data structures function. This visual approach is paired with a rigorous automated unit testing framework, which validates the correctness of
This is a collection of classic computer science algorithms and data structures implemented from scratch in JavaScript. The project provides reference implementations of fundamental concepts including sorting algorithms, binary search, linked lists, and binary search trees, all built as standalone pure functions with no external dependencies. The implementations cover a range of data structures, including singly-linked, doubly-linked, and circular linked lists with full traversal and mutation operations, as well as binary search trees supporting insertion, deletion, and search. Sorting algori
This repository is a collection of standard computer science algorithms and data structure implementations written in Java. It serves as a learning resource for developers to study fundamental programming concepts, efficient logic management, and common problem-solving techniques. The project organizes these solutions into modular, independent components that utilize the Java standard library and static type system. Each implementation is verified through unit tests, ensuring that the code adheres to expected outcomes for various mathematical and logical operations. The collection covers a b
This project is a computer science educational resource and a library of common data structures and algorithms implemented in Swift. It serves as a practical reference for studying complexity and efficiency through solved algorithmic problems and conceptual guides. The collection includes implementations of linear and hierarchical data structures, such as stacks, queues, linked lists, and trees. It covers a wide range of computational patterns, including graph and pathfinding implementations, mathematical numerical methods, and data compression techniques. The project also provides implement
This project is a comprehensive career transition guide and job navigator designed for software engineers moving from the private sector into government and public service roles. It serves as a structured resource for evaluating job eligibility, comparing professional benefits, and selecting government positions that value technical backgrounds. The repository provides specialized guidance for the public sector recruitment process, including a handbook for navigating political reviews, medical screenings, and background vetting. It also features a civil service exam guide with study methodolo
This project is a structured learning path and skill roadmap designed to guide developers through the Android ecosystem. It serves as a mobile engineering curriculum and career guide, organizing the technical knowledge required to build scalable applications from basic concepts to advanced professional skills. The guide differentiates itself by providing a hierarchical map of required knowledge, sequencing technical topics into a logical order. It includes a curated collection of study resources and technical documentation, along with tools to track learning progress through the specified cur
Algorithmica is an open-access knowledge base and computer science textbook designed for the publication of structured instructional materials. It functions as an educational content management system and a collaborative documentation site for hosting technical knowledge and course drafts for public consumption. The platform enables collaborative technical writing and authoring through version control and web editors, utilizing an automated pipeline to generate previews of pending changes. The system manages large collections of learning materials using static site generation, markdown-drive
This repository serves as a comprehensive educational resource and technical reference for implementing fundamental data structures and algorithms using JavaScript. It provides a structured guide to mastering core computer science concepts, focusing on the practical application of data organization techniques and problem-solving strategies within the JavaScript ecosystem. The materials cover the implementation of essential storage patterns, including linked lists, trees, and graphs, alongside the analysis of algorithmic efficiency. By evaluating execution time and memory usage through asympto
This project consists of technical instructional guides and a structured computer science curriculum designed to teach the construction of a full computing stack from transistors up to a web browser. It provides a 12-week educational course that covers the fundamental domains of computer architecture and digital logic design. The curriculum is distinguished by its first-principles approach, guiding the learner through the engineering of CPUs, FPGAs, and PCBs using digital logic and hardware description languages. It includes specific syllabi for building a hardware design, a TCP/IP network st
This project is a comprehensive collection of common computer science algorithms and data structures implemented in Swift. It serves as an educational reference and library for studying computational complexity, algorithmic logic, and data structure engineering through practical code examples. The repository provides a wide suite of data structure implementations, including various types of linked lists, heaps, hash tables, and an extensive range of hierarchical trees such as Red-Black, B-Tree, and Splay trees. It also covers diverse sorting and searching techniques, from basic bubble sort to