# Compiler Construction Guides

> Search results for `build your own compiler step by step` on awesome-repositories.com. 120 total matches; showing the first 50.

Explore on the web: https://awesome-repositories.com/q/build-your-own-compiler-step-by-step

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [this search on awesome-repositories.com](https://awesome-repositories.com/q/build-your-own-compiler-step-by-step).**

## Results

- [yukimotopress/programming-blockchains-step-by-step](https://awesome-repositories.com/repository/yukimotopress-programming-blockchains-step-by-step.md) (680 ⭐) — Programming Blockchains Step-by-Step book / guide. Let's build blockchains from scratch (zero) step by step. Let's start with crypto hashes...
- [swiftlang/swift](https://awesome-repositories.com/repository/swiftlang-swift.md) (70,051 ⭐) — Swift is a high-performance, general-purpose programming language designed for safety and speed. It features a modular compiler front-end that transforms source code into optimized machine binaries, utilizing a value-oriented type system that prioritizes predictable state management through value and reference types. The language is built on a task-based concurrency model that schedules asynchronous operations across multicore hardware to ensure data race safety.

The project distinguishes itself through a native, bi-directional interoperability mechanism that allows for direct integration wit
- [codehubapp/codehub](https://awesome-repositories.com/repository/codehubapp-codehub.md) (22,662 ⭐) — CodeHub is a mobile application designed for managing remote repositories and reviewing code changes directly from a smartphone or tablet. It functions as a mobile client for GitHub, enabling users to browse repositories, monitor project progress, and interact with pull requests while away from a desktop computer.

Beyond its repository management capabilities, the application serves as a document conversion utility and software comparison platform. It provides tools for transforming files between various formats while maintaining formatting integrity, as well as resources for evaluating and r
- [jamiebuilds/the-super-tiny-compiler](https://awesome-repositories.com/repository/jamiebuilds-the-super-tiny-compiler.md) (28,525 ⭐) — This project is an educational compiler implementation and architecture demo. It serves as a small-scale C-style language compiler designed to demonstrate the fundamental stages of transforming source code into executable machine instructions.

The codebase functions as a tool for compiler architecture education and design prototyping. It illustrates the process of building an educational language implementation to help users understand the mechanics of parsing and code generation.

The implementation covers the primary stages of a compiler pipeline, including regular expression tokenization,
- [google-research/distilling-step-by-step](https://awesome-repositories.com/repository/google-research-distilling-step-by-step.md) (595 ⭐) — Code for paper Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes
- [maiot-io/zenml](https://awesome-repositories.com/repository/maiot-io-zenml.md) (5,452 ⭐) — ZenML is an extensible machine learning orchestration framework designed to manage the end-to-end lifecycle of data pipelines and AI agent workflows. It functions as a durable orchestrator that executes machine learning tasks as directed acyclic graphs, ensuring that every step is containerized for consistent performance across local, cloud, and hybrid infrastructure. By decoupling pipeline code from underlying compute and storage backends, the platform allows developers to define infrastructure-agnostic stacks that remain portable across diverse environments.

The project distinguishes itself
- [jwasham/coding-interview-university](https://awesome-repositories.com/repository/jwasham-coding-interview-university.md) (353,639 ⭐) — This project is a comprehensive educational roadmap designed to guide software engineers through the mastery of computer science fundamentals and technical interview preparation. It provides a structured, dependency-aware learning path that organizes complex computing concepts into a hierarchical curriculum, enabling users to build a professional engineering foundation through iterative study and practical implementation.

The curriculum distinguishes itself by integrating theoretical knowledge with professional development, offering a unified index of cross-referenced resources including book
- [yvettelau/step-by-step](https://awesome-repositories.com/repository/yvettelau-step-by-step.md) (700 ⭐) — 不积跬步无以至千里。
- [inngest/inngest](https://awesome-repositories.com/repository/inngest-inngest.md) (5,499 ⭐) — Inngest is a durable execution framework and event-driven automation engine designed to orchestrate background workflows. It enables developers to build resilient, stateful processes by memoizing function steps, ensuring that long-running tasks can automatically resume from the last successful operation after failures, timeouts, or infrastructure restarts.

The platform distinguishes itself through its event-driven architecture, which uses a schema-validated bus to trigger functions and coordinate complex, multi-step logic. It employs an onion-model middleware approach for cross-cutting concer
- [chainlit/chainlit](https://awesome-repositories.com/repository/chainlit-chainlit.md) (12,213 ⭐) — Chainlit is a Python framework designed for building and deploying interactive, stateful conversational AI interfaces. It provides a backend-driven platform that connects language models and agent frameworks to a web-based chat frontend, managing the complexities of session state, message history, and real-time communication.

The framework distinguishes itself by offering a component-based UI builder that allows developers to inject interactive widgets, rich media, and data visualizations directly into the chat stream. It supports the visualization of complex agent workflows, enabling users t
- [numba/numba](https://awesome-repositories.com/repository/numba-numba.md) (10,918 ⭐) — Numba is a just-in-time compiler that translates high-level Python functions into optimized machine code at runtime. By leveraging the LLVM compiler infrastructure, it provides a framework for accelerating numerical data processing and mathematical computations, enabling performance levels comparable to statically compiled languages.

The project distinguishes itself through its ability to perform type-inference-based specialization, which generates machine instructions tailored to the specific data types used during execution. It employs a lazy compilation pipeline that defers translation unt
- [buildthingsuseful/build-your-own-kafka](https://awesome-repositories.com/repository/buildthingsuseful-build-your-own-kafka.md) (65 ⭐) — Build Your Own Kafka
- [crystal-lang/crystal](https://awesome-repositories.com/repository/crystal-lang-crystal.md) (20,299 ⭐) — Crystal is a statically typed, compiled programming language designed for high performance and memory safety. It leverages an LLVM-based compiler to translate source code into optimized machine-executable binaries, while its type-inference-based static analysis enforces strict safety rules during the build process.

The language distinguishes itself through a fiber-based concurrent runtime that manages lightweight execution units for asynchronous input and output without blocking the main process. It also features a powerful compile-time macro system that allows for the inspection and transfor
- [expo/expo](https://awesome-repositories.com/repository/expo-expo.md) (50,111 ⭐) — Expo is a universal mobile framework designed to build native iOS and Android applications from a single codebase using web-standard technologies. It provides a comprehensive development environment that includes a unified runtime for testing, cloud-based infrastructure for compiling and signing native binaries, and automated tools for managing the entire mobile release lifecycle, including app store submission.

The framework distinguishes itself through a plugin-based native configuration engine that programmatically modifies project files, allowing developers to integrate native modules wit
- [creationix/step](https://awesome-repositories.com/repository/creationix-step.md) (2,194 ⭐) — An async control-flow library that makes stepping through logic easy.
- [chakra-ui/zag](https://awesome-repositories.com/repository/chakra-ui-zag.md) (5,121 ⭐) — Zag is a headless UI state machine library that provides unstyled component logic to ensure consistent behavior across different rendering engines. It functions as a toolkit for modeling complex user interface interactions using statecharts, decoupling the underlying logic and accessibility attributes from visual styling.

The library is framework-agnostic, utilizing specialized adapters to bridge its state machine logic with various JavaScript libraries such as React, Solid, Vue, and Svelte. This approach allows developers to maintain a single interaction model while applying any CSS or styli
- [webpack/webpack](https://awesome-repositories.com/repository/webpack-webpack.md) (65,834 ⭐) — Webpack is a module bundler that maps project dependencies into a directed acyclic graph to transform diverse file types into optimized, browser-ready assets. It functions as a build pipeline orchestrator, using entry points to recursively resolve imports and bundle modules, scripts, and static assets into a unified output.

The project is distinguished by its plugin-based architecture and loader-driven transformation pipeline. It utilizes an event-driven hook system that allows developers to intercept and modify the build process at specific lifecycle stages, enabling custom code transformati
- [peiyuanix/build-your-own-zerotier](https://awesome-repositories.com/repository/peiyuanix-build-your-own-zerotier.md) (603 ⭐) — Build your own layer-2 virtual switch in less than 300 lines of code
- [microsoft/typescript-go](https://awesome-repositories.com/repository/microsoft-typescript-go.md) (24,151 ⭐)
- [fastapi/fastapi](https://awesome-repositories.com/repository/fastapi-fastapi.md) (99,260 ⭐) — FastAPI is a web framework for building APIs with Python. It leverages standard language type hints to provide automatic data validation, request parsing, and interactive API documentation generation. The framework supports asynchronous request handling and manages execution contexts to prevent blocking the main event loop.

The project includes a dependency injection system that allows for the resolution and injection of reusable components into request handlers. This system supports request-scoped caching, lifecycle management, and integration with security mechanisms like OAuth2 and JSON We
- [ace-step/ace-step](https://awesome-repositories.com/repository/ace-step-ace-step.md) (4,088 ⭐) — ACE-Step is a high-fidelity audio synthesis system and diffusion model designed to generate music and vocals from text descriptions. It functions as a music generator and vocal synthesizer, using a diffusion transformer decoder to produce audio across various languages and genres.

The project provides tools for text-guided audio editing, including the ability to extend the duration of tracks, regenerate specific song segments, and perform latent-space audio inpainting to modify lyrics or styles. It also includes a framework for audio style fine-tuning using low-rank adaptation to adapt vocal
- [highlightjs/highlight.js](https://awesome-repositories.com/repository/highlightjs-highlight-js.md) (24,825 ⭐) — Highlight.js is a syntax highlighting library that automatically detects and applies color-coded styling to source code blocks within web pages. It functions as a language-agnostic formatting engine, utilizing a modular processor that applies consistent visual themes to diverse programming languages based on their specific grammatical rules. By decoupling the core parsing logic from language-specific definitions, the library provides a unified execution environment that operates without requiring internal knowledge of the target language.

The project is distinguished by its modular architectu
- [danistefanovic/build-your-own-x](https://awesome-repositories.com/repository/danistefanovic-build-your-own-x.md) (516,495 ⭐) — Master programming by recreating your favorite technologies from scratch.
- [jenkinsci/docker](https://awesome-repositories.com/repository/jenkinsci-docker.md) (7,530 ⭐) — This project is a containerized build automation system and self-hosted DevOps platform provided as a Docker image. It serves as a distributed build orchestrator and a Dockerized continuous integration and delivery server, ensuring consistent execution environments across different infrastructure.

The system distinguishes itself through a distributed execution model that separates a primary controller from multiple remote agents connected via SSH, TCP, or web sockets. It utilizes a modular extensibility framework that allows the core system functionality to be augmented through the installati
- [php/php-src](https://awesome-repositories.com/repository/php-php-src.md) (40,150 ⭐) — This project is the core source code for a general-purpose, server-side scripting language designed for web development. It provides a high-performance execution engine that parses and runs scripts to generate dynamic content, supported by a comprehensive standard library for data manipulation, networking, and system interaction. The repository serves as an open-source development platform where the language runtime and its interpreter are built, maintained, and evolved through community-driven governance.

The runtime is powered by a stack-based virtual machine that executes compiled bytecode
- [vitalets/playwright-magic-steps](https://awesome-repositories.com/repository/vitalets-playwright-magic-steps.md) (0 ⭐) — Auto-transform JavaScript comments into Playwright steps.
- [j3ssie/osmedeus](https://awesome-repositories.com/repository/j3ssie-osmedeus.md) (6,425 ⭐) — Osmedeus is a security workflow orchestration engine that coordinates AI agents, shell commands, and scanning tools through declarative YAML pipelines. It functions as a distributed security scanner, a declarative workflow automator, and an AI agent framework for security, enabling automated multi-step security analysis with conditional branching, parallel execution, and distributed workers.

The engine distinguishes itself through a hybrid runner model that executes workflow steps on the local host, inside Docker containers, or over SSH to remote machines, selected per step or module. It supp
- [fastapi/typer](https://awesome-repositories.com/repository/fastapi-typer.md) (19,632 ⭐) — This project is a Python framework for building command-line interfaces by converting standard functions into executable programs. It uses type hints to automatically infer and generate argument parsers, validation logic, and help documentation, allowing developers to define complex terminal applications through simple function signatures.

The framework distinguishes itself through a decorator-driven registration system that enables the construction of hierarchical command trees. It supports dependency injection to manage shared state and runtime configuration across subcommands, and it utili
- [eslint/eslint](https://awesome-repositories.com/repository/eslint-eslint.md) (27,349 ⭐) — This project is a static analysis engine designed to identify patterns, enforce coding standards, and automate code quality improvements in software projects. By parsing source code into structured abstract syntax trees, it enables deep programmatic inspection and the automated remediation of identified programming issues.

The engine functions as a pluggable linting framework, allowing developers to extend its core capabilities through a modular architecture. Users can inject custom rules, parsers, and processors to support non-standard file formats or domain-specific logic. This extensibilit
- [lukemathwalker/build-your-own-jira-with-rust](https://awesome-repositories.com/repository/lukemathwalker-build-your-own-jira-with-rust.md) (0 ⭐) — You will be working through a series of test-driven exercises, or koans, to learn Rust while building your own JIRA clone!
- [psf/black](https://awesome-repositories.com/repository/psf-black.md) (41,578 ⭐) — This project is an uncompromising, deterministic code formatter for Python. It functions by parsing source code into an abstract syntax tree and regenerating it according to a rigid, opinionated set of style rules. By automating the formatting process, it eliminates manual style debates and configuration overhead, ensuring that code remains consistent across entire projects regardless of the original input.

The tool distinguishes itself through its focus on speed and seamless integration into development workflows. It utilizes content-based file caching and parallel processing to maintain hig
- [thoughtworks/build-your-own-radar](https://awesome-repositories.com/repository/thoughtworks-build-your-own-radar.md) (2,549 ⭐) — This project is a technology radar visualization tool and dockerized static site generator. It transforms JSON or CSV datasets into an interactive technology map used to track the adoption status and maturity of tools and techniques across an organization.

The tool enables enterprise architecture mapping by organizing portfolios of technologies into categories and maturity levels. It supports custom technical taxonomies, allowing the definition of specialized rings and quadrants to match specific organizational evaluation criteria.

The system covers automated radar generation and technology
- [lemmynet/lemmy](https://awesome-repositories.com/repository/lemmynet-lemmy.md) (14,454 ⭐) — Lemmy is a self-hosted, federated discussion platform that enables the operation of independent, decentralized social networking servers. By implementing the ActivityPub protocol, it allows autonomous instances to exchange content, synchronize user interactions, and participate in a global, distributed network without centralized control.

The platform distinguishes itself through a decoupled architecture that separates the backend API from the frontend, facilitating the development of custom interfaces while maintaining unified user handles and cross-platform communication. It provides granul
- [dnspy/dnspy](https://awesome-repositories.com/repository/dnspy-dnspy.md) (28,993 ⭐) — dnSpy is a desktop application designed for the analysis, debugging, and modification of compiled .NET assemblies. It functions as an assembly analysis suite and decompiler, translating binary instruction streams back into readable source code to facilitate reverse engineering when original source files are unavailable.

The tool distinguishes itself through an integrated binary patching engine and metadata editor, which allow for the direct modification of executable logic and internal metadata tables. It supports in-process debugging instrumentation, enabling users to inject runtime hooks, s
- [a2aproject/a2a](https://awesome-repositories.com/repository/a2aproject-a2a.md) (24,404 ⭐) — A2A is a standardized framework designed to enable interoperability, discovery, and orchestration among independent artificial intelligence agents. It provides a common communication protocol that allows heterogeneous agents to exchange data, verify identities, and collaborate across diverse programming languages and computing environments. By establishing a unified messaging standard, the project facilitates the creation of complex, multi-agent workflows where tasks are routed and managed between specialized services.

The project distinguishes itself through a capability-based architecture t
- [ace-step/ace-step-1.5](https://awesome-repositories.com/repository/ace-step-ace-step-1-5.md) (6,002 ⭐) — ACE Step 1.5 is a local text-to-music generation and audio editing system that runs on consumer hardware. It transforms plain-language descriptions into full-length songs with lyrics, and can edit existing audio through cover generation, vocal removal, track separation, and selective repainting. The system supports multilingual prompts and lyrics in over 50 languages, and provides precise control over musical structure including duration, BPM, key, and time signature.

The project distinguishes itself through a dual-stream diffusion architecture that processes separate latent streams for vocal
- [fish-shell/fish-shell](https://awesome-repositories.com/repository/fish-shell-fish-shell.md) (33,687 ⭐) — This project is an interactive command-line shell designed to provide a user-friendly terminal environment for system interaction and task automation. It functions as both an interactive interface for developers and a scripting runtime, featuring a clean, consistent syntax that simplifies command execution and process management.

The shell distinguishes itself through a focus on discoverability and real-time feedback. It includes a predictive suggestion engine that offers command completions and history-based hints as you type, alongside a dedicated parser that provides immediate visual feedb
- [cucumber/cucumber-js](https://awesome-repositories.com/repository/cucumber-cucumber-js.md) (5,350 ⭐) — Cucumber-js is a behavior-driven development tool and acceptance test automation framework for JavaScript. It serves as a test automation engine that allows users to define system behavior using human-readable scenarios, mapping plain-text steps to executable code to verify that software conforms to business rules and acceptance criteria.

The project utilizes a Gherkin-based framework to create executable system documentation in natural language. It supports multi-language localization for writing scenarios in various spoken languages and provides tools for scenario parameterization and busin
- [tokenrove/build-your-own-shell](https://awesome-repositories.com/repository/tokenrove-build-your-own-shell.md) (496 ⭐) — Guidance for mollusks (WIP)
- [tarekalabd/flutter-steps-tracker](https://awesome-repositories.com/repository/tarekalabd-flutter-steps-tracker.md) (116 ⭐) — It's an open source project for the steps tracking (Pedometer) built with Flutter and integrated with Firebase as the initial backend service.
- [astral-sh/uv](https://awesome-repositories.com/repository/astral-sh-uv.md) (86,451 ⭐) — uv is a high-performance Python package manager and project build tool designed to handle dependency resolution, virtual environment orchestration, and Python interpreter management. It functions as a comprehensive workspace orchestrator, enabling developers to manage complex, multi-package repositories and ensure reproducible builds across different platforms.

The tool distinguishes itself through its use of a global, content-addressable cache and hard-link-based environment provisioning, which allow for near-instant environment creation and minimal disk usage. It employs a high-performance
- [jqlang/jq](https://awesome-repositories.com/repository/jqlang-jq.md) (34,901 ⭐) — This project is a command-line processor designed for the parsing, filtering, and transformation of structured data streams. It functions as a declarative programming environment that treats data as immutable streams, allowing users to perform complex structural modifications through the composition of small, reusable functions. By utilizing a recursive tree traversal engine, the system enables the navigation, inspection, and modification of deeply nested hierarchical data structures.

The engine distinguishes itself through a stream-oriented architecture that processes input records one by on
- [codecrafters-io/build-your-own-x](https://awesome-repositories.com/repository/codecrafters-io-build-your-own-x.md) (516,240 ⭐) — This project provides a comprehensive framework for creating, managing, and executing educational programming challenges. It includes standardized systems for authoring instructional content, defining test cases, and structuring documentation to ensure consistent learning outcomes. The platform supports a wide range of programming languages through dedicated execution environments that handle compilation, dependency management, and automated testing.

The infrastructure facilitates both local and remote development workflows, offering command-line utilities for testing code without requiring v
- [koalaman/shellcheck](https://awesome-repositories.com/repository/koalaman-shellcheck.md) (39,574 ⭐) — This project is a static analysis tool and linter designed to improve the quality, reliability, and portability of shell scripts. By performing deep structural analysis, it identifies common programming pitfalls, syntax errors, and security vulnerabilities before scripts are executed. It functions as an automated code reviewer that enforces best practices and helps developers maintain consistent, robust code across different operating environments.

The tool distinguishes itself through its dialect-aware grammar resolution, which adapts its parsing logic based on the specific shell interpreter
- [infobyte/faraday](https://awesome-repositories.com/repository/infobyte-faraday.md) (6,523 ⭐) — Faraday is a vulnerability management platform and security tool aggregator designed to centralize security findings from multiple scanners into a single dashboard. It utilizes a relational security database to catalog hosts, services, and security flaws, enabling users to track remediation and analyze organizational risk.

The platform distinguishes itself through a plugin-based system that normalizes diverse security tool outputs into a unified data model. It supports deep integration with a wide array of scanners and CLI tools, intercepting shell command output or parsing report files to ag
- [wilfred/difftastic](https://awesome-repositories.com/repository/wilfred-difftastic.md) (24,175 ⭐)
- [infaaa/build-your-own-x-vibe-coding](https://awesome-repositories.com/repository/infaaa-build-your-own-x-vibe-coding.md) (80 ⭐) — Master programming by recreating your favorite technologies from scratch with vibe coding.
- [alirezarezvani/claude-skills](https://awesome-repositories.com/repository/alirezarezvani-claude-skills.md) (18,240 ⭐) — This project is a framework for integrating modular instruction packages and domain-specific tools into large language model agents. It provides a system for managing agent context and extending coding assistants through a modular prompt library of persona-based instruction sets and skill trees.

The framework distinguishes itself through a persistent memory layer that tracks architectural decisions and infrastructure patterns to prevent regressions during autonomous code modifications. It includes an orchestrator for managing multi-agent swarms and autonomous coding loops that cycle through g
- [kestra-io/kestra](https://awesome-repositories.com/repository/kestra-io-kestra.md) (27,073 ⭐) — Kestra is a declarative workflow orchestrator designed to manage complex task dependencies and automated processes through versioned configuration files. It functions as a distributed platform that decouples task scheduling from execution by offloading computational workloads to a fleet of worker nodes. The system uses a reactive, event-driven engine to initiate workflows automatically in response to external signals, webhooks, schedules, or file system changes.

The platform distinguishes itself through a modular plugin architecture that allows for the integration of custom tasks and external
- [denysdovhan/wtfjs](https://awesome-repositories.com/repository/denysdovhan-wtfjs.md) (37,628 ⭐) — This project is an educational resource and technical reference archive focused on the core architecture and counter-intuitive behaviors of the JavaScript programming language. It provides a comprehensive collection of language edge cases, syntax anomalies, and runtime inconsistencies that challenge standard developer assumptions. By grounding these examples in the official ECMAScript specification, the repository serves as a guide for understanding the underlying mechanics of the language.

The project distinguishes itself by cataloging specific instances of type coercion, operator precedence
