Why is docling-project/docling a recommended Intermediate Representations repository?

Normalizes diverse input formats into a consistent internal data model to enable uniform processing across different sources.

Why is terrastruct/d2 a recommended Intermediate Representations repository?

Normalizes input scripts into a unified intermediate graph representation to facilitate consistent cross-format rendering.

Why is dbt-labs/dbt-core a recommended Intermediate Representations repository?

Separates complex, multi-step data processing into dedicated models to simplify final reporting tables.

Why is quay/clair a recommended Intermediate Representations repository?

Transforms raw package data into a standardized intermediate representation to correlate source-level packages with binary versions.

Why is flamegraph-rs/flamegraph a recommended Intermediate Representations repository?

Normalizes raw text output from various profiling tools into a consistent internal call stack representation.

Why is microsoft/poml a recommended Intermediate Representations repository?

Transforms semantic XML-like syntax into a structured internal tree for consistent processing across different models.

Why is Kotlin/dokka a recommended Intermediate Representations repository?

Normalizes diverse source code structures into a unified model to facilitate consistent documentation output.

8 repositories

Awesome GitHub Repositories: Intermediate Representations

Internal data models that normalize diverse input formats into a consistent structure for uniform processing.

Explore 8 GitHub repositories in the Data & Databases category that match Intermediate Representations.


docling-project/docling
61,674 · View on GitHub
Docling is a modular framework designed for document parsing, layout analysis, and structured data extraction. It transforms unstructured files and web content into a unified, hierarchical data model that preserves the spatial and semantic relationships between text, tables, images, and layout elements. By normalizing diverse input formats into a consistent internal representation, the library enables uniform processing across various document types. The project distinguishes itself through a schema-driven approach that maps document regions to strongly-typed objects, ensuring data accuracy t
Normalizes diverse input formats into a consistent internal data model to enable uniform processing across different sources.
Python · ai · convert · document-parser
terrastruct/d2
23,083 · View on GitHub
This project is a diagram-as-code tool that transforms declarative text scripts into professional visual representations. It functions as a technical documentation generator, allowing users to define nodes, connections, and hierarchical relationships through a domain-specific modeling language that integrates directly into version-controlled developer workflows. The tool distinguishes itself through a highly modular architecture that decouples diagram definitions from spatial positioning. It features a pluggable layout engine that supports multiple arrangement algorithms, alongside a styling
Normalizes input scripts into a unified intermediate graph representation to facilitate consistent cross-format rendering.
Go · developer-tools · diagramming · diagrams
alibaba/DataX
17,241 · View on GitHub
DataX is a distributed data integration framework and plugin-based ETL tool designed for synchronizing large datasets between heterogeneous sources and destinations. It functions as a JDBC data migration engine and offline synchronization tool, enabling the movement of data between relational databases, NoSQL stores, and object storage. The system utilizes a plugin-based connector architecture that decouples reader and writer logic, allowing it to map and transform data types across different storage engines using a standardized internal representation. This design supports heterogeneous data
Employs internal data models that normalize diverse input formats into a consistent structure for uniform processing across different storage engines.
Java
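The reader/writer decoupling described for DataX can be illustrated with a minimal Python sketch. This is not DataX's actual Java plugin API: `Record`, `CsvReader`, `JsonLinesWriter`, and `sync` are hypothetical names chosen here only to show how a shared internal record type lets any reader be paired with any writer.

```python
import csv
import io
import json
from dataclasses import dataclass
from typing import Iterable, Iterator, Protocol


@dataclass
class Record:
    """Hypothetical internal representation shared by all plugins."""
    columns: dict[str, object]


class Reader(Protocol):
    def read(self) -> Iterator[Record]: ...


class Writer(Protocol):
    def write(self, records: Iterable[Record]) -> None: ...


class CsvReader:
    """Reads CSV text and emits Records; knows nothing about any writer."""

    def __init__(self, text: str) -> None:
        self.text = text

    def read(self) -> Iterator[Record]:
        for row in csv.DictReader(io.StringIO(self.text)):
            yield Record(columns=dict(row))


class JsonLinesWriter:
    """Consumes Records and stores one JSON object per line."""

    def __init__(self) -> None:
        self.lines: list[str] = []

    def write(self, records: Iterable[Record]) -> None:
        for record in records:
            self.lines.append(json.dumps(record.columns, sort_keys=True))


def sync(reader: Reader, writer: Writer) -> None:
    """Move data from any reader to any writer through the shared Record type."""
    writer.write(reader.read())


writer = JsonLinesWriter()
sync(CsvReader("id,name\n1,ada\n2,grace\n"), writer)
```

Because both sides depend only on `Record`, adding a new source or destination in this sketch means writing one new class rather than one converter per source/destination pair.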
dbt-labs/dbt-core
13,051 · View on GitHub
dbt-core is a command-line framework for transforming data within a warehouse using modular SQL and version control. It functions as a data transformation engine that enables users to define data structures and business logic through declarative configuration files, which the system then compiles into executable code. By managing complex data dependencies through a directed acyclic graph, it ensures that transformation tasks execute in the correct order while maintaining a manifest-driven state to track lineage and execution history. The project distinguishes itself through an adapter-based d
Separates complex, multi-step data processing into dedicated models to simplify final reporting tables.
Rust · analytics · business-intelligence · data-modeling
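The idea of ordering transformations through a directed acyclic graph can be sketched with Python's standard-library `graphlib`. This does not reproduce dbt-core's own implementation; the model names and the `execution_order` helper below are hypothetical and serve only to show how dependencies determine run order.

```python
from graphlib import TopologicalSorter

# Hypothetical models, each mapped to the set of models it depends on.
dependencies = {
    "stg_orders": set(),
    "stg_customers": set(),
    "int_orders_enriched": {"stg_orders", "stg_customers"},
    "fct_revenue": {"int_orders_enriched"},
}


def execution_order(deps: dict[str, set[str]]) -> list[str]:
    """Return an order in which every model runs after its dependencies."""
    # static_order() raises graphlib.CycleError if the graph has a cycle.
    return list(TopologicalSorter(deps).static_order())


order = execution_order(dependencies)
```

In this sketch the intermediate model sits between the staging models and the final reporting table, so the multi-step logic lives in its own node and the final table only selects from it.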
quay/clair
11,012 · View on GitHub
Clair is a container image vulnerability scanner and security analyzer. It performs static analysis of container images by matching package contents against vulnerability databases to identify security risks across different package formats and architectures. The project functions as both an image indexer and a vulnerability database manager. It processes container layers into intermediate representations to enable fast security lookups and synchronizes security metadata from multiple external sources to maintain a local registry. Capability areas include continuous security monitoring, whic
Transforms raw package data into a standardized intermediate representation to correlate source-level packages with binary versions.
Go · clair · containers · docker
flamegraph-rs/flamegraph
5,960 · View on GitHub
This project is a profiling tool written in Rust that captures, transforms, and visualizes function call stacks to identify system performance bottlenecks. It works as a wrapper around a sampling profiler, converting raw profiling data into interactive flamegraphs, which are hierarchical maps of resource consumption. The tool provides dedicated integration with the Rust build system for profiling binaries and benchmarks. It also allows custom profiling configurations, letting users override the default system profiling tools or pass recording flags to control how data is collected. The tool supports application performance monitoring and binary execution analysis. It can capture performance data by attaching to an active process ID, profiling a running application without requiring a restart.
Normalizes raw text output from various profiling tools into a consistent internal call stack representation.
Rust · flamegraphs · perf · profiling
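One common way to represent call stacks uniformly is the "folded stacks" text form, where each line holds semicolon-joined frames followed by a sample count. The Python sketch below illustrates that idea only; it is not this project's code, and it assumes the profiler output has already been parsed into root-first lists of frame names. `collapse` and `to_folded_lines` are hypothetical helper names.

```python
from collections import Counter


def collapse(samples: list[list[str]]) -> Counter:
    """Count identical stacks, keyed by their semicolon-joined frames."""
    return Counter(";".join(frames) for frames in samples)


def to_folded_lines(counts: Counter) -> list[str]:
    """Render each distinct stack as 'frame;frame;frame count', sorted by stack."""
    return [f"{stack} {n}" for stack, n in sorted(counts.items())]


# Hypothetical samples: each inner list is one sampled stack, root frame first.
samples = [
    ["main", "parse", "read_line"],
    ["main", "parse", "read_line"],
    ["main", "render"],
]

folded = to_folded_lines(collapse(samples))
```

Once every profiler's output is reduced to this shape, the rendering step only needs to understand one input format.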
microsoft/poml
4,853 · View on GitHub
Poml is a prompt management framework and templating engine designed for authoring, versioning, and rendering structured prompts for large language models. It uses a semantic markup language to organize prompts into reusable templates, combining them with dynamic context and data to generate formatted inputs. The system distinguishes itself by decoupling core prompt logic from final presentation through a stylesheet-based approach. It provides a dedicated JSON schema output generator to enforce strict, machine-parsable model responses and a configuration interface for managing function tool s
Transforms semantic XML-like syntax into a structured internal tree for consistent processing across different models.
TypeScript · llm · markup-language · prompt
Kotlin/dokka
3,780 · View on GitHub
Dokka is an extensible documentation engine designed to generate structured API reference material for Kotlin projects. By analyzing source code and comments, it acts as a static site generator that turns codebases into readable documentation. It integrates directly into development workflows as a build-system plugin, so reference material can be generated automatically during the standard build process. The project features a modular, plugin-based processing pipeline that lets developers modify the generation workflow, customize output formats, and adjust the visual design. It supports complex project structures by aggregating documentation from multiple subprojects into a single unified output. It also includes a cross-referencing engine that resolves internal code symbols and links them to external library documentation, keeping the generated material connected and navigable. The engine offers extensive control over the final output, including the ability to filter specific code elements, embed live usage examples, and link directly to remote source code repositories. It supports multiple output formats, such as HTML, Markdown, and Javadoc, and allows deep customization of templates and assets to meet branding or specific documentation requirements. The tool is designed to be invoked through command-line interfaces or native build-tool integrations, which makes it easy to use in automated deployment pipelines.
Normalizes diverse source code structures into a unified model to facilitate consistent documentation output.
Kotlin · api-reference · documentation · dokka
