21 个仓库
Custom logic implemented as functions that can be called within database queries.
Distinct from Rust User-Defined Functions: Existing candidates are too specific to Rust, WASM, or async I/O, whereas this is a general database capability.
Explore 21 awesome GitHub repositories matching data & databases · User-Defined Functions. Refine with filters or upvote what's useful.
Shell GPT is an AI-powered command-line interface that generates shell commands and source code from natural language prompts. It serves as a terminal-based tool for automating technical tasks, producing executable commands, and generating code snippets directly within the shell. The tool distinguishes itself through a read-eval-print loop for interactive chatting and the ability to maintain stateful conversational history via named sessions. It supports flexible backend routing, allowing users to connect to cloud-based APIs or local language model hosts for offline operation and data privacy
Allows the model to execute custom local functions to perform system tasks and analyze the results.
30 Seconds of Python is a curated collection of short, reusable Python code snippets designed for quick reference and immediate reuse. It provides an interactive browser that lets you explore snippets organized by tags, search across names and descriptions, and copy code to your clipboard with a single click. The collection covers a broad range of common programming tasks, including list and dictionary operations, string formatting and manipulation, date and time calculations, and color format conversion. It also includes utilities for data transformation, such as converting between case styl
Provides a snippet to sort a list of dictionaries by multiple keys using a tuple.
OfficeCLI 是一个无头(headless)办公套件和自动化工具,专为程序化读取、编辑和生成 Microsoft Office 文档而设计。它充当 OOXML 操作库和文档模板引擎,提供了一个独立的二进制文件,允许在无需本地安装办公软件的情况下管理 Word、Excel 和 PowerPoint 文件。 该项目通过将文档操作作为 AI 代理的工具(通过 JSON-RPC 服务器和模型上下文协议)公开而脱颖而出。它通过使用 XPath 进行原始 XML 操作实现了高级自定义,并提供了一个将文档子树转储为可重放 JSON 批处理的序列化系统。 该工具涵盖了广泛的功能,包括具有公式评估和数据透视表生成的程序化电子表格工程,以及全面的文字处理任务(如样式管理、修订跟踪和多语言文本格式化)。它还包括用于数据可视化、将内容提取为结构化 JSON 或高保真 HTML 的工具,以及将 JSON 数据合并到预定义模板中以进行自动化报告生成的实用程序。
Allows searching for spreadsheet cells using value matches, data types, formulas, and text-based selectors.
Rockstar is an esoteric programming language whose syntax is inspired by 1980s hard rock and heavy metal lyrics, making programs read like song lyrics. It uses poetic number literals, where the length of each word in a phrase is parsed as a decimal digit to initialize numeric variables, and pronoun-based variable references, where words like "it" and "they" resolve to the most recently assigned or compared variable at runtime. The language can be compiled to WebAssembly and run inside a web page for in-browser code execution. The language distinguishes itself through a lyric-driven parsing sy
Declares and invokes named functions using keywords like 'takes' and 'giving'.
This project is a streaming data integration framework that captures real-time database changes and synchronizes them with downstream systems. It operates as a distributed streaming ETL and database synchronizer, reading database logs and snapshots to propagate row-level modifications to target sinks. The system supports declarative data integration, allowing users to define source-to-sink data flows using SQL or YAML configurations. It distinguishes itself by automating schema evolution to maintain synchronization when source structures change and ensuring exactly-once delivery and processin
Integrates custom logic classes to perform specialized data transformations via programmable evaluation methods.
KurrentDB is an event-native database designed for event sourcing and event-driven architectures. It stores application state as an immutable, ordered sequence of events rather than updating rows in place, preserving full history for audit, replay, and distributed consistency. The database combines event storage with real-time streaming and a built-in JavaScript projection engine that transforms and aggregates event streams into materialized views. The system provides official gRPC client libraries for Python, Node.js, Java, .NET, Go, and Rust, enabling multi-language application development
Runs user-defined JavaScript functions to transform and aggregate event streams into materialized views.
KurrentDB is an event-native database designed for event sourcing and event-driven architectures. It stores events as immutable, ordered records in streams, preserving a complete audit trail and enabling temporal queries. The database uses gRPC for all client-server and inter-node communication, providing efficient binary serialization and bidirectional streaming, and supports atomic multi-stream writes that ensure consistency across multiple streams in a single transaction. The database distinguishes itself with a built-in JavaScript projection engine that transforms, filters, and aggregates
Ships a built-in JavaScript projection engine for real-time event stream transformation and materialized views.
seL4 is a formally verified microkernel whose C implementation is backed by machine-checked mathematical proofs of correctness, confidentiality, integrity, and availability. It enforces strict isolation between processes through hardware-enforced address space separation and a capability-based access control system, where each process holds explicit rights only to the resources it has been granted. The kernel exposes hardware resources through a minimal API of system calls that manage threads, address spaces, and inter-process communication, with synchronous IPC supporting sender-identifying b
Specifies a set of methods with typed parameters that components can provide or require for structured communication.
Inngest is a durable execution framework and event-driven automation engine designed to orchestrate background workflows. It enables developers to build resilient, stateful processes by memoizing function steps, ensuring that long-running tasks can automatically resume from the last successful operation after failures, timeouts, or infrastructure restarts. The platform distinguishes itself through its event-driven architecture, which uses a schema-validated bus to trigger functions and coordinate complex, multi-step logic. It employs an onion-model middleware approach for cross-cutting concer
Provides actions and interface components that allow end-users to configure and execute custom automation sequences within an application.
Danfo.js 是一个 JavaScript 数据分析和预处理库,提供高性能的标签化数据结构。它实现了数据帧(DataFrames)和序列(Series),以支持复杂的数据分析、统计计算和结构化表格数据的操作。 该项目作为一个机器学习预处理库,提供用于分类标签编码、独热编码(One-hot encoding)以及数值特征缩放和标准化的实用程序。它特别促进了将标签化数据结构转换为张量(Tensors)以进行模型训练和评估的过程。 该库涵盖了广泛的能力,包括描述性统计、合并和连接等关系操作以及时间序列处理。它包括用于数据清洗、过滤和分组的工具,以及用于直接从数据帧生成交互式图表和绘图的视觉化界面。 该系统支持通过 CSV、JSON 和 Excel 格式导入和导出数据。
Allows the application of user-defined functions across data axes to transform or analyze values.
pgrx is a framework for building and packaging custom PostgreSQL extensions using Rust. It serves as a foreign function interface bridge that maps Rust types and functions to the internal server programming interface of the database. The project provides a specialized toolchain for cross-compilation across different operating systems and target architectures using custom sysroots. It includes a schema generator that automatically translates high-level language structures into SQL definitions and binary protocol mappings. The framework manages memory safety by wrapping database memory context
Exposes Rust functions as PostgreSQL user-defined functions via declarative attribute macros.
H2 是一个用 Java 编写的 JDBC 兼容关系型数据库管理系统。它作为一个可嵌入的 SQL 数据库,可以直接在应用程序进程内运行以消除网络延迟,或者作为内存数据库用于高性能的易失性存储。它还包含一个基于 Web 的控制台,用于执行 SQL 命令和管理模式。 该系统的特点是其灵活的部署模式,包括用于远程 TCP/IP 访问的独立服务器模式,以及用于同时进行本地和远程连接的混合模式。它具有方言模拟层和兼容模式,允许其模仿其他数据库系统的行为和语法。 该引擎提供了一套广泛的功能,涵盖具有多版本并发控制(MVCC)的 ACID 事务、地理空间和 JSON 数据支持,以及高级分析窗口函数。它包括通过压缩备份、SQL 脚本恢复和堆外内存管理来处理大数据集的数据保护工具。 该数据库使用标准的 Java 数据库连接驱动程序和连接 URL 与应用程序集成。
Provides the ability to implement custom logic as functions that can be called within SQL queries.
Apache AGE is a graph database extension for PostgreSQL that adds openCypher graph query capabilities directly within the relational database environment. It functions as a loadable extension that translates Cypher graph traversal queries into SQL expressions, enabling users to run pattern matching and path analysis alongside standard SQL operations within a single database instance. The extension stores labeled, directed property graphs as isolated schemas with internal relational tables for vertices, edges, and labels, preventing cross-graph interference. It supports hybrid query execution
Registers user-defined PL/pgSQL functions as custom Cypher query functions for extensibility.
Rusqlite is an embedded database interface and relational database driver that provides a client library for interacting with SQLite. It functions as an SQL query wrapper, enabling the management of local file-based or in-memory databases through a safe interface. The library allows for the extension of native database capabilities by implementing custom scalar functions, collations, and virtual tables. It also supports the embedding of the database engine directly into the application binary to remove external library dependencies. The project covers a broad range of capabilities including
Supports the creation of user-defined functions to implement custom logic within SQL queries.
govaluate 是一个 Go 表达式求值器和运行时逻辑解析器。它作为一个动态规则引擎,将基于字符串的表达式转换为可执行操作,从而允许在运行时计算数学和逻辑表达式。 该库支持在不重新编译源代码的情况下执行业务规则和自定义公式。它支持通过注入外部参数和执行用户定义的自定义函数来扩展求值行为。 该系统提供了动态数据验证功能,包括正则表达式匹配和集合成员资格验证。它还使用空合并逻辑处理缺失结果,并通过类型无关的处理方式处理值。
Allows the registration of custom Go functions to extend the native operator set of expressions.
This is a Python library providing sorted list, set, and dictionary data structures that maintain their order automatically during insertions and deletions. The library provides a sorted list for fast random access and logarithmic lookups, a sorted set for unique elements and set-theoretic operations, and a sorted dictionary for managing key-value pairs where keys remain sorted. These collections support custom sorting logic through user-defined key functions to determine the order of elements. Core capabilities include positional indexing, range queries, and the use of bisection methods to
Supports custom sorting logic through user-defined key functions to determine element order.
Langroid is a multi-agent orchestration framework and tool integration suite designed for building complex AI applications. It serves as a multi-modal integration layer that connects diverse local and remote language models with an agentic retrieval-augmented generation system. The project distinguishes itself through a collaborative message-exchange paradigm, allowing specialized agents to delegate tasks hierarchically and coordinate via structured communication. It features an advanced state management system for conversational AI, including the ability to rewind and prune conversation hist
Implements Pydantic-based interfaces that agents use to execute tasks and validate JSON output.
USearch is a high-performance vector similarity search engine and approximate nearest neighbor index designed for dense embeddings. It functions as a low-level vector database core and high-dimensional vector indexer, providing the primitives necessary to store and retrieve vectors across massive datasets. The engine distinguishes itself through hardware-level SIMD acceleration for distance kernels and a proximity-graph indexing system that enables fast retrieval across billions of vectors. It supports multi-precision vector quantization to balance memory usage and accuracy, and utilizes memo
Allows the integration of custom compiled functions or assembly code to implement specialized distance calculations.
xlwings - Make Excel fly with Python!
Defines custom Excel worksheet functions written in Python that calculate values in cells.
Gravitino is a federated metadata lake and unified data catalog designed to manage tables, files, and AI models across diverse data sources and cloud storage. It serves as a centralized interface for governing schemas, access controls, and tagging across relational databases, messaging queues, and object stores. The project distinguishes itself by unifying the management of AI assets, such as machine learning models and their version lineages, alongside traditional tabular data. It also implements the Iceberg REST specification to provide a standardized metadata server and proxy for lakehouse
Supports the creation and deletion of scalar and table-valued user-defined functions.