21 Repos
Custom logic implemented as functions that can be called within database queries.
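As a minimal sketch of that definition, the example below registers a Python callable as a SQL function using the standard-library `sqlite3` module and then calls it inside a query. The function `title_case` and the `repos` table are hypothetical names chosen for illustration; other databases expose the same idea through their own registration mechanisms.

```python
import sqlite3


def title_case(text):
    """Hypothetical custom logic: capitalize each word, passing NULL through."""
    return None if text is None else text.title()


conn = sqlite3.connect(":memory:")

# Register the Python callable as a one-argument SQL function.
conn.create_function("title_case", 1, title_case)

# Illustrative table and rows (not taken from any listed project).
conn.execute("CREATE TABLE repos (name TEXT)")
conn.executemany(
    "INSERT INTO repos VALUES (?)",
    [("shell gpt",), ("apache age",)],
)

# The custom function is now callable from within the query itself.
rows = conn.execute(
    "SELECT title_case(name) FROM repos ORDER BY name"
).fetchall()
print(rows)
```

The database engine invokes the callable once per row, so the custom logic runs as part of query evaluation rather than in a separate post-processing pass.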
Distinct from Rust User-Defined Functions: Existing candidates are too specific to Rust, WASM, or async I/O, whereas this is a general database capability.
Explore 21 awesome GitHub repositories matching data & databases · User-Defined Functions.
Shell GPT is an AI-powered command-line interface that generates shell commands and source code from natural language prompts. It serves as a terminal-based tool for automating technical tasks, producing executable commands, and generating code snippets directly within the shell. The tool distinguishes itself through a read-eval-print loop for interactive chatting and the ability to maintain stateful conversational history via named sessions. It supports flexible backend routing, allowing users to connect to cloud-based APIs or local language model hosts for offline operation and data privacy
Allows the model to execute custom local functions to perform system tasks and analyze the results.
30 Seconds of Python is a curated collection of short, reusable Python code snippets designed for quick reference and immediate reuse. It provides an interactive browser that lets you explore snippets organized by tags, search across names and descriptions, and copy code to your clipboard with a single click. The collection covers a broad range of common programming tasks, including list and dictionary operations, string formatting and manipulation, date and time calculations, and color format conversion. It also includes utilities for data transformation, such as converting between case styl
Provides a snippet to sort a list of dictionaries by multiple keys using a tuple.
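The multi-key sort mentioned above can be sketched as follows. This is an illustrative version, not the collection's own snippet; the `people` records and their field names are made up for the example. The key function returns a tuple, and tuples compare element by element, so the first element is the primary sort key and the second breaks ties.

```python
# Hypothetical sample data for illustration.
people = [
    {"name": "Ada", "age": 36},
    {"name": "Bob", "age": 25},
    {"name": "Cy", "age": 36},
]

# Sort by age (descending, via negation), then by name (ascending).
by_age_then_name = sorted(people, key=lambda p: (-p["age"], p["name"]))

names = [p["name"] for p in by_age_then_name]
print(names)
```

Negating a numeric field is a common way to mix ascending and descending order within one tuple key; it does not work for strings, where two stable sorts in sequence are the usual alternative.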
OfficeCLI is a headless office suite and automation tool designed to read, edit, and generate Microsoft Office documents programmatically. It functions as an OOXML manipulation library and document templating engine, offering a standalone binary that manages Word, Excel, and PowerPoint files without requiring a local installation of Office software. The project distinguishes itself by exposing document operations as tools for AI agents via a JSON-RPC server and the Model Context Protocol. It enables advanced customization through raw XML manipulation using XPath and provides a serialization system that emits document subtrees as replayable JSON batches. The tool covers a broad range of capabilities, including programmatic spreadsheet engineering with formula evaluation and pivot table generation, as well as comprehensive word-processing tasks such as style management, revision tracking, and multilingual text formatting. It also includes utilities for data visualization, content extraction into structured JSON or high-fidelity HTML, and merging JSON data into predefined templates for automated report generation.
Allows searching for spreadsheet cells using value matches, data types, formulas, and text-based selectors.
Rockstar is an esoteric programming language whose syntax is inspired by 1980s hard rock and heavy metal lyrics, making programs read like song lyrics. It uses poetic number literals, where the length of each word in a phrase is parsed as a decimal digit to initialize numeric variables, and pronoun-based variable references, where words like "it" and "they" resolve to the most recently assigned or compared variable at runtime. The language can be compiled to WebAssembly and run inside a web page for in-browser code execution. The language distinguishes itself through a lyric-driven parsing sy
Declares and invokes named functions using keywords like 'takes' and 'giving'.
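The poetic number literals described for Rockstar can be illustrated with a small Python sketch. It assumes that each word's letter count is taken modulo 10 to yield one digit and that only alphabetic characters count; both are simplifying assumptions for this example, and decimal points and other details of the language are not handled. The function name `poetic_number` is hypothetical.

```python
def poetic_number(phrase: str) -> int:
    """Turn a phrase into an integer, one decimal digit per word.

    Assumption: digit = (number of letters in the word) modulo 10.
    """
    digits = []
    for word in phrase.split():
        letters = sum(1 for ch in word if ch.isalpha())
        if letters:  # skip tokens with no letters at all
            digits.append(str(letters % 10))
    return int("".join(digits)) if digits else 0


print(poetic_number("a lovestruck ladykiller"))
print(poetic_number("ice cold"))
```

Under these assumptions, a ten-letter word contributes the digit 0, which is how a lyric-like phrase can encode round numbers.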
This project is a streaming data integration framework that captures real-time database changes and synchronizes them with downstream systems. It operates as a distributed streaming ETL and database synchronizer, reading database logs and snapshots to propagate row-level modifications to target sinks. The system supports declarative data integration, allowing users to define source-to-sink data flows using SQL or YAML configurations. It distinguishes itself by automating schema evolution to maintain synchronization when source structures change and ensuring exactly-once delivery and processin
Integrates custom logic classes to perform specialized data transformations via programmable evaluation methods.
KurrentDB is an event-native database designed for event sourcing and event-driven architectures. It stores application state as an immutable, ordered sequence of events rather than updating rows in place, preserving full history for audit, replay, and distributed consistency. The database combines event storage with real-time streaming and a built-in JavaScript projection engine that transforms and aggregates event streams into materialized views. The system provides official gRPC client libraries for Python, Node.js, Java, .NET, Go, and Rust, enabling multi-language application development
Runs user-defined JavaScript functions to transform and aggregate event streams into materialized views.
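Conceptually, a projection of this kind is a fold over an ordered event stream: each user-defined handler takes the current state and one event and returns the next state. The Python sketch below illustrates that idea only. It is not KurrentDB's projection API, and the event types, handler names, and `project` helper are all hypothetical.

```python
def project(events, handlers, initial_state):
    """Fold an ordered event stream into a view using per-type handlers."""
    state = initial_state
    for event in events:
        handler = handlers.get(event["type"])
        if handler is not None:  # events without a handler are ignored
            state = handler(state, event)
    return state


def on_item_added(state, event):
    sku, qty = event["data"]["sku"], event["data"]["qty"]
    return {**state, sku: state.get(sku, 0) + qty}


def on_item_removed(state, event):
    sku, qty = event["data"]["sku"], event["data"]["qty"]
    return {**state, sku: state.get(sku, 0) - qty}


# Hypothetical, immutable event history in stream order.
events = [
    {"type": "ItemAdded", "data": {"sku": "a", "qty": 2}},
    {"type": "ItemAdded", "data": {"sku": "b", "qty": 1}},
    {"type": "ItemRemoved", "data": {"sku": "a", "qty": 1}},
]

view = project(
    events,
    {"ItemAdded": on_item_added, "ItemRemoved": on_item_removed},
    {},
)
print(view)
```

Because the events themselves are never modified, the same stream can be replayed through different handlers to build other views.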
KurrentDB is an event-native database designed for event sourcing and event-driven architectures. It stores events as immutable, ordered records in streams, preserving a complete audit trail and enabling temporal queries. The database uses gRPC for all client-server and inter-node communication, providing efficient binary serialization and bidirectional streaming, and supports atomic multi-stream writes that ensure consistency across multiple streams in a single transaction. The database distinguishes itself with a built-in JavaScript projection engine that transforms, filters, and aggregates
Ships a built-in JavaScript projection engine for real-time event stream transformation and materialized views.
seL4 is a formally verified microkernel whose C implementation is backed by machine-checked mathematical proofs of correctness, confidentiality, integrity, and availability. It enforces strict isolation between processes through hardware-enforced address space separation and a capability-based access control system, where each process holds explicit rights only to the resources it has been granted. The kernel exposes hardware resources through a minimal API of system calls that manage threads, address spaces, and inter-process communication, with synchronous IPC supporting sender-identifying b
Specifies a set of methods with typed parameters that components can provide or require for structured communication.
Inngest is a durable execution framework and event-driven automation engine designed to orchestrate background workflows. It enables developers to build resilient, stateful processes by memoizing function steps, ensuring that long-running tasks can automatically resume from the last successful operation after failures, timeouts, or infrastructure restarts. The platform distinguishes itself through its event-driven architecture, which uses a schema-validated bus to trigger functions and coordinate complex, multi-step logic. It employs an onion-model middleware approach for cross-cutting concer
Provides actions and interface components that allow end-users to configure and execute custom automation sequences within an application.
Danfo.js is a data analysis and preprocessing library for JavaScript that provides high-performance labeled data structures. It implements dataframes and series to enable complex data analysis, statistical computation, and manipulation of structured tabular data. The project serves as a preprocessing library for machine learning, offering utilities for categorical label encoding, one-hot encoding, and the scaling and standardization of numeric features. In particular, it facilitates converting labeled data structures into tensors for model training and evaluation. The library covers a broad range of capabilities, including descriptive statistics, relational operations such as merging and joining, and time series processing. It includes tools for data cleaning, filtering, and grouping, as well as a visualization interface for creating interactive charts and plots directly from dataframes. The system supports importing and exporting data in CSV, JSON, and Excel formats.
Allows the application of user-defined functions across data axes to transform or analyze values.
pgrx is a framework for building and packaging custom PostgreSQL extensions using Rust. It serves as a foreign function interface bridge that maps Rust types and functions to the internal server programming interface of the database. The project provides a specialized toolchain for cross-compilation across different operating systems and target architectures using custom sysroots. It includes a schema generator that automatically translates high-level language structures into SQL definitions and binary protocol mappings. The framework manages memory safety by wrapping database memory context
Exposes Rust functions as PostgreSQL user-defined functions via declarative attribute macros.
H2 is a JDBC-compliant relational database management system written in Java. It functions as an embeddable SQL database that can run directly inside an application process to eliminate network latency, or as an in-memory database for fast, volatile storage. It also includes a web-based console for executing SQL commands and managing schemas. The system stands out for its flexible deployment modes, including a standalone server mode for remote TCP/IP access and a mixed mode for simultaneous local and remote connectivity. It has a dialect emulation layer and compatibility modes that allow it to mimic the behavior and syntax of other database systems. The engine offers a broad range of capabilities, including ACID transactions with multi-version concurrency control (MVCC), support for geospatial data and JSON, and advanced analytical window functions. It includes tools for data protection through compressed backups, SQL script restoration, and off-heap memory management for large datasets. The database integrates into applications through standard JDBC drivers and connection URLs.
Provides the ability to implement custom logic as functions that can be called within SQL queries.
Apache AGE is a graph database extension for PostgreSQL that adds openCypher graph query capabilities directly within the relational database environment. It functions as a loadable extension that translates Cypher graph traversal queries into SQL expressions, enabling users to run pattern matching and path analysis alongside standard SQL operations within a single database instance. The extension stores labeled, directed property graphs as isolated schemas with internal relational tables for vertices, edges, and labels, preventing cross-graph interference. It supports hybrid query execution
Registers user-defined PL/pgSQL functions as custom Cypher query functions for extensibility.
Rusqlite is an embedded database interface and relational database driver that provides a client library for interacting with SQLite. It functions as an SQL query wrapper, enabling the management of local file-based or in-memory databases through a safe interface. The library allows for the extension of native database capabilities by implementing custom scalar functions, collations, and virtual tables. It also supports the embedding of the database engine directly into the application binary to remove external library dependencies. The project covers a broad range of capabilities including
Supports the creation of user-defined functions to implement custom logic within SQL queries.
govaluate is a Go expression evaluator and runtime logic parser. It functions as a dynamic rule engine that converts string-based expressions into executable operations, enabling mathematical and logical expressions to be computed at runtime. The library allows business rules and user-defined formulas to be executed without recompiling the source code. It supports extending evaluation behavior by injecting external parameters and executing user-defined functions. The system offers capabilities for dynamic data validation, including regex matching and collection membership checks. It also handles missing results through null-coalescing logic and provides type-agnostic value handling.
Allows the registration of custom Go functions to extend the native operator set of expressions.
This is a Python library providing sorted list, set, and dictionary data structures that maintain their order automatically during insertions and deletions. The library provides a sorted list for fast random access and logarithmic lookups, a sorted set for unique elements and set-theoretic operations, and a sorted dictionary for managing key-value pairs where keys remain sorted. These collections support custom sorting logic through user-defined key functions to determine the order of elements. Core capabilities include positional indexing, range queries, and the use of bisection methods to
Supports custom sorting logic through user-defined key functions to determine element order.
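The role of a user-defined key function in a sorted collection can be sketched with the standard-library `bisect` module. The `KeySortedList` class below is a hypothetical, simplified stand-in written for illustration; it is not the library's API, and its list-based inserts are linear-time rather than the optimized structure a real sorted-collection library would use.

```python
import bisect


class KeySortedList:
    """Keeps items ordered by key(item); a simplified illustration only."""

    def __init__(self, key):
        self._key = key
        self._keys = []   # keys, kept sorted for bisection
        self._items = []  # items, in the same order as their keys

    def add(self, item):
        k = self._key(item)
        # Find the insertion point by bisecting on the precomputed keys.
        i = bisect.bisect_right(self._keys, k)
        self._keys.insert(i, k)
        self._items.insert(i, item)

    def __getitem__(self, index):
        return self._items[index]

    def __iter__(self):
        return iter(self._items)

    def __len__(self):
        return len(self._items)


# Order words by length instead of alphabetically.
words = KeySortedList(key=len)
for w in ["pear", "fig", "banana", "kiwi"]:
    words.add(w)

print(list(words))
```

Using `bisect_right` places an item after existing items with an equal key, so items that tie on the key keep their insertion order.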
Langroid is a multi-agent orchestration framework and tool integration suite designed for building complex AI applications. It serves as a multi-modal integration layer that connects diverse local and remote language models with an agentic retrieval-augmented generation system. The project distinguishes itself through a collaborative message-exchange paradigm, allowing specialized agents to delegate tasks hierarchically and coordinate via structured communication. It features an advanced state management system for conversational AI, including the ability to rewind and prune conversation hist
Implements Pydantic-based interfaces that agents use to execute tasks and validate JSON output.
USearch is a high-performance vector similarity search engine and approximate nearest neighbor index designed for dense embeddings. It functions as a low-level vector database core and high-dimensional vector indexer, providing the primitives necessary to store and retrieve vectors across massive datasets. The engine distinguishes itself through hardware-level SIMD acceleration for distance kernels and a proximity-graph indexing system that enables fast retrieval across billions of vectors. It supports multi-precision vector quantization to balance memory usage and accuracy, and utilizes memo
Allows the integration of custom compiled functions or assembly code to implement specialized distance calculations.
xlwings - Make Excel fly with Python!
Defines custom Excel worksheet functions written in Python that calculate values in cells.
Gravitino is a federated metadata lake and unified data catalog designed to manage tables, files, and AI models across diverse data sources and cloud storage. It serves as a centralized interface for governing schemas, access controls, and tagging across relational databases, messaging queues, and object stores. The project distinguishes itself by unifying the management of AI assets, such as machine learning models and their version lineages, alongside traditional tabular data. It also implements the Iceberg REST specification to provide a standardized metadata server and proxy for lakehouse
Supports the creation and deletion of scalar and table-valued user-defined functions.