12 repositorios
Methods for initializing components with local, static data sources.
Distinguishing note: Focuses on the loading of static data rather than dynamic fetching.
Explore 12 awesome GitHub repositories matching data & databases · Static Data Loading. Refine with filters or upvote what's useful.
This project is a centralized directory and resource manager for artificial intelligence-driven software engineering tools. It functions as a curated registry that aggregates extensions, automated workflows, and development resources, providing a structured database for developers to discover and implement AI-assisted coding solutions. The system distinguishes itself through a suite of automated maintenance utilities that ensure the integrity and currency of the curated data. It employs background processes to validate external links, synchronize remote repository information, and manage reso
Maintains a centralized collection of structured information as the single source of truth for curated resources.
Select2 is a searchable, modular UI framework designed to enhance standard HTML select elements. It transforms basic form controls into interactive, accessible dropdown interfaces that support multi-selection, tagging, and real-time filtering. By providing a robust set of tools for managing complex data inputs, it enables developers to create more responsive and user-friendly selection components. The project is distinguished by its adapter-based architecture, which allows for deep customization of rendering, data processing, and selection logic. Developers can extend core functionality throu
Allows populating dropdowns from local JavaScript arrays during initialization.
Qwerty-learner is a browser-based educational platform designed to improve typing proficiency while simultaneously facilitating English vocabulary acquisition. By combining keyboard muscle memory training with the repetitive input of curated word lists, the application provides a structured environment for students and professionals to master technical terminology and academic language. The platform distinguishes itself through a dedicated recall exercise mode, which triggers dictation-style sessions to strengthen long-term memory retention by requiring users to type words without visual prom
Loads vocabulary lists from local JSON files at runtime to allow for easy content expansion.
Zola is a static site generator that compiles Markdown and templates into a standalone website. It is distributed as a single binary, removing the need for external runtimes or package managers to build the final site. The project includes a built-in Sass compiler to transform styles into compressed CSS and a dedicated Markdown rendering engine that supports task lists and footnotes. It also features a client-side search indexer, enabling full-text site search without a backend server, and a multilingual content manager for organizing translated content. Additional capabilities cover asset o
Imports local CSV, TOML, JSON, or XML files into templates as structured data.
dbt-core is a command-line framework for transforming data within a warehouse using modular SQL and version control. It functions as a data transformation engine that enables users to define data structures and business logic through declarative configuration files, which the system then compiles into executable code. By managing complex data dependencies through a directed acyclic graph, it ensures that transformation tasks execute in the correct order while maintaining a manifest-driven state to track lineage and execution history. The project distinguishes itself through an adapter-based d
Imports local CSV files into the database as tables to provide version-controlled reference data for transformation logic.
Axolotl is a configuration-driven framework designed for the fine-tuning, evaluation, and quantization of large language models. It functions as a comprehensive orchestrator for distributed training, enabling users to manage complex workflows across multi-node and multi-GPU environments. By utilizing structured configuration files, the platform streamlines the setup of training parameters, dataset paths, and hardware distribution strategies. The project distinguishes itself through its support for diverse training methodologies, including full-parameter tuning, parameter-efficient adaptation,
Imports training data from local files including JSON, CSV, Parquet, and Arrow formats.
Feast is an open-source feature store for machine learning that provides a central platform for defining, storing, and serving features across both training and inference workflows. It operates as a declarative system where feature definitions are written as code in Python files, synchronized to a central registry, and made available for low-latency online retrieval or point-in-time correct historical joins for training datasets. The project abstracts storage behind a pluggable architecture, allowing offline and online backends to be swapped without changing retrieval logic, and coordinates ma
Loads models and lookup tables once during server startup to reduce per-request overhead.
Hazelcast is a distributed data platform that combines an in-memory data grid with a stream processing engine to support real-time analytics and event-driven applications. It functions as a partitioned, distributed key-value store that replicates data across cluster nodes to provide low-latency access and high availability. The platform also serves as a distributed SQL query engine, allowing users to execute standard SQL statements against both in-memory datasets and external data sources. What distinguishes Hazelcast is its use of a distributed consensus subsystem to maintain strongly consis
Defers the loading of data from external stores until a minimum number of cluster members are active.
AR.js is a JavaScript library for building augmented reality experiences that run directly in the web browser. It provides the core capability to render digital content overlaid on the real world using either the A-Frame HTML component system or the three.js JavaScript library, supporting both marker-based and location-based AR approaches. The library enables tracking of predefined 2D images, fiducial markers, and image targets through the device camera, allowing virtual objects such as 3D models, videos, or images to be positioned relative to recognized visual references. For location-based
Imports place data from HTML, JavaScript, JSON files, or API calls for location-based AR.
iDataV is a big-screen data visualization dashboard builder that combines bar, line, pie, map, and number-flip charts into a single large-format display. It provides a canvas-based rendering engine for high-performance chart drawing, a CSS Grid layout engine for responsive dashboard arrangements, and flip-digit animations for numeric indicators. The project includes template-based project scaffolding for generating new dashboards from pre-built HTML, CSS, and JavaScript templates. The tool offers prebuilt dashboard demos for scenarios such as tourism, sales, stock markets, cloud monitoring, a
Loads demo data from inline JSON objects or external files without a backend server or API.
This project is a Python library that wraps official NBA endpoints to retrieve player, team, and game statistics as structured data. It serves as a programmatic interface for fetching professional basketball league records and real-time scoreboards via HTTP requests. The library integrates with Pandas to transform raw JSON responses from sports servers into DataFrames for statistical analysis and data science. It functions as a data retrieval utility for tracking league-wide performance trends and scouting professional basketball players. The tool covers a broad range of capabilities includi
Initializes data components using local static files to minimize network requests for common metadata.
This project provides a comprehensive Chinese language corpus designed to support the training and fine-tuning of large language models. It serves as a structured natural language processing resource, offering a collection of text data that includes dialogue, customer service interactions, and creative writing. The dataset is organized into distinct thematic categories, allowing for targeted model development across specific conversational and narrative contexts. By providing information in standardized, schema-agnostic text formats, the collection ensures portability across various machine l
Stores datasets in structured text files to facilitate efficient static data loading for large language model training.