8 repository-uri
Command-line utilities for browsing and verifying records within a database.
Distinct from Remote Data Inspection: Distinct from remote data inspection by providing a full CLI interface for record browsing rather than simple previews.
Explore 8 awesome GitHub repositories matching data & databases · CLI Data Inspection. Refine with filters or upvote what's useful.
ParlAI is a conversational AI research framework designed for training, evaluating, and sharing dialogue models using a unified interface for datasets and agents. It functions as a PyTorch-based training platform and a dialogue data collection system, providing a centralized model zoo for the distribution of versioned pretrained agents. The project distinguishes itself through a knowledge-grounded retrieval system that combines dense and sparse indexing to ground responses in external information. It also provides a comprehensive infrastructure for gathering human-AI interaction data via inte
Allows inspection of sample episodes and input-output pairs from datasets via the command line.
AI Town is a TypeScript-based simulation engine used to create virtual environments where autonomous characters interact and socialize. It functions as a framework for orchestrating multiple AI agents within a persistent digital world, utilizing language models and a game engine to drive character behavior and social interactions. The project differentiates itself through a dedicated agent sandbox and a vector database agent store, which allow for the management of agent memories and world state. It integrates generative AI for background music and provides tools for simulation world design,
Displays database table records and system storage directly within the command line.
Deep Searcher is an open-source retrieval-augmented generation engine that indexes private documents into a vector database and uses large language models to answer complex questions with cited reasoning. It functions as both a command-line interface and a web API research tool, enabling users to load data and generate comprehensive reports by combining indexed private information with LLM-powered analysis. The system distinguishes itself through a plugin-based provider architecture that supports multiple embedding models, LLM providers, vector databases, and file loaders as interchangeable c
Loads documents into a vector database by running commands in a terminal.
jid is an interactive JSON query tool and data explorer for the terminal. It provides a keyboard-driven interface to navigate, evaluate, and extract information from nested JSON structures using dot-notation paths and real-time autocomplete. The tool integrates JMESPath to support complex data filtering and transformation. It distinguishes itself through a type-aware assistance system that provides context-aware function hints and argument templates based on the data type of the selected JSON node. The capability surface includes a terminal user interface with syntax highlighting, predictive
Provides a command-line utility for browsing and verifying records within JSON files with syntax highlighting.
This project is a Chinese localization repository and technical translation project designed to make concise programming projects and technical documentation accessible to Chinese speakers. It provides a collection of translated resources and curated mappings of computer science terminology to ensure consistent translation of technical concepts. The project implements a software localization workflow that converts English-language technical guides and codebase documentation into Chinese. This process utilizes a technical glossary resource and a resource-driven localization model to maintain t
Provides a command-line interface for browsing and verifying records within the database.
ExecuTorch is a lightweight C++ runtime for deploying PyTorch models on mobile, embedded, and edge hardware. It provides an ahead-of-time compilation pipeline that exports, quantizes, and lowers model graphs into compact serialized programs, then executes them through a minimal runtime with hardware acceleration and on-device large language model inference capabilities. The project distinguishes itself through a hardware accelerator delegate system that partitions model subgraphs and offloads computation to specialized backends including NPUs, GPUs, and DSPs from Apple, Arm, Intel, MediaTek,
ExecuTorch prints the runtime event table from a terminal command without writing Python code, accepting an optional record for operator correlation.
dsq is a command-line utility that enables SQL-based analysis of local files by treating them as relational database tables. It allows users to execute standard SQL queries against heterogeneous data formats, including JSON, CSV, Excel, and Parquet, without requiring a formal database import process. The tool distinguishes itself by providing a persistent interactive shell for iterative data exploration and schema inspection. It supports complex operations such as joining data across multiple disparate files and converting between structured formats by applying SQL transformations to the inpu
Displays the inferred structure and data types of a file to help users understand its contents without executing a full query.
OpenViking is a multi-tenant context server and knowledge base administration system designed to provide AI agents with persistent long-term memory. It enables the indexing of diverse documents and codebases to support retrieval-augmented generation, allowing agents to recall past interactions, user preferences, and learned experiences across sessions. The project is distinguished by its use of a URI-based virtual filesystem to organize memories, resources, and skills. It implements a tiered context loading system that balances retrieval precision with token budgets by structuring data into a
Includes a terminal-based file explorer to verify resource existence and browse vector record counts.