7 个仓库
Reading and writing comma-separated values files using custom delimiters and escaping rules.
Distinct from File Read and Write Operations: None of the candidates cover standard CSV parsing and serialization; they focus on raw binary files or sharding.
Explore 7 awesome GitHub repositories matching data & databases · CSV File Processing. Refine with filters or upvote what's useful.
LearnPython is a programming tutorial consisting of a collection of practical code examples used to demonstrate Python language features and programming patterns. It serves as a comprehensive learning resource that implements core language concepts through functional code. The project provides specialized guides and samples covering several key domains. These include asynchronous network programming with event loops and coroutines, data visualization using numerical datasets for 2D and 3D plots, and web scraping for fetching content and automating login flows. It also features instructions on
Implements reading and writing of CSV files using custom delimiters and escaping rules.
Lancet 是 Go 标准库的全面扩展,提供了一系列可复用的函数和数据结构,旨在减少应用中的样板代码。它是一个跨多个领域的通用工具包,包括并发、安全、网络和函数式逻辑。 该项目的特色在于针对 Go 并发的专用工具包(如键控锁和基于通道的流处理),以及支持柯里化和函数组合的专用函数式编程套件。它还包含一个使用 AES、RSA 和 SM 标准实现对称和非对称加密的专用密码学库。 该库涵盖了广泛的功能领域,包括 LRU 缓存和二叉搜索树等高级数据结构、代数和统计数学计算,以及用于进程和文件管理的操作系统集成。它还提供了用于 HTTP 网络、日期和时间操作,以及集合代数和惰性流求值等高级数据处理的实用程序。
Reads and writes tabular data between CSV files and slices or maps.
CsvHelper is a library for reading and writing comma-separated value files by mapping data to custom class objects. It functions as a parsing library and data mapper that converts flat-file text into structured data objects and serializes internal data sets back into standard CSV files. The project emphasizes memory efficiency through a parser that optimizes resource consumption. It utilizes field value caching and an interned string cache to store repetitive values, which reduces memory overhead when processing large datasets. The library provides a configuration-driven parsing engine that
Converts comma-separated value files into custom class objects or raw data for application use.
这是一个面向 .NET 生态系统的科学计算框架,提供了一套全面的数值分析、统计和数学优化库。它作为开发机器学习、数字信号处理和计算机视觉应用的基础工具包。 该框架提供了用于训练和部署预测模型的专用工具包,包括神经网络、支持向量机和决策树。它还通过对实时视觉分析(如对象跟踪和面部特征检测)的深度集成,以及用于捕获和过滤音频及传感器信号的专用数字信号处理库而脱颖而出。 其功能范围扩展到高级矩阵分解和线性代数、概率状态建模和启发式搜索算法。它还涵盖了广泛的数据操作实用程序,从降维和归一化到空间数据组织和科学可视化组件。 该系统包括用于摄像机配置、GPIO 端口管理和专用深度传感硬件的硬件集成控制器。
Parses comma-separated values from files using a high-performance, forward-only reader.
ExcelDataReader 是一个 C# 库,用于从 Microsoft Excel 电子表格和 CSV 文件中提取数据和元数据。它作为一个工作簿解析器,将电子表格内容转换为结构化数据集,以便进行程序化访问和迭代。 该项目包含一个专门的元数据提取器,用于检索单元格级别的详细信息,例如数字格式、样式、行高、列宽和合并单元格范围。它还提供了一个流处理器,用于解析具有可自定义编码和分隔符检测功能的纯文本 CSV 文件。 该库支持现代电子表格文件的 OpenXML 标准,并利用基于流的解析和基于游标的行迭代来遍历工作簿。这些功能允许将多工作表工作簿转换为关系数据表。
Processes plain text streams using comma separated values with customizable encoding and separator detection.
这是一个 Unix 点文件(dotfiles)和脚本合集,旨在个性化和标准化 Shell 及操作系统环境。它为 Bash 和 Fish Shell 提供可移植配置,以及为 Vim 编辑器提供自定义偏好和命令映射。 该合集包括自动化引导脚本,通过安装应用程序、管理包管理器和创建配置符号链接来初始化新机器。它具有专门的 Git 工作流自动化功能以加速版本控制任务,以及使用基于硬件的身份验证和连接持久性的安全 SSH 连接管理。 该工具的覆盖范围扩展到广泛的系统管理工具,包括交互式进程终止、基于学习的目录导航和二进制路径审计。它还涵盖了用于浏览器自动化、本地静态文件托管以及视频转码和视觉图像差异对比等媒体处理任务的开发者生产力助手。
Provides a terminal utility to format comma-separated values into aligned columns for easier reading.
Csvlens is a command-line utility designed for the inspection and navigation of large structured tabular files. It provides a terminal-based interface that allows users to view, sort, and filter datasets directly within the command line without the memory overhead associated with loading entire files into spreadsheet applications. The tool utilizes memory-mapped file access and lazy-loading data streams to maintain a constant memory footprint, regardless of the total size of the source document. By identifying field boundaries during an initial scan, it enables efficient column-specific opera
Inspects and navigates large comma-separated files using memory-mapped access and lazy-loading for high performance.