15 repositories

Awesome GitHub Repositories: Distributed Computing Engines

Frameworks designed for processing and transforming massive datasets across distributed computing environments.

Explore 15 awesome GitHub repositories matching data & databases · Distributed Computing Engines.


zama-ai/fhevm
25,215
fhevm is a full-stack blockchain framework designed to integrate Fully Homomorphic Encryption into smart contracts. It provides a platform for developing confidential smart contracts that can process encrypted data and execute private on-chain computations without decrypting the underlying information. The framework utilizes a coprocessor system to offload resource-intensive encrypted operations to an asynchronous service, improving blockchain performance and scalability. It incorporates a secure key management service based on multi-party computation and a zero-knowledge proof verifier to en
Provides an asynchronous computation service that offloads resource-intensive encrypted operations to maintain scalability.
Rust · blockchain · fhe · privacy
ornicar/lila
18,362
Lila is an open-source chess server and multiplayer platform designed for playing, analyzing, and streaming games. It functions as a comprehensive environment for hosting competitive play and managing player profiles. The platform integrates a distributed chess engine interface to evaluate complex positions and a collaborative analysis board that allows multiple users to study and coordinate insights in real time. It also includes an online tournament platform for organizing competitive events, simultaneous exhibitions, and structured player leagues. The system maintains a searchable game da
Implements a distributed computing engine specialized for evaluating chess positions and calculating optimal moves in parallel.
Scala
lichess-org/lila
18,362
Lila is a comprehensive, open-source chess gaming platform designed for real-time multiplayer interaction, competitive tournament management, and deep strategic analysis. It provides a global environment where users can engage in live matches, participate in structured competitions, and access extensive archives of historical game data for research and study. The platform distinguishes itself through a highly scalable architecture that utilizes actor-model concurrency and event-sourced game states to ensure precise match reconstruction and fault tolerance. It integrates distributed engine eva
Offloads computationally intensive move evaluations to a cluster of specialized servers for real-time tactical insights.
Scala · chess · free-software · functional-programming
pingcap/tikv
16,724
TiKV is a cloud-native distributed transactional key-value store and storage engine. It provides a distributed database designed for horizontal scalability and strong consistency across a cluster of physical nodes. The system uses a Raft-based consensus mechanism to maintain data availability and state synchronization. It ensures ACID compliance for distributed transactions through a two-phase commit workflow and manages data distribution via multi-Raft sharding. The engine handles massive datasets using automated range splitting and cluster load balancing to distribute data across different
Implements a coprocessor for executing filtering and aggregation logic directly on storage nodes to minimize network latency.
Rust
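The coprocessor idea in the TiKV entry, running filter and aggregation logic on the storage nodes so that only small partial results cross the network, can be illustrated with a minimal sketch. Everything here is hypothetical: `StorageNode`, `coprocess`, and the row layout are illustrative names chosen for this example, not TiKV's API.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional, Tuple

Row = Dict[str, int]


@dataclass
class StorageNode:
    """Hypothetical storage node holding one shard of rows."""
    rows: List[Row] = field(default_factory=list)

    def scan(self) -> List[Row]:
        # Without pushdown: every row in the shard is shipped to the caller.
        return list(self.rows)

    def coprocess(self, predicate: Callable[[Row], bool], column: str) -> Tuple[int, int]:
        # With pushdown: filter and aggregate locally, return only (sum, count).
        matched = [row[column] for row in self.rows if predicate(row)]
        return sum(matched), len(matched)


def average_with_pushdown(nodes: List[StorageNode],
                          predicate: Callable[[Row], bool],
                          column: str) -> Optional[float]:
    # The coordinator merges one small partial aggregate per node.
    partials = [node.coprocess(predicate, column) for node in nodes]
    total = sum(s for s, _ in partials)
    count = sum(c for _, c in partials)
    return total / count if count else None


def average_without_pushdown(nodes: List[StorageNode],
                             predicate: Callable[[Row], bool],
                             column: str) -> Optional[float]:
    # The coordinator pulls every row and does all the work itself.
    values = [row[column] for node in nodes for row in node.scan() if predicate(row)]
    return sum(values) / len(values) if values else None
```

Both functions return the same average; the difference is that the pushdown variant transfers one `(sum, count)` pair per node instead of every stored row.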
RaRe-Technologies/gensim
16,442
Gensim is an unsupervised natural language processing toolkit designed for topic modeling, word embedding training, and the processing of large-scale text corpora. It provides a framework for discovering latent themes and semantic structures in text without the need for labeled data. The toolkit is distinguished by its ability to handle datasets that exceed system memory through iterator-based data streaming from disk. It also supports distributed model training, allowing complex modeling tasks to be executed across computer clusters. The library covers a broad range of analysis capabilities
Functions as a distributed computing engine for processing and transforming massive text corpora.
Python
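The "iterator-based data streaming from disk" mentioned in the Gensim entry can be sketched in plain Python. This is a minimal illustration of the pattern, not Gensim code: `StreamedCorpus` and `count_tokens` are hypothetical names, and the one-document-per-line file layout is an assumption.

```python
from collections import Counter
from typing import Iterable, Iterator, List


class StreamedCorpus:
    """Yields one tokenized document per line, re-reading the file on each pass."""

    def __init__(self, path: str) -> None:
        self.path = path

    def __iter__(self) -> Iterator[List[str]]:
        # Only the current line is held in memory, so the corpus
        # can be larger than available RAM.
        with open(self.path, encoding="utf-8") as handle:
            for line in handle:
                yield line.lower().split()


def count_tokens(corpus: Iterable[List[str]]) -> Counter:
    """Single streaming pass that accumulates token frequencies."""
    counts: Counter = Counter()
    for document in corpus:
        counts.update(document)
    return counts
```

Because `__iter__` reopens the file each time, the same object can be iterated repeatedly, which matters for training procedures that make several passes over the data.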
official-stockfish/Stockfish
14,802
Stockfish is a high-performance chess engine designed to evaluate board positions and calculate optimal moves. It functions as a command-line tool that utilizes neural network-based search algorithms to assess complex game states and determine strategic advantages. The engine is fully compliant with the Universal Chess Interface, allowing it to exchange commands and move data with external graphical user interfaces and professional analysis software. The engine distinguishes itself through advanced computational strategies that maximize hardware efficiency and search depth. It employs multi-t
Functions as a high-performance UCI-compliant engine that evaluates board positions and calculates optimal moves.
C++ · chess · chess-engine · cpp
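The Universal Chess Interface exchange described in the Stockfish entry is a line-based text protocol. The sketch below only builds the command lines a client might send and parses a `bestmove` reply; it does not launch an engine. The helper names `build_uci_commands` and `parse_bestmove` are hypothetical, and a real client would also wait for `uciok` and `readyok` before continuing.

```python
from typing import List, Optional


def build_uci_commands(moves: List[str], depth: int) -> List[str]:
    """Return the text lines a client could write to the engine's stdin."""
    position = "position startpos"
    if moves:
        # Moves use long algebraic notation, e.g. "e2e4".
        position += " moves " + " ".join(moves)
    return ["uci", "isready", "ucinewgame", position, f"go depth {depth}"]


def parse_bestmove(output: str) -> Optional[str]:
    """Extract the move from a 'bestmove <move> [ponder <move>]' line."""
    for line in output.splitlines():
        parts = line.split()
        if len(parts) >= 2 and parts[0] == "bestmove":
            return parts[1]
    return None
```

In practice these lines would be written to the engine process's standard input and its standard output scanned with `parse_bestmove`.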
oxnr/awesome-bigdata
14,454
This project is a curated directory of software, frameworks, and educational resources designed for building, scaling, and maintaining distributed data processing and storage architectures. It serves as a comprehensive index for the distributed computing ecosystem, helping users identify the appropriate tools for managing large-scale information systems. The repository functions as a central hub for data engineering, offering categorized access to technologies that support batch and stream processing, machine learning, and interactive querying. By organizing these resources, it assists in the
Indexes a wide range of distributed computing engines and frameworks for batch, stream, and interactive data processing.
awesome · awesome-list · bigdata
ydataai/ydata-profiling
13,388
Ydata-profiling is an automated exploratory data analysis framework designed to generate comprehensive statistical reports and visual summaries from dataframes. It functions as a diagnostic tool for assessing data quality, identifying missing values, duplicates, and outliers, while providing a scalable engine for profiling massive datasets across distributed enterprise environments. The project distinguishes itself through its ability to handle large-scale data through distributed task orchestration and lazy stream processing, which minimizes memory overhead during complex computations. It in
Scales data profiling tasks across distributed enterprise environments to handle massive datasets efficiently.
Python · big-data-analytics · data-analysis · data-exploration
modin-project/modin
10,389
Modin is a distributed dataframe library and parallel data processing engine designed to handle large datasets that exceed system memory. It functions as a distributed computing framework that parallelizes data manipulation tasks across multiple CPU cores or clusters to increase throughput and avoid memory errors. The project mirrors the Pandas API, allowing for the distribution of data workflows without changing core code logic. It utilizes a pluggable backend interface, which enables users to switch between different distributed execution engines to optimize performance based on available h
Provides a framework for processing and transforming massive datasets across distributed computing environments.
Python · analytics · data-science · dataframe
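The Modin entry describes parallelizing data manipulation across cores or clusters. A rough sketch of the underlying partition-and-combine pattern follows. It is not Modin's API or implementation: `split_rows`, `partial_sum`, and `parallel_column_sum` are hypothetical helpers, and threads stand in for worker processes or cluster nodes purely to keep the example self-contained.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Dict, List, Sequence

Row = Dict[str, float]


def split_rows(rows: Sequence[Row], n_partitions: int) -> List[List[Row]]:
    """Split rows into at most n_partitions contiguous chunks."""
    size = max(1, -(-len(rows) // n_partitions))  # ceiling division
    return [list(rows[i:i + size]) for i in range(0, len(rows), size)]


def partial_sum(partition: List[Row], column: str) -> float:
    """Work done independently on each partition."""
    return sum(row[column] for row in partition)


def parallel_column_sum(rows: Sequence[Row], column: str, n_partitions: int = 4) -> float:
    """Map partial_sum over the partitions, then combine the partial results."""
    partitions = split_rows(rows, n_partitions)
    with ThreadPoolExecutor(max_workers=n_partitions) as pool:
        partials = list(pool.map(lambda part: partial_sum(part, column), partitions))
    return sum(partials)
```

The caller sees a single `parallel_column_sum` call, which mirrors the idea of keeping user code unchanged while the execution backend handles the distribution.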
OpenMined/PySyft
9,907
PySyft is a privacy-preserving machine learning framework and remote computation engine. It functions as a decentralized data analysis orchestrator that allows for the execution of data science workflows on remote servers without requiring the transfer of raw private data from the host device. The platform provides a secure collaboration environment where data owners manage permissions and authorize specific collaborators to run computations. It differentiates its workflow by utilizing mock data for local development and validation before submitting final analysis jobs to private remote serve
Provides a remote computation engine that allows analysis jobs to run on private data sources without raw information leaving the host.
Python · cryptography · deep-learning · federated-learning
apache/seatunnel
9,427
SeaTunnel is a distributed data integration engine designed to synchronize structured and unstructured data across diverse sources and sinks. It functions as a multi-engine execution framework that can run data integration tasks across different distributed computing backends to optimize workload performance. The project is distinguished by a visual data pipeline designer for configuring workflows without manual code and a specialized change data capture tool for streaming incremental database updates. It also includes an enrichment pipeline that integrates large language models and embedding
Functions as a framework that can execute data integration tasks across various distributed computing backends.
Java · apache · batch · cdc
featuretools/featuretools
7,655
Featuretools is a Python data science library and automated feature engineering framework designed to create predictive features from multiple related datasets. It automates the data preparation and transformation steps required for machine learning models through deep feature synthesis. The library enables the automatic generation of comprehensive feature tables by applying recursive transformations to relational data. It supports the transformation of unstructured text into structured numeric features and allows users to define custom primitives to extend the synthesis process with specific
Offloads heavy feature computation to multiple cores or clusters using distributed computing engines.
Python
feast-dev/feast
6,727
Feast is an open-source feature store for machine learning that provides a central platform for defining, storing, and serving features across both training and inference workflows. It operates as a declarative system where feature definitions are written as code in Python files, synchronized to a central registry, and made available for low-latency online retrieval or point-in-time correct historical joins for training datasets. The project abstracts storage behind a pluggable architecture, allowing offline and online backends to be swapped without changing retrieval logic, and coordinates ma
Computes and writes the latest batch feature values into an online store for low-latency serving.
Python · big-data · data-engineering · data-quality
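Two ideas from the Feast entry, writing the latest batch feature values into an online store and performing point-in-time correct lookups for training data, can be sketched with plain dictionaries. This is an illustrative model only: `FeatureRow`, `materialize_latest`, and `value_as_of` are hypothetical names, not part of Feast's API.

```python
from datetime import datetime
from typing import Dict, Iterable, List, NamedTuple, Optional


class FeatureRow(NamedTuple):
    entity_id: str
    event_time: datetime
    value: float


def materialize_latest(rows: Iterable[FeatureRow],
                       online_store: Dict[str, FeatureRow]) -> None:
    """Keep only the most recent row per entity in the online store."""
    for row in rows:
        current = online_store.get(row.entity_id)
        if current is None or row.event_time > current.event_time:
            online_store[row.entity_id] = row


def value_as_of(rows: List[FeatureRow], entity_id: str,
                as_of: datetime) -> Optional[float]:
    """Point-in-time lookup: latest value known at or before `as_of`."""
    candidates = [r for r in rows
                  if r.entity_id == entity_id and r.event_time <= as_of]
    if not candidates:
        return None
    return max(candidates, key=lambda r: r.event_time).value
```

The online store answers "what is the current value?" with a single key lookup, while `value_as_of` avoids leaking future values into a training example built for an earlier timestamp.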
FederatedAI/FATE
6,048
FATE is an open-source federated learning platform that enables multiple organizations to collaboratively train machine learning models without exposing raw data to any party. It provides a complete framework for private data collaboration, allowing participants to jointly compute on sensitive information while maintaining data privacy and security guarantees through secure multi-party computation protocols. The platform distinguishes itself through its comprehensive infrastructure management capabilities, supporting automated deployment of multi-party clusters using Ansible-driven provisioni
Runs distributed data processing and computation across parties using a custom cluster manager.
Python · algorithm · fate · federated-learning
microsoft/SEAL
3,985
SEAL is a homomorphic encryption library and C++ cryptography framework that enables mathematical operations on encrypted data without decrypting it. It provides a toolkit for performing addition and multiplication on encrypted integers and complex numbers to support privacy-preserving computation. The framework implements the BFV and CKKS schemes, allowing both modular arithmetic on encrypted integers and approximate arithmetic on fixed-precision floating-point numbers. It includes dedicated wrappers for integrating these encryption workflows into .NET environments and supports cross-platform deployment on Android, iOS, and WebAssembly. The library manages computational precision and security through ciphertext modulus switching, noise control, and validation of encryption parameters against security standards. It also provides tools for ciphertext compression and for building encrypted storage architectures in which the service provider cannot access the decryption keys.
Provides a framework where data stays encrypted throughout its lifecycle and service providers never access keys.
C++ · cryptography · encryption · homomorphic-encryption
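To make "operations on encrypted data without decryption" concrete, here is a toy sketch of an additively homomorphic scheme. Note the substitution: this is the Paillier cryptosystem, not the BFV or CKKS schemes that SEAL implements, and it does not use SEAL's API. The primes are deliberately tiny, so the example offers no real security and is for illustration only.

```python
import math
import random

# Toy parameters (assumption: chosen only for readability, far too small to be secure).
P, Q = 1789, 1861
N = P * Q
N_SQ = N * N
G = N + 1
LAM = (P - 1) * (Q - 1) // math.gcd(P - 1, Q - 1)  # lcm(p - 1, q - 1)
MU = pow(LAM, -1, N)  # modular inverse, requires Python 3.8+


def encrypt(m: int) -> int:
    """Encrypt an integer 0 <= m < N under the public key (N, G)."""
    r = random.randrange(1, N)
    while math.gcd(r, N) != 1:
        r = random.randrange(1, N)
    return (pow(G, m, N_SQ) * pow(r, N, N_SQ)) % N_SQ


def decrypt(c: int) -> int:
    """Decrypt with the private values LAM and MU."""
    x = pow(c, LAM, N_SQ)
    return ((x - 1) // N * MU) % N


def add_encrypted(c1: int, c2: int) -> int:
    """Multiplying ciphertexts adds the underlying plaintexts (mod N)."""
    return (c1 * c2) % N_SQ
```

A party holding only ciphertexts can call `add_encrypted` without ever seeing the plaintexts or the decryption key; only the key holder can call `decrypt` on the result.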
