NVIDIANVTabular

View on GitHub

0 stars0 forks0 views

NVTabular

Features

Big Data and Distributed Computing - Feature engineering for large-scale tabular data.

Open-source alternatives to NVTabular

Similar open-source projects, ranked by how many features they share with NVTabular.

dask/dask
dask/dask
13,746View on GitHub
Dask is a parallel computing framework and distributed task scheduler designed to scale Python data science workflows from single machines to large clusters. It functions as a cluster resource manager that orchestrates computational logic by representing tasks and their dependencies as directed acyclic graphs. This architecture allows the system to automate the distribution of workloads across available hardware while managing complex execution requirements. The project distinguishes itself through a lazy evaluation engine that defers data operations until they are explicitly requested, enabl
Pythondasknumpypandas
View on GitHub13,746
google/tensorstore
google/tensorstore
1,522View on GitHub
Library for reading and writing large multi-dimensional arrays.
C++
View on GitHub1,522
h2oai/h2o-3
h2oai/h2o-3
7,493View on GitHub
h2o-3 is a distributed machine learning platform and automated machine learning framework designed for training and deploying predictive models using distributed in-memory computing. It functions as a deep learning framework and a distributed model scoring engine, capable of operating as a Kubernetes ML cluster to process large datasets in parallel. The platform distinguishes itself through automated machine learning capabilities that automatically select the best algorithms and hyperparameters to optimize model performance. It provides specialized deep learning toolkits for tasks including i
Jupyter Notebookautomlbig-datadata-science
View on GitHub7,493
cupy/cupy
cupy/cupy
11,000View on GitHub
CuPy is a CUDA array computing library that implements a NumPy-compatible interface for executing array operations and numerical computing on NVIDIA GPUs. It serves as a GPU-accelerated numerical library and a CUDA-based SciPy implementation, offloading heavy calculations to graphics hardware to increase processing speed for scientific and engineering workloads. The library enables multi-framework tensor exchange, allowing data buffers to be shared between different deep learning frameworks using standardized memory layouts to avoid memory copies. It also supports custom GPU kernel integratio
Python
View on GitHub11,000

See all 9 alternatives to NVTabular

NVTabular

Features

Open-source alternatives to NVTabular

dask/dask

google/tensorstore

h2oai/h2o-3

cupy/cupy

Star history

Open-source alternatives to NVTabular

dask/dask

google/tensorstore

h2oai/h2o-3

cupy/cupy