Why is ibis-project/ibis a recommended Dataframe Format Conversion GitHub Repositories repository?

Implements a bridge to convert query execution results between SQL backends and formats like Pandas, Polars, and PyArrow.

Why is rapidsai/cuml a recommended Dataframe Format Conversion GitHub Repositories repository?

Processes data directly from various in-memory dataframe and tensor formats without requiring manual conversion.

Why is awslabs/gluonts a recommended Dataframe Format Conversion GitHub Repositories repository?

Transforms tabular Pandas dataframes into structured formats suitable for time series modeling.

3 रिपॉजिटरी

Awesome GitHub RepositoriesDataframe Format Conversion

Converting query results between different in-memory dataframe and tensor formats.

Distinct from Object Result Fetches: Candidates focus on caching or API transformers; this is specifically about interop between Pandas, Polars, and PyArrow.

Explore 3 awesome GitHub repositories matching data & databases · Dataframe Format Conversion. Refine with filters or upvote what's useful.

AI के साथ बेहतरीन रिपॉजिटरी खोजें।हम AI का उपयोग करके सबसे सटीक रिपॉजिटरी खोजेंगे।

ibis-project/ibis
ibis-project/ibis
6,574GitHub पर देखें
Ibis is a portable Python dataframe library and multi-backend query engine that provides a unified interface for executing data transformations across diverse compute engines. It functions as a Python SQL expression compiler and dialect transpiler, allowing users to define data logic once and execute it across cloud warehouses, embedded databases, and distributed clusters without rewriting code. The project distinguishes itself through a database backend abstraction that decouples transformation logic from the underlying execution engine. It enables polyglot data workflows by mixing raw SQL s
Implements a bridge to convert query execution results between SQL backends and formats like Pandas, Polars, and PyArrow.
Pythonbigqueryclickhousedatabase
GitHub पर देखें6,574
rapidsai/cuml
rapidsai/cuml
5,209GitHub पर देखें
cuml, एक GPU-एक्सेलेरेटेड मशीन लर्निंग लाइब्रेरी और फ्रेमवर्क है जो टैबुलर डेटा प्रीप्रोसेसिंग और मॉडल निष्पादन को गति देने के लिए CUDA का उपयोग करता है। यह NVIDIA GPUs और GPU क्लस्टर्स पर क्लासिफिकेशन, रिग्रेशन और क्लस्टरिंग मॉडल्स को प्रशिक्षित और डिप्लॉय करने के लिए टूल्स का एक सूट प्रदान करता है। लाइब्रेरी को स्केलेबिलिटी के लिए डिज़ाइन किया गया है, जो एक डिस्ट्रिब्यूटेड GPU मशीन लर्निंग वातावरण प्रदान करती है जो सिंगल-डिवाइस मेमोरी से अधिक डेटासेट्स को संभालने के लिए हार्डवेयर एक्सेलेरेटर्स और नोड्स में गणना और डेटा फैला सकती है। यह मानक एस्टिमेटर इंटरफ़ेस को मिरर करती है ताकि मौजूदा वर्कफ़्लो के भीतर CPU-आधारित मॉडल्स को GPU-एक्सेलेरेटेड वर्शन्स के साथ बदला जा सके। प्रोजेक्ट मशीन लर्निंग क्षमताओं की एक विस्तृत श्रृंखला को कवर करता है, जिसमें सुपरवाइज्ड लर्निंग, अनसुपरवाइज्ड क्लस्टरिंग, नियरेस्ट नेबर सर्च और हाई-डायमेंशनल डाइमेंशनलिटी रिडक्शन शामिल है।
Processes data directly from various in-memory dataframe and tensor formats without requiring manual conversion.
Python
GitHub पर देखें5,209
awslabs/gluonts
awslabs/gluonts
5,199GitHub पर देखें
GluonTS एक प्रोबेबिलिस्टिक टाइम सीरीज़ लाइब्रेरी और डीप लर्निंग फोरकास्टिंग फ्रेमवर्क है। यह न्यूरल नेटवर्क आर्किटेक्चर बनाने, प्रशिक्षित करने और मूल्यांकन करने के लिए एक टूलकिट प्रदान करता है, जो अनिश्चितता को मापने के लिए भविष्य के मूल्यों को प्रायिकता वितरण (probability distributions) के रूप में अनुमानित करते हैं। यह प्रोजेक्ट ज़ीरो-शॉट फोरकास्टिंग का समर्थन करके और डीप प्रोबेबिलिस्टिक न्यूरल नेटवर्क तथा Prophet और R forecast जैसी बाहरी सांख्यिकीय लाइब्रेरी के लिए रैपर सहित विविध मॉडलिंग दृष्टिकोणों को एकीकृत करके खुद को अलग करता है। यह सूचना रिसाव को रोकने और लेटेंट रिप्रेजेंटेशन को वैध प्रायिकता वितरण में मैप करने के लिए कैज़ुअल कन्वोल्यूशन और इनवर्टिबल रेसिड्यूअल नेटवर्क जैसे विशेष आर्किटेक्चरल प्रिमिटिव्स को लागू करता है। यह फ्रेमवर्क टाइम सीरीज़ स्केलिंग, बाइजेक्टिव ट्रांसफॉर्मेशन और पदानुक्रमित मॉडलिंग सहित एक व्यापक डेटा इंजीनियरिंग सतह को कवर करता है। यह उच्च-प्रदर्शन डेटासेट स्ट्रीमिंग और रैंडम-एक्सेस प्रबंधन के लिए Apache Arrow और Parquet का उपयोग करता है। मॉडल मूल्यांकन के लिए, इसमें क्वांटाइल लॉस और कंटीन्यूअस रैंक प्रोबेबिलिटी स्कोर जैसे मेट्रिक्स का उपयोग करके पूर्वानुमान सटीकता और प्रोबेबिलिस्टिक कवरेज को मापने के लिए एक मूल्यांकन सूट शामिल है। यह लाइब्रेरी Amazon SageMaker के साथ एकीकरण के माध्यम से मॉडल परिनियोजन (deployment) का समर्थन करती है।
Transforms tabular Pandas dataframes into structured formats suitable for time series modeling.
Pythonartificial-intelligenceawsdata-science
GitHub पर देखें5,199

Awesome Dataframe Format Conversion GitHub Repositories

ibis-project/ibis

rapidsai/cuml

awslabs/gluonts