5 repositories
Manages secure access and credential configuration for cloud-based data storage and processing services.
Distinguishing note: Focuses on credential management for cloud connectivity, distinct from general database connection strings.
Explore 5 GitHub repositories matching Data & Databases · Cloud Authentication.
Polars is a high-performance columnar data processing library designed for efficient analytical workflows. It functions as a structured data library that organizes information into typed columns, utilizing the Apache Arrow memory format to enable zero-copy data sharing and cache-friendly, vectorized operations. The engine is built to handle large-scale tabular datasets, providing both local and distributed analytical runtimes that scale from single-machine environments to multi-node clusters. The project distinguishes itself through a sophisticated lazy query engine that constructs abstract e
Configures secure cloud credentials to authenticate remote tasks and manage access to cloud resources.
Prefect is a workflow orchestration platform designed to define, schedule, and monitor complex data pipelines as Python code. It functions as a container-native engine that wraps individual tasks in isolated environments, ensuring consistent dependencies and resource allocation across diverse infrastructure. By utilizing a state-machine-based orchestration model, the system tracks execution progress through discrete transitions and persistent event logs to maintain reliable and observable task processing. The platform distinguishes itself through a decoupled worker-API architecture, which sep
Stores and manages access credentials for external cloud providers to securely authorize automated workflows to interact with remote infrastructure.
dbt-core is a command-line framework for transforming data within a warehouse using modular SQL and version control. It functions as a data transformation engine that enables users to define data structures and business logic through declarative configuration files, which the system then compiles into executable code. By managing complex data dependencies through a directed acyclic graph, it ensures that transformation tasks execute in the correct order while maintaining a manifest-driven state to track lineage and execution history. The project distinguishes itself through an adapter-based d
Connects the local environment to a remote platform using credential files to enable integrated development and management.
This repository is a collection of Python code examples that demonstrate how to use Google Cloud Platform services and APIs. Each sample is organized as a self-contained directory with its own dependencies, making it independently runnable and testable. The samples rely on Google's auto-generated Python client libraries and standardize invocation through command-line argument parsing, with configuration read from environment variables for portability across development and CI environments. The examples cover authentication setup using the gcloud CLI, along with practical demonstrations for se
Obtains local credentials via the gcloud CLI to authorize API requests from a development environment.
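As a rough illustration of the gcloud-based flow described above: after running `gcloud auth application-default login`, Google client libraries look for Application Default Credentials in a well-known file unless the `GOOGLE_APPLICATION_CREDENTIALS` environment variable points elsewhere. The sketch below only approximates that lookup with the standard library. The helper name `adc_candidate_path` is hypothetical and not part of any Google library, and the real resolution logic in the client libraries covers more cases (for example, metadata-server credentials on cloud hosts).

```python
import os
import sys
from pathlib import Path
from typing import Mapping


def adc_candidate_path(
    env: Mapping[str, str],
    home: Path,
    platform: str = sys.platform,
) -> Path:
    """Return where Application Default Credentials would be expected.

    Hypothetical helper for illustration only; it mirrors the documented
    lookup order in simplified form.
    """
    # 1. An explicit credentials file always takes precedence.
    explicit = env.get("GOOGLE_APPLICATION_CREDENTIALS")
    if explicit:
        return Path(explicit)

    # 2. Otherwise fall back to the file written by
    #    `gcloud auth application-default login`.
    if platform.startswith("win"):
        # Assumption: fall back to the home directory if APPDATA is unset.
        base = Path(env.get("APPDATA", str(home))) / "gcloud"
    else:
        base = home / ".config" / "gcloud"
    return base / "application_default_credentials.json"


if __name__ == "__main__":
    path = adc_candidate_path(os.environ, Path.home())
    status = "found" if path.is_file() else "missing"
    print(f"Expected credentials file: {path} ({status})")
```

In real code, the usual approach is to let the client library resolve credentials itself rather than reading this file directly; the sketch is only meant to show where a development environment's credentials typically live.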
mimic-code is a clinical data analysis framework and toolset for processing deidentified electronic health records and intensive care unit data. It provides a healthcare SQL query library and a processing tool to transform raw health records into formats suitable for longitudinal analysis and machine learning. The project features a medical research notebook environment that integrates with cloud-hosted datasets, allowing for remote querying and analysis. It includes a DICOM imaging pipeline to retrieve chest radiographs and link medical imaging with structured clinical metadata. The framewo
Manages the addition of authenticated user accounts to access lists for secure cloud storage.