What are the best Awesome Data Analysis & Visualization GitHub Repositories?

This group focuses on tools and techniques for analyzing, interpreting, and visually representing data. Explore 409 awesome GitHub repositories matching data & databases · Data Analysis & Visualization. Refine with filters or upvote what's useful. Top picks: kamranahmedse/developer-roadmap, vinta/awesome-python, awesome-selfhosted/awesome-selfhosted, practical-tutorials/project-based-learning, ossu/computer-science, n8n-io/n8n, jackfrued/python-100-days, d3/d3, growinggit/github-chinese-top-ch…

Why is kamranahmedse/developer-roadmap a recommended Data Analysis & Visualization repository?

Provides visual representations of technical learning paths and skill progression.

Why is vinta/awesome-python a recommended Data Analysis & Visualization repository?

Processes large-scale datasets and performs complex statistical exploration using high-level computational engines.

Why is awesome-selfhosted/awesome-selfhosted a recommended Data Analysis & Visualization repository?

Collects and reports website event data over short-term periods to provide insights into user activity.

Why is practical-tutorials/project-based-learning a recommended Data Analysis & Visualization repository?

Renders dynamic and interactive data visualizations by binding arbitrary data to document elements and applying transformations to the underlying structure.

Why is ossu/computer-science a recommended Data Analysis & Visualization repository?

Provides resources and guidance for analyzing and visualizing data as part of the broader computer science curriculum.

Why is n8n-io/n8n a recommended Data Analysis & Visualization repository?

Captures and manages operational metrics with configurable retention and compaction settings for self-hosted instances.

Why is jackfrued/Python-100-Days a recommended Data Analysis & Visualization repository?

Implements numerical computing, data manipulation, and visualization workflows using industry-standard analytical libraries.

Why is d3/d3 a recommended Data Analysis & Visualization repository?

Implements interactive selection areas that allow users to highlight and isolate specific data ranges within a visualization.

Why is GrowingGit/GitHub-Chinese-Top-Charts a recommended Data Analysis & Visualization repository?

Monitors open-source project activity and ecosystem trends to deliver insights into software popularity and health.

Why is punkpeye/awesome-mcp-servers a recommended Data Analysis & Visualization repository?

Bridges high-performance mathematical engines with analytical frameworks to execute complex data processing and visualization tasks.

409 repositories

Awesome GitHub Repositories · Data Analysis & Visualization

kamranahmedse/developer-roadmap
357,434 stars
Developer Roadmap is a community-driven platform that offers structured, graph-based learning paths for software engineering. It serves as a comprehensive knowledge repository where technical domains are organized into visual sequences to guide professional skill acquisition and career growth. The project is distinguished by a collaborative ecosystem that lets users contribute roadmaps, curate industry best practices, and maintain professional profiles. It integrates diagnostic assessment frameworks to evaluate technical competence, helping developers identify knowledge gaps and prepare for professional interviews through targeted learning sequences. Beyond its core mapping capabilities, the platform offers practical project ideas and interactive tutoring to reinforce engineering concepts. It provides a centralized space for the community to share resources, track progressive skill development, and navigate complex technical landscapes.
Provides visual representations of technical learning paths and skill progression.
TypeScript · angular-roadmap · backend-roadmap · blockchain-roadmap

vinta/awesome-python
303,207 stars
This project is a comprehensive, community-curated directory that organizes a vast landscape of Python libraries, frameworks, and software tools. It serves as a centralized knowledge base designed to ease navigation of the ecosystem and speed up developer discovery across the entire software development lifecycle. The directory is distinguished by a structured index of resources categorized by technical domain, ranging from fundamental development utilities to specialized engineering fields. It covers high-level capabilities including artificial intelligence, data science, web development, and infrastructure management, letting developers identify vetted solutions for specific technical challenges. The project spans a broad capability surface, including tools for dependency management, static code analysis, and automated testing. It also catalogs resources for persistent data storage, cloud infrastructure orchestration, and interface development, offering a unified reference for building and maintaining complex software systems.
Processes large-scale datasets and performs complex statistical exploration using high-level computational engines.
Python · awesome · collections · python

awesome-selfhosted/awesome-selfhosted
299,516 stars
This project is a community-curated directory of open-source software designed for deployment in private server environments and home labs. It serves as a comprehensive resource for discovering independent, self-hosted alternatives to mainstream cloud services, letting users retain full ownership of their data and control over their digital infrastructure. The directory is structured through a hierarchical taxonomy that organizes a vast collection of applications into logical categories, ranging from media management and data analysis to private communication and team productivity tools. It is distinguished by a collaborative peer-review process in which community members validate the quality and relevance of each submission to ensure the directory remains accurate and reliable. The project spans a broad capability surface, including infrastructure automation, container-based service deployment, and declarative configuration management. These tools help users maintain reproducible server environments and manage complex service dependencies on private hardware. The directory is maintained as a version-controlled repository, ensuring that all community-driven updates and changes are tracked and transparent.
Collects and reports website event data over short-term periods to provide insights into user activity.
awesome · awesome-list · cloud

practical-tutorials/project-based-learning
270,530 stars
This project is a centralized, community-driven repository of hands-on tutorials designed to build skills through the practical construction of real-world software applications. It serves as a comprehensive directory that aggregates external documentation and instructional materials, offering a structured path for developers to master specific programming languages and technical domains. The repository is distinguished by organizing disparate technical resources into a hierarchical, taxonomy-based structure that lets developers discover and navigate various software engineering disciplines. By grouping individual projects into logical sequences, it offers a roadmap that helps learners progress from fundamental concepts to advanced implementation. The content is maintained through collaborative contributions, ensuring the collection remains a current and expansive resource for the developer community. The project spans a broad capability surface, covering areas such as full-stack web development, mobile application engineering, and interactive game development. It includes resources for a wide range of programming languages, from systems-level languages such as C, C++, and Rust to high-level and functional languages such as Python, Ruby, Haskell, and Clojure. These materials support specialized technical mastery in areas such as machine learning, data science, and network programming. The directory is structured for efficient discovery by programming language and technical domain, with a clear table of contents to help users locate specific information. It functions as a persistent index of external links, connecting developers to third-party documentation and tutorials to deepen their understanding of technical concepts.
Renders dynamic and interactive data visualizations by binding arbitrary data to document elements and applying transformations to the underlying structure.
beginner-project · cpp · golang

ossu/computer-science
205,190 stars
This project provides a structured computer science curriculum framework designed for self-taught learners. It organizes open-access academic resources, including textbooks, courses, and assignments, into a coherent path that mirrors the requirements of a formal university degree. By integrating theoretical study with practical software engineering methodologies, the platform lets students independently master fundamental concepts and advanced technical skills. The curriculum is distinguished by its use of a version-control-based workflow to manage the educational experience. Learners use repository-based tools to track academic milestones, keep a persistent history of completed assignments, and validate technical solutions against established requirements. This approach encourages the adoption of industry-standard engineering practices, such as setting up isolated development environments and managing project dependencies, throughout the learning process. The platform supports a wide range of technical development, covering areas such as computational problem solving, object-oriented design, and data analysis. It facilitates collaborative learning through community-driven platforms, letting students engage with peers and have their work validated. The curriculum is maintained as an open-source resource, offering a comprehensive guide for building professional competence in software engineering.
Provides resources and guidance for analyzing and visualizing data as part of the broader computer science curriculum.
HTML · awesome-list · computer-science · courses

n8n-io/n8n
192,772 stars
n8n is a workflow automation platform that combines a visual interface with code-based extensibility to design, orchestrate, and manage automated processes. It provides a comprehensive suite of tools for data transformation, filtering, and storage, allowing users to build complex logic through conditional branching, looping, and sub-workflow execution. The platform supports both pre-built integration nodes and custom code execution in JavaScript or Python, enabling connectivity with a wide range of external services and APIs. The platform includes a suite of generative AI capabilities, such a
Captures and manages operational metrics with configurable retention and compaction settings for self-hosted instances.
TypeScript · ai · apis · automation

jackfrued/Python-100-Days
183,425 stars
This project is a comprehensive, day-by-day curriculum designed to guide learners through the Python programming language and its professional applications. The content spans from fundamental syntax and object-oriented design to advanced topics including database management, web development, data analysis, and machine learning. The curriculum is structured into distinct modules that cover practical software engineering practices, such as version control, containerization, and system architecture. It also provides resources for technical interview preparation and an analysis of career paths wi
Implements numerical computing, data manipulation, and visualization workflows using industry-standard analytical libraries.
Jupyter Notebook

d3/d3
113,118 stars
D3 is a modular library providing low-level primitives for creating data-driven visualizations. It functions as a flexible framework that allows for direct control over visual presentation by mapping abstract data dimensions to graphical properties, such as position, color, and size, without imposing predefined chart abstractions. The library distinguishes itself by offering specialized tools for complex data representation, including algorithmic layouts for hierarchical structures and geographic projection utilities for mapping spherical coordinates. It also includes a comprehensive suite fo
Implements interactive selection areas that allow users to highlight and isolate specific data ranges within a visualization.
Shell · chart · charts · d3

GrowingGit/GitHub-Chinese-Top-Charts
108,509 stars
This project functions as a curated software directory and developer resource index, providing a centralized platform for discovering and evaluating high-quality open-source repositories. It serves as an aggregator that monitors trending software and educational resources, organizing them by technical domain and programming language to assist developers in identifying tools for their specific technical challenges. The directory distinguishes itself through a community-driven curation workflow, where repository lists are validated and updated based on collective developer consensus. This infor
Monitors open-source project activity and ecosystem trends to deliver insights into software popularity and health.
Java

punkpeye/awesome-mcp-servers
89,264 stars
This project serves as a centralized directory and interoperability hub for the Model Context Protocol, providing a curated collection of standardized service connectors that bridge artificial intelligence models with external software, databases, and APIs. It facilitates the integration of AI agents with diverse ecosystems by offering a registry of machine-readable interface definitions that enable dynamic tool discovery and structured context injection. The directory distinguishes itself by focusing on the protocol-based interoperability required for autonomous AI agents to interact with he
Bridges high-performance mathematical engines with analytical frameworks to execute complex data processing and visualization tasks.
ai · mcp

mermaid-js/mermaid
88,676 stars
This project is a client-side rendering engine that transforms declarative, text-based syntax into visual diagrams directly within the browser. By utilizing a domain-specific language, it allows users to define complex structures—such as software architectures, process flows, and system behaviors—without the need for manual layout configuration. The library functions as a browser-based runtime that parses these definitions into intermediate abstract syntax trees, which are then processed by specialized engines to generate high-fidelity, resolution-independent graphics. The system distinguishe
Converts plain-text configuration into visual charts and graphs without requiring manual layout adjustments.
TypeScript · diagrams · diagrams-as-code · documentation

Stirling-Tools/Stirling-PDF
81,109 stars
Stirling-PDF is a self-hosted document processing suite designed for secure, private file management. It functions as a comprehensive transformation engine that executes complex operations—such as merging, splitting, converting, and redacting documents—directly on the host machine. The platform provides both a browser-based interface for interactive editing and a programmatic, API-first architecture that allows for the automation of document workflows through standard HTTP requests. The project distinguishes itself through its focus on private, infrastructure-agnostic deployment and granular
Tracks system metrics and feature engagement using privacy-conscious analytics services.
TypeScript · docker · hacktoberfest · java

junegunn/fzf
81,017 stars
This project is a general-purpose command-line filter that provides an interactive interface for processing standard input streams. It enables real-time fuzzy searching, data selection, and transformation, allowing users to navigate complex information or file systems directly within their terminal. By utilizing a pipe-oriented architecture, it integrates into existing shell pipelines and workflows to facilitate efficient data exploration. What distinguishes this tool is its highly extensible, event-driven design that allows for deep integration with external processes. It supports asynchrono
Toggles between predefined column configurations during runtime to allow flexible data viewing.
Go · bash · cli · fish

anuraghazra/github-readme-stats
79,661 stars
This project is a serverless service that generates dynamic, themeable visual summaries of software development activity. It functions as an automated metadata visualizer, transforming raw platform logs and repository metrics into resolution-independent vector graphics that can be embedded directly into markdown environments. The service distinguishes itself by offering highly configurable, query-parameter-driven rendering that allows users to customize the visual presentation of their coding patterns, language proficiency, and repository details. It supports both real-time generation via ser
Caches and serves platform-specific performance metrics through configurable, high-performance image endpoints.
JavaScript · dynamic · profile-readme · readme-generator

nomic-ai/gpt4all
77,375 stars
GPT4All is a cross-platform runtime environment designed to execute large language models directly on local consumer hardware. By leveraging an optimized C++ inference backend, it enables private, offline AI interactions without requiring an internet connection or external cloud services. The project provides a comprehensive ecosystem for managing the entire model lifecycle, including discovery, downloading, and configuration of local weights. What distinguishes the platform is its integrated retrieval-augmented generation engine, which allows users to index local documents into semantic vect
Allows users to attach spreadsheet data to conversations for local analysis and report generation.
C++ · ai-chat · llm-inference

elastic/elasticsearch
77,012 stars
Elasticsearch is a distributed search engine and document store designed for the high-performance indexing and retrieval of massive volumes of unstructured data. It functions as a centralized analytics platform, providing a schema-flexible architecture that organizes information into searchable indices while maintaining global cluster state through a distributed consensus mechanism. The platform distinguishes itself through its integrated approach to observability, security, and advanced analytics. It combines full-text, vector, and hybrid search capabilities with machine learning-driven insi
Powers high-performance computation for executing complex analytical queries and processing large-scale data.
Java · elasticsearch · java · search-engine

awesomedata/awesome-public-datasets
75,979 stars
This project is a community-maintained, open-access directory of high-quality public datasets. It serves as a centralized reference point for researchers, developers, and data scientists to locate reliable information sources across a wide spectrum of industries and scientific fields. By providing a structured index, the repository facilitates the discovery of data necessary for exploratory analysis, machine learning model training, and the development of data-intensive applications. The directory distinguishes itself through a lightweight, platform-agnostic approach to resource indexing that
Benchmarks machine learning algorithms and data science models through standardized datasets.
aaron-swartz · awesome-public-datasets · datasets

grafana/grafana
74,456 stars
Grafana is an observability data platform designed to aggregate metrics, logs, and traces from diverse sources into a unified environment. It functions as a centralized interface for visualizing complex telemetry data, transforming raw streams into interactive dashboards that support real-time system health tracking and performance monitoring. The platform distinguishes itself through a plugin-based modular architecture that integrates disparate databases, cloud services, and monitoring tools via a standardized data abstraction layer. This framework allows for the dynamic loading of external
Renders interactive interfaces that allow teams to visualize and explore complex telemetry data in real-time.
TypeScript · alerting · analytics · business-intelligence

apache/superset
73,451 stars
Superset is a web-based business intelligence platform designed for data exploration, visualization, and interactive dashboarding. It functions as a query-driven analytics engine that connects to various SQL databases, allowing users to perform ad-hoc analysis, define virtual metrics, and build complex data visualizations through a centralized interface. The platform distinguishes itself through a robust semantic layer that transforms raw database schemas into calculated columns and virtual metrics, enabling consistent business logic across an organization. It features a plugin-based visualiz
Enables ad-hoc SQL querying and advanced data transformations to inspect and analyze large datasets within a web interface.
TypeScript · analytics · apache · apache-superset

josephmisiti/awesome-machine-learning
72,867 stars
This project is a comprehensive, community-driven directory of machine learning resources, software libraries, and educational materials. It serves as a centralized knowledge base for developers and researchers, organizing tools and frameworks by their primary programming language and technical domain to simplify discovery across the artificial intelligence ecosystem. The collection distinguishes itself by providing a cross-language development index that spans diverse programming environments, including C, C++, Rust, Clojure, and Python. It covers a wide range of specialized capabilities, fr
Directs users to high-performance libraries optimized for querying and manipulating tabular datasets.
Python