30 open-source projects similar to src-d/awesome-machine-learning-on-source-code, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Awesome Machine Learning On Source Code alternative.
This project is a community-maintained, open-access directory of high-quality public datasets. It serves as a centralized reference point for researchers, developers, and data scientists to locate reliable information sources across a wide spectrum of industries and scientific fields. By providing a structured index, the repository facilitates the discovery of data necessary for exploratory analysis, machine learning model training, and the development of data-intensive applications. The directory distinguishes itself through a lightweight, platform-agnostic approach to resource indexing that
This repository serves as a structured educational resource for machine learning and data science, providing a centralized collection of tutorials, lecture notes, and implementation guides. It is designed to support self-directed learning by organizing complex technical concepts into a clear, hierarchical path that spans from foundational statistical methods to advanced deep learning architectures. The project distinguishes itself through a comprehensive approach to skill development, bridging the gap between theoretical algorithmic foundations and functional software applications. It offers
A collection of important graph embedding, classification and representation learning papers with implementations.
A list of awesome and proven Artificial Intelligence use cases and applications
A collection of research papers and software related to explainability in graph machine learning.
A curated list of Monte Carlo tree search papers with implementations.
A list of popular deep learning models related to classification, segmentation and detection problems
A collection of research papers on decision, classification and regression trees with implementations.
A curated list of data mining papers about fraud detection.
Readings for "A Unified View of Relational Deep Learning for Drug Pair Scoring." (IJCAI 2022)
The definitive list of lists (of lists) curated on GitHub and elsewhere
This project is a comprehensive, community-curated directory that organizes a vast landscape of Python software libraries, frameworks, and tools. It serves as a centralized knowledge base designed to facilitate ecosystem navigation and accelerate developer discovery across the entire software development lifecycle. The directory distinguishes itself by providing a structured index of resources categorized by technical domain, ranging from foundational development utilities to specialized engineering fields. It covers high-level capabilities including artificial intelligence, data science, web
This project is a comprehensive, community-driven directory of machine learning resources, software libraries, and educational materials. It serves as a centralized knowledge base for developers and researchers, organizing tools and frameworks by their primary programming language and technical domain to simplify discovery across the artificial intelligence ecosystem. The collection distinguishes itself by providing a cross-language development index that spans diverse programming environments, including C, C++, Rust, Clojure, and Python. It covers a wide range of specialized capabilities, fr
This project is a comprehensive, community-driven repository that serves as a centralized catalog for computer vision research and development. It functions as a structured index of academic papers, open-source software libraries, public datasets, and educational tutorials, providing a navigation point for the complex landscape of modern vision technology. The repository distinguishes itself through a taxonomy-based indexing system that maps the relationships between foundational research, influential academic figures, and their corresponding software implementations. By utilizing a lightweig
A collection of resources for learning type theory and type theory adjacent fields.
A complete daily plan for studying to become a machine learning engineer.
A curated list of community detection research papers with implementations.
This project is a community-driven directory that aggregates and categorizes high-quality technical resources, tools, and learning materials. It functions as a centralized knowledge management repository, designed to help developers navigate the software development landscape by providing structured access to curated lists and external project references. The directory relies on a collaborative, peer-reviewed workflow where external contributors submit and maintain links through a version-controlled system. This community-maintained approach ensures that the information remains current and re
A curated list of gradient boosting research papers with implementations.
:video_game: A curated list of awesome game datasets, and tools to artificial intelligence in games
This project provides a structured computer science curriculum framework designed for self-directed learners. It organizes open-access academic resources, including textbooks, lectures, and assignments, into a cohesive path that mirrors the requirements of a formal undergraduate degree. By integrating theoretical study with practical software engineering methodologies, the platform enables students to master foundational concepts and advanced technical skills independently. The curriculum distinguishes itself by utilizing a version-control-based workflow to manage the educational experience.
A curated list of awesome R packages, frameworks and software.
:chartwithupwards_trend: A curated list of awesome data visualization libraries and resources.
This project is a collection of interactive Python notebooks and educational resources designed for mastering data science, machine learning, and numerical computing. It provides a series of practical guides and tutorials covering deep learning, big data processing, and statistical analysis. The repository features specialized instructional suites for implementing classical machine learning algorithms, building deep learning model architectures, and managing AWS cloud infrastructure. It includes dedicated notebooks for data visualization and numerical computing exercises. The project covers
This project is a curated repository of technical interview questions and a directory of study resources designed for professional software engineering assessments. It serves as a reference guide for interview patterns and common domain questions across various programming languages and technology stacks. The collection organizes coding and computer science questions by language and framework to assist with technical interview preparation and coding assessment study. It covers a wide range of programming domains and system design patterns to support software engineering career growth.
continuous integration and continuous delivery
Free-Certifications is a community-maintained, open-source directory that indexes free professional certification programs and educational training resources. It functions as a static content index, providing a structured hub for discovering learning paths and skill development opportunities across various technology domains and industry sectors. The project operates as a decoupled discovery layer, linking users to external training platforms rather than hosting educational content directly. By utilizing a version-controlled, markdown-based storage system, the directory facilitates collaborat
Curated list of Creative Technology groups, companies, studios, collectives, etc.