30 open-source projects similar to biopython/biopython, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Biopython alternative.
klib is a comprehensive C standard library extension and data structure toolkit. It provides a set of fundamental tools for memory management, data organization, and general-purpose utility functions for standalone C applications. The project features specialized capabilities for bioinformatics sequence analysis, including the parsing of FASTA, FASTQ, and Newick formats and the implementation of Smith-Waterman sequence alignment and Hidden Markov Models. It also includes a mathematical computation library for numerical routines and expression evaluation, as well as a lightweight HTTP and FTP
This project is a scientific agent framework and workflow orchestrator designed to extend large language models with specialized tools for genomic, chemical, and biological research. It provides a system for planning research hypotheses and executing automated workflows by integrating scientific databases with dynamic code execution. The framework includes a cheminformatics modeling suite for predicting molecular bioactivity and performing virtual screening, alongside a bioinformatics analysis toolkit for processing genomic sequences and single-cell data. It also features an academic document
The SxT-Python-SDK is a Python library and SQL database client designed for executing queries and managing database schemas via the Space and Time API. It serves as a programmatic interface for interacting with a decentralized data warehouse. The SDK provides tools for decentralized database management, including the ability to define and create table structures and establish views. It enables blockchain data analysis by allowing users to run SQL statements to extract and manipulate on-chain data. The library handles the full data lifecycle from authentication to export. It manages session t
supabase-js is a comprehensive client library designed to integrate frontend applications with a hosted backend-as-a-service. It provides a unified interface for interacting with a PostgreSQL database, identity management systems, cloud object storage, and real-time data synchronization. The library features an isomorphic client design that operates across both browser and server environments. It distinguishes itself through a type-safe approach, utilizing TypeScript to map database schemas directly to client-side definitions, and employs a PostgREST-based API to translate JavaScript calls in
AlphaFold3 is a biomolecular structure prediction model and bioinformatics structural analysis tool. It uses a deep learning system to predict the three-dimensional shapes of proteins, DNA, RNA, and ligands. The system functions as a diffusion-based protein folding model that predicts the spatial coordinates of biomolecular atoms and interactions. It utilizes a GPU-accelerated inference pipeline to process genetic sequences and structural templates for molecular modeling. The project covers structural bioinformatics analysis and protein interaction modeling to determine the physical arrangem
evo2 is a genomic large language model and foundation model designed to predict, generate, and analyze genetic information across different species. It functions as a nucleotide sequence modeler and a DNA sequence generator, using transformer-based sequence modeling to process genomic data. The system provides capabilities for synthetic DNA generation, creating new genetic sequences based on biological prompts or species-specific tags. It also performs nucleotide likelihood prediction to score genomic variants and analyze biological properties within DNA sequences. The model supports genomic
javdb is a mobile adult media indexer and video database browser. It functions as a REST API client that allows users to search for and discover adult video metadata and titles from a remote database. The application includes a magnet link search tool that automatically locates downloadable torrent files associated with specific media entries by querying external search engines. The system manages content discovery through video database browsing and the retrieval of media information via standard HTTP requests.
HumanSystemOptimization is a structured biohacking framework designed to enhance physical vitality and mental clarity through evidence-based lifestyle interventions. It functions as a comprehensive protocol for managing biological performance by integrating nutritional strategies, circadian rhythm regulation, and cognitive training systems. The project distinguishes itself by focusing on the intersection of metabolic health and neural adaptability. It employs specific methodologies for dopamine-baseline modulation to sustain long-term motivation and utilizes intermittent fasting and microbiom
pugixml is a lightweight C++ XML parser and DOM-based library used for parsing, manipulating, and saving XML documents. It provides a portable toolset for reading XML data from files, strings, or memory buffers and converting them into an in-memory document object model. The library includes a dedicated XPath 1.0 engine for extracting specific nodes and data through path expressions. It distinguishes itself through customizable memory management, allowing heap operations to be redirected to user-defined allocation functions, and the ability to perform in-place buffer parsing to reduce memory
Graphite is a node-based visual design environment that integrates vector illustration, raster image processing, and motion graphics generation into a single platform. It utilizes a functional reactive pipeline and a data-flow execution model to propagate state changes through a graph of interconnected nodes, allowing users to construct complex, automated design workflows. The platform distinguishes itself through a context-aware evaluation engine that injects runtime metadata—such as coordinate data and loop indices—directly into the node graph. This enables the creation of procedural geomet
This project is a cross-platform desktop application that provides a graphical interface for managing and interacting with Redis databases. Built as a standalone software program, it functions as a centralized connection manager that allows users to visualize data structures, perform administrative tasks, and inspect records across multiple database environments. The application distinguishes itself through its integration with external processes and command-line workflows. Users can define custom viewers to render specific data formats using external tools and launch the interface directly f
SurrealDB is a multi-model database engine designed to store and query document, graph, relational, and vector data within a single ACID-compliant platform. It functions as an AI-native data store, integrating vector search, graph traversal, and machine learning model execution directly into its query layer. By providing a unified declarative query language, the platform eliminates the need for external middleware to synchronize data across different storage models. The platform distinguishes itself through its ability to manage agent memory and complex workflows natively. It allows developer
Ransack is an object-based search library for Active Record models that provides a form-driven framework for building database queries. It generates query conditions from user input through a configurable predicate system, allowing developers to filter records using simple or advanced criteria without writing SQL or relying on external dependencies. The library distinguishes itself through its predicate-based query construction, where search methods are dynamically generated from model attribute names and predicate suffixes at runtime. It wraps search parameters into a dedicated form object t
Xan is a command-line tool and data transformation engine for processing CSV, TSV, and JSONL datasets. It functions as a processor for compressed files, enabling random access and seeking within gzipped and Zstd files, and serves as a converter for specialized bioinformatics data formats. The tool handles large datasets without requiring full memory loads by utilizing stream-based processing. It provides capabilities for merging, sorting, and deduplicating massive files, as well as converting data between various tabular formats. The project covers a broad range of data wrangling and analysi
xmltodict is a Python library that provides bidirectional serialization between XML documents and dictionaries. It functions as a parser that converts marked-up input into key-value pairs and a serialization utility that transforms dictionaries back into structured XML documents. The project includes an incremental stream processor that uses depth-based callbacks to handle large XML files while maintaining constant memory usage. It features a namespace manager for mapping prefixes and declarations, as well as a security sanitizer that blocks external entity expansion and validates element nam
This project is a plugin framework and agentic workflow library designed to connect large language models to professional toolstacks. It provides a system for integrating language models with external data warehouses, CRMs, and other enterprise software to retrieve and manipulate real-time business data. The framework enables the automation of specialized professional tasks through a file-based plugin definition system. It allows for the customization of domain expertise and plugin behavior to align with internal company processes, supported by an enterprise data connector that links models t
SVGKit is a graphics framework for the iOS and macOS ecosystems designed for rendering high-performance scalable vector graphics. It functions as a library that utilizes native hardware acceleration to display and interact with vector graphics on Apple platforms. The project provides a programmatic interface for editing vector elements and writing updated files back to disk. It also includes tools to convert vector graphics into rasterized bitmap image formats for use in standard image views. The framework handles the translation of XML-based documents into a hierarchy of hardware-accelerate
This project is a PHP implementation of a CSS selector engine that transforms CSS selector strings into compatible XPath expressions for locating elements within documents. It serves as a converter and expression generator that maps CSS selection logic to the XPath query language. The library processes selectors through a pipeline involving lexer-based tokenization and recursive descent parsing to create an abstract syntax tree. It utilizes pattern-matching logic to handle child and sibling relationships, translating CSS pseudo-classes and selectors into functional XPath logic. These capabil
AlphaFold is a deep learning biology framework and machine learning pipeline designed to predict the three-dimensional coordinates of proteins based on their amino acid sequences. It functions as a bioinformatics inference system for calculating protein folding patterns and estimating prediction confidence. The system includes a protein multimer predictor for determining the 3D structures of protein complexes, supporting both homomers and heteromers. It utilizes specialized model presets to handle these complex structural predictions. The framework covers biological database management for m
OpenEvolve is an open-source framework for evolutionary computation that uses language models to drive automated optimization across multiple domains. It can evolve system prompts for large language models, refine source code across programming languages, search for optimal GPU kernel configurations, discover interpretable mathematical expressions from data, and maintain diverse populations of high-performing solutions. The framework integrates multiple evolutionary strategies, including MAP-Elites diversity mapping and island-based topologies, to avoid premature convergence and preserve a wid
This project is a Node.js library for bidirectional conversion between XML strings and JavaScript objects. It functions as an XML parser that transforms XML content into structured data and an XML serializer that generates formatted strings from JavaScript data objects. The toolkit includes a data transformer that applies custom processing functions to tags and attributes during the conversion process. It manages XML namespaces and supports the definition of custom root elements to maintain document structure during generation. The system handles XML data parsing, string generation, and name
Scanpy is a Python library for the preprocessing, visualization, and analysis of large-scale single-cell gene expression datasets. It serves as a toolkit for single-cell RNA sequencing analysis, providing a framework to process and analyze genomic data from individual cells to identify biological markers and cell types. The library includes a scalable data processing pipeline for cleaning and preparing genomic data, a clustering framework for grouping cells with similar expression profiles, and a system for modeling transitions between cell states to reconstruct biological development and dif
Redka is a SQL-backed key-value store that implements a Redis-compatible interface. It acts as a relational data bridge, translating non-relational protocol commands into relational queries to provide persistence and ACID transactions using a SQL database backend. The system allows for relational data introspection by exposing internal key-value data as SQL views, enabling direct querying and analysis via relational syntax. It can be deployed as a standalone network server or as an embedded library within a Go application. The project supports a variety of data structures, including strings,
Cloud-native genomic dataframes and batch computing
MMseqs2: ultra fast and sensitive search and clustering suite
LibGDX is a Java-based framework designed for cross-platform game development, enabling the creation and deployment of 2D and 3D games across desktop, mobile, and web environments from a single codebase. It functions as a comprehensive library that abstracts hardware-accelerated graphics, audio, input, and file system access, providing a unified interface for developers to manage game logic and application lifecycles. The framework distinguishes itself through a high-performance architecture that prioritizes efficiency and native interoperability. It utilizes a batch-oriented graphics pipelin
Scalable gVCF merging and joint variant calling for population sequencing projects