30 open-source projects similar to rubikplayer/flame-fitting, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Flame Fitting alternative.
This is an official repository of the paper Learning to Regress 3D Face Shape and Expression from an Image without 3D Supervision. The project was formerly referred by RingNet. The codebase consists of the inference code, i.e. give an face image using this code one can generate a 3D mesh of a…
COLMAP is a 3D scene reconstruction suite and C++ geometry library that implements a full structure-from-motion pipeline. It functions as a GPU-accelerated photogrammetry tool and multi-view stereo framework designed to produce dense 3D geometry and watertight meshes from collections of 2D images. The project distinguishes itself through hardware-accelerated feature extraction and a modular camera modeling system that supports perspective, fisheye, and equirectangular lens types. It employs vocabulary tree image retrieval to efficiently identify similar images in large datasets and provides P
TRELLIS.2 is a generative image-to-3D system that creates high-resolution 3D assets with physically based rendering materials from 2D images. It utilizes a sparse voxel representation to handle complex topologies and internal structures without relying on iso-surface fields. The project features a structured latent space representation that maps geometry and texture attributes to maintain visual fidelity. It employs an optimization-free geometry reconstruction process to decode latent representations directly into voxel grids and includes a PBR texture generator for synthesizing base color, r
ComfyUI-3D-Pack is a suite of custom nodes for ComfyUI that enables 3D asset generation and rendering within a node-based workflow. It provides a set of tools for reconstructing textured three-dimensional meshes and volumetric scenes from single images, multi-view images, or text prompts. The system includes a Gaussian splatting generator for creating high-fidelity volumetric 3D scene representations and a multi-view image generator to produce consistent image sets for reconstruction. It also features a single image 3D mesh tool to build geometry from a single 2D source. The toolset covers 3
This project is a diffusion-based 3D generator and image-to-3D reconstruction system. It translates natural language descriptions or two-dimensional images into three-dimensional assets using neural radiance fields and diffusion models. The system utilizes score-distillation sampling and diffusion-based guidance to refine 3D shapes without requiring 3D training data. It includes specialized tools for transforming neural representations into exportable meshes with texture and material data, as well as a pipeline for iterative optimization of geometry and textures. The project covers a broad r
InstantMesh is a neural 3D reconstruction tool and single-image 3D mesh generator. It utilizes a sparse-view large reconstruction model to convert a single two-dimensional image into a three-dimensional object mesh. The system functions as a textured 3D mesh exporter, saving generated objects with either vertex colors or full texture maps for use in external rendering software. The framework covers a range of capabilities including feed-forward geometry inference, single-image depth estimation, and neural radiance fields. It also supports differentiable mesh rendering and workflows for spars
Slambook is a visual SLAM framework designed for simultaneous localization and mapping. It provides an integrated system to estimate camera motion and reconstruct 3D environments using visual sensor data. The project includes a visual odometry engine to track camera movement and a dense 3D reconstruction tool for creating volumetric representations of scenes. It features a loop closure detection system to recognize previously visited locations and a pose graph optimizer to refine trajectories and ensure global map consistency. The framework covers spatial estimation and environment modeling
jetson-inference is a set of libraries and tools for executing optimized deep learning models on embedded GPU hardware. Its primary purpose is to enable real-time computer vision and AI inference at the edge with low latency and high throughput. The project distinguishes itself through high-performance streaming analytics and the ability to execute concurrent AI pipelines on auto-grade silicon. It provides specialized support for multi-sensor stream processing, utilizing zero-copy data transport to load camera frames directly into GPU memory. The codebase covers a broad surface of capabiliti
This project is a collection of neural network models and geometric tools designed for image feature matching, spatial alignment, and visual localization. It provides a pre-trained neural network model for identifying high-accuracy correspondences between sparse image features without requiring local training. The system utilizes a graph neural network matcher that employs attention mechanisms and message passing to learn spatial relationships between image feature points. It integrates a RANSAC camera pose estimator to filter feature matches and calculate the relative spatial transformation
This project is a computer vision system for object segmentation and tracking across images and videos. It employs models capable of identifying and masking objects using text prompts, bounding boxes, click points, or image exemplars. The system differentiates itself through memory-based video tracking and shared-memory architectures that maintain consistent object identities over time. It supports multi-object processing in single computation passes to increase frame throughput and utilizes iterative refinement to correct segmentation boundaries through sequential prompts. The software also
Single/multi view image(s) to voxel reconstruction using a recurrent neural network
This project is a framework for neural radiance fields used to synthesize three-dimensional environments from sets of two-dimensional images and camera poses. It functions as a volumetric rendering engine and scene synthesizer that optimizes neural representations of spatial volumes to generate novel views of complex 3D scenes. The system implements a coordinate encoding system that transforms spatial coordinates into high-dimensional space to capture high-frequency geometric details. It also includes a neural mesh extractor that converts trained radiance fields into triangle meshes via march
This repository provides a Torch implementation of the framework proposed in CVPR 2017 paper Synthesizing 3D Shapes via Modeling Multi-View Depth Maps and Silhouettes with Deep Generative Networks by Amir A. Soltani, Haibin Huang, Jiajun Wu, Tejas Kulkarni and Joshua Tenenbaum
Training scripts and a couple of trained demo networks are included. More demos and the complete set of data are on the road.
pifuhd is a 3D human reconstruction framework that generates high-resolution 3D meshes of people from a single 2D image. It utilizes pixel-aligned implicit functions to map image pixels to 3D space, predicting surface occupancy and distance to create detailed geometry. The system includes a pipeline for creating digital human assets, moving from 2D image feature projection to the extraction of discrete triangular meshes. It features specialized tools for refining these models, including a post-processor that removes geometric artifacts by isolating the largest connected component of the mesh.
This repository contains the official pytorch implementation for the paper "Generative PointNet: Deep Energy-Based Learning on Unordered Point Sets for 3D Generation, Reconstruction and Classification"
InsightFace is a comprehensive deep learning framework designed for face recognition, biometric identity verification, and feature extraction. It provides a specialized engine for one-to-one verification and one-to-many identification tasks, utilizing convolutional neural networks to transform raw image pixels into high-dimensional vector embeddings. The project includes a complete toolkit for detecting, aligning, and processing facial data to ensure consistent identity discrimination. Beyond core recognition, the platform distinguishes itself through an extensive model management and optimiz
OctNetFusion is a data-driven method for volumetric depth fusion and depth completion. We extend OctNet to enable high-resolution 3D outputs of convolutional networks.
Three-dimensional content creation has been a central research area in computer graphics for decades. The main challenge is to minimize manual intervention, while still allowing the creation of a variety of plausible 3D objects. In this work, we present a global-to-local generative model to…
This is an official repository of Generating 3D Faces using Convolutional Mesh Autoencoders