Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,49 @@
|
|
1 |
-
---
|
2 |
-
license: mit
|
3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: mit
|
3 |
+
pipeline_tag: image-to-3d
|
4 |
+
tags:
|
5 |
+
- triposg
|
6 |
+
- 3d-generation
|
7 |
+
- rectified-flow
|
8 |
+
---
|
9 |
+
# TripoSG - High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models
|
10 |
+
|
11 |
+
TripoSG-scribble is a variant of TripoSG. TripoSG is a state-of-the-art image-to-3D generation foundation model that leverages large-scale rectified flow transformers to produce high-fidelity 3D shapes from single images.
|
12 |
+
|
13 |
+
## Model Description
|
14 |
+
|
15 |
+
### Model Architecture
|
16 |
+
|
17 |
+
TripoSG utilizes a novel architecture combining:
|
18 |
+
- Rectified Flow (RF) based Transformer for stable, linear trajectory modeling
|
19 |
+
- Advanced VAE with SDF-based representation and hybrid geometric supervision
|
20 |
+
- Cross-attention mechanism for image feature condition
|
21 |
+
- 1.5B parameters operating on 2048 latent tokens
|
22 |
+
|
23 |
+
TripoSG-scribble accepts scribble image and text prompt condition. For inference efficiency, TripoSG-scribble is different from TripoSG in:
|
24 |
+
- TripoSG-scribble is a CFG-distilled model and should be used with CFG=0
|
25 |
+
- TripoSG-scribble is trained with 512 latent tokens
|
26 |
+
|
27 |
+
## Intended Uses
|
28 |
+
|
29 |
+
This model is designed for:
|
30 |
+
- Converting scribble image and text prompt to high-quality 3D meshes
|
31 |
+
- Creative and design applications
|
32 |
+
- Gaming and VFX asset creation
|
33 |
+
- Prototyping and visualization
|
34 |
+
|
35 |
+
## Requirements
|
36 |
+
|
37 |
+
- CUDA-capable GPU (>8GB VRAM)
|
38 |
+
|
39 |
+
## Usage
|
40 |
+
|
41 |
+
For detailed usage instructions, please visit our [GitHub repository](https://github.com/VAST-AI-Research/TripoSG).
|
42 |
+
|
43 |
+
## About
|
44 |
+
|
45 |
+
TripoSG-scribble is developed by [Tripo](https://www.tripo3d.ai), [VAST AI Research](https://github.com/orgs/VAST-AI-Research), pushing the boundaries of 3D Generative AI.
|
46 |
+
For more information:
|
47 |
+
- [GitHub Repository](https://github.com/VAST-AI-Research/TripoSG)
|
48 |
+
- [Paper](https://arxiv.org/abs/2502.06608)
|
49 |
+
- [Gradio Demo](https://huggingface.co/spaces/VAST-AI/TripoSG-scribble)
|