Comments
LL3M uses a team of large language models to write Python code that creates and edits 3D assets in Blender. Given user text instructions, the agents are capable of creating expressive shapes from scratch, and realizing complex, precise geometric manipulations in code. Whereas previous uses of code-writing LLMs for 3D creation have focused on specific subtasks or constrained procedural programs and primitives, our method is able to create unconstrained assets with geometry, layout, and appearance. With high-level code as a 3D representation, our pipeline is natively a loop of iterative refinement and co-creation: agents perform automatic code and visual self-critique, and users can provide continuous high-level feedback. Further editing avenues are enabled by the clear code and the parameters transparent in the generated Blender nodes and structures.