Unified Controllable and Faithful Text-to-CAD Generation with LLMs

Published: (June 9, 2026 at 10:04 AM EDT)
2 min read

Source: Hacker News

View PDF HTML (experimental) Abstract:The construction of CAD models has traditionally relied on labor-intensive manual operations and specialized expertise. Recent advances in large language models (LLMs) have inspired research into text-to-CAD generation. However, existing approaches typically treat generation and editing as disjoint tasks, limiting their practicality. We propose PR-CAD, a progressive refinement framework that unifies generation and editing for controllable and faithful text-to-CAD modeling. To support this, we curate a high-fidelity interaction dataset spanning the full CAD lifecycle, encompassing multiple CAD representations as well as both qualitative and quantitative descriptions. The dataset systematically defines the types of edit operations and generates highly human-like interaction data. Building on a CAD representation tailored for LLMs, we propose a reinforcement learning-enhanced reasoning framework that integrates intent understanding, parameter estimation, and precise edit localization into a single agent. This enables an “all-in-one” solution for both design creation and refinement. Extensive experiments demonstrate strong mutual reinforcement between generation and editing tasks, and across qualitative and quantitative modalities. On public benchmarks, PR-CAD achieves state-of-the-art controllability and faithfulness in both generation and refinement scenarios, while also proving user-friendly and significantly improving CAD modeling efficiency.

      Subjects:
      
        Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
    
      Cite as:
      [arXiv:2604.19773](https://arxiv.org/abs/2604.19773) [cs.CL]
    
    
       
      (or 
          [arXiv:2604.19773v1](https://arxiv.org/abs/2604.19773v1) [cs.CL] for this version)
      
    
    
       
                    [https://doi.org/10.48550/arXiv.2604.19773](https://doi.org/10.48550/arXiv.2604.19773)
          
          
              arXiv-issued DOI via DataCite

        
      
    


  

Submission history

From: Jiyuan An [view email]
[v1] Fri, 27 Mar 2026 12:13:20 UTC (10,655 KB)

0 views
Back to Blog

Related posts

Read more »

Cosmodial Sky Atlas

Article URL: https://killedbyapixel.github.io/Cosmodial/ Comments URL: https://news.ycombinator.com/item?id=48507571 Points: 15 Comments: 1...

I Am Not a Reverse Centaur

About a year ago I wrote on this blog about how coding with LLMs would not work for mehttps://blog.miguelgrinberg.com/post/why-generative-ai-coding-tools-and-ag...

'Don't You Just Upload It to ChatGPT?'

Article views: 10,114 In my Ottawa lifehttps://correresmidestino.com/tag/ottawa/, every Tuesday evening, I take two gym classeshttps://correresmidestino.com/im-...