FlexCAD

[ICLR 2025] FlexCAD: Unified and Versatile Controllable CAD Generation with Fine-tuned Large Language Models by Zhanwei Zhang, Shizhao Sun, Wenxiao Wang, Deng Cai, Jiang Bian.

[Paper][Hugging Face]

FlexCAD aims at achieving controllable CAD generation across all CAD construction hierarchies. It takes an original CAD model along with the part the user intends to modify (highlighted in blude) as input, and generates multiple new CAD models with only the chosen part changed.

Figure (a) illustrates training process. Initially, a CAD model is converted int 8000 o a structured text. Next, a hierarchy-aware masking strategy is proposed to mask a specific field in the text with a special mask token. This field is set differently at each epoch to reflect various hierarchies. Then, LLMs are fine-tuned to predict the masked field. Figure (b) presents inference process. The original CAD model is transformed into a structured text with a mask token replacing the part the user wants to change. The fine-tuned LLMs are provided with this masked text to generate diverse predictions, which are then converted into new CAD models by infilling and rendering.

Installation

Create a conda environment and install all the dependencies

conda env create -f environments.yaml

After installation, activate the environment with

conda activate <env>

Data preparation

Download the processed data by SkexGen: Google Drive link.

gdown --id 1so_CCGLIhqGEDQxMoiR--A4CQk4MjuOp

Convert the dicts to the sequences. Note train.pkl, val.pkl and test.pkl should be converted separately. Circle_type: [ udlr, ldru, diam, or ].

python3 utils/convert.py --in_path <in_path_name> --out_path <out_path_name> --circle_type <circle_type_name>

For example:

python3 utils/convert.py --in_path ./cad_data/train_deduplicate_s.pkl --out_path ./cad_data/processed_data/train.pkl --circle_type ldru

Training

Before starting training, make sure to register and download the LLaMA 3 model for fine-tuning.

Run training with multiple GPUs. Change num_processes in ds_config.yaml to specify how many GPUs will be used.

CUDA_VISIBLE_DEVICES=<gpu_ids> accelerate launch --config_file ds_config.yaml finetune.py --run-name <run_name> --data-path <data_path> --eval-freq 200000 --save-freq 50000 --model_name <model_name>

For example: use Llama 3 8B as base model:

CUDA_VISIBLE_DEVICES=0,1,2,3 accelerate launch --config_file ds_config.yaml finetune.py --run-name llama3_8B --data-path ./cad_data/processed_data --eval-freq 200000 --save-freq 20000 --model-name 8B

Run training with single GPU.

CUDA_VISIBLE_DEVICES=<gpu_id> python3 finetune.py --run-name <run_name> --data-path <data_path> --eval-freq 200000 --save-freq 20000 --model_name <model_name>

Inference

Download our trained model weights for FlexCAD from Hugging Face.

For conditional generation, run inference with (mask_type: [unconditional, cad, sketch-extrusion(es), extrusion, sketch, face, loop, curve]. When selecting 'curve', you can enable use_fixed_demo to customize the type and number of curves to generate as desired.)

CUDA_VISIBLE_DEVICES=<gpu_id> python3 sample.py --model_path <model_path> --num_samples <num_samples> --model_name <model_name> --data_path <data_path> --mask_type <mask_type>

The output should be a json file, where each line is a string representing a CAD design.

Visualization

Step 1: parse the generated string to CAD obj. The in_path should be set the same as the out_path in the inference.

python3 utils/parser.py --in_path <in_path> --out_path <out_path>

Step 2: convert generated CAD obj to stl format. Use timeout command to prevent occ hanging. The data_folder should be set the same as the out_path in step 1.

timeout 180 python3 utils/visual_obj.py --data_folder <data_folder>

Step 3: render and visualize to images. The input_dir should be set the same as the data_folder in step 2. Note that this step only succeeds on Windows now.

python3 utils/cad_img.py --input_dir <input_dir> --output_dir <output_dir>

Evaluation

Uniformly sample 2000 points

python utils/sample_points.py --in_dir <sample_dir> --out_dir pcd

Evaluate performance

python utils/eval_cad.py --fake <sample_dir> --real ../data/test_eval

Citation

If you find our work useful in your research, please cite our paper:

@InProceedings{zhang2024flexcad,
  title={FlexCAD: Unified and Versatile Controllable CAD Generation with Fine-tuned Large Language Models},
  author={Zhang, Zhanwei and Sun, Shizhao and Wang, Wenxiao and Cai, Deng and Bian, Jiang},
  booktitle={ICLR},
  year={2025}
}

Acknowledgement

Our code is partially based on Skexgen and Crystal-text-llm. We appreciate all the contributors for their awesome work.

Contributing

This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.opensource.microsoft.com.

When you submit a pull request, a CLA bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., status check, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.

Trademarks

This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow Microsoft's Trademark & Brand Guidelines. Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos are subject to those third-party's policies.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
assets		assets
utils		utils
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
SUPPORT.md		SUPPORT.md
dataset.py		dataset.py
ds_config.yaml		ds_config.yaml
environments.yaml		environments.yaml
finetune.py		finetune.py
sample.py		sample.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

FlexCAD

Installation

Data preparation

Training

Inference

Visualization

Evaluation

Citation

Acknowledgement

Contributing

Trademarks

About

Uh oh!

Releases

< 3A17 a href="/orgs/microsoft/packages?repo_name=FlexCAD" data-view-component="true" class="Link--primary no-underline Link d-flex flex-items-center">Packages

Uh oh!

Contributors 2

Uh oh!

Languages

License

microsoft/FlexCAD

Folders and files

Latest commit

History

Repository files navigation

FlexCAD

Installation

Data preparation

Training

Inference

Visualization

Evaluation

Citation

Acknowledgement

Contributing

Trademarks

About

Resources

License

Code of conduct

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

< 3A17 a href="/orgs/microsoft/packages?repo_name=FlexCAD" data-view-component="true" class="Link--primary no-underline Link d-flex flex-items-center">Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

< 3A17 a href="/orgs/microsoft/packages?repo_name=FlexCAD" data-view-component="true" class="Link--primary no-underline Link d-flex flex-items-center">Packages