RefiningGPT: Specialized language Models for Automated Refinery Unit-level Process Diagram Synthesis

Dongxiao Liu, Yuwen Ding, Xinghai Wei, Jiacheng Ji, Lei Li, Linghui Li, Xiaoyong Li

arXiv:2605.19704·cs.CE·Published 2026-05-19

Applying LLMs to complex industrial processes remains challenging due to the semantic gap between natural language design intents and the rigorous physical logic of engineering. In the field of petroleum refining engineering, a critical bottleneck is the automated synthesis of Unit-level Process Diagrams (UPDs), which serve as the topological bridge connecting abstract requirements to concrete unit operations. In this paper, we propose RefineGPT, a domain-specialized agent for autonomous refinery design.RefineGPT adopts a hierarchical architecture in which a supervised fine-tuned small language model is responsible for selecting units that satisfy design requirements, while a large language model is used to connect these units to generate the final topology. To enable supervised training, we develop a pipeline that extracts latent process motifs from noisy, unstructured legacy topologies and synthesizes high-quality rationale-based Chain-of-Thought (CoT) training data. Empirical validation demonstrates that RefineGPT achieves substantial improvements in topological consistency and chemical engineering feasibility, establishing a high-fidelity pathway for AI-augmented industrial process synthesis.

TopicsProcess Control & Optimization

Tagschemical-engineering large-language-models process-design

arXiv categoriescs.CE

arXiv abstract page PDF