面向航天控制软件智能合成技术评价方法

    Evaluation Methods for Intelligent Synthesis Technology of Aerospace Control Software

    • 程序合成是自动生成满足用户意图程序代码的软件开发活动, 随着人工智能在程序合成领域的成功应用, 智能程序合成技术逐渐成为软件开发的新范式. 虽然现有一些智能程序合成技术的评价方法, 但是仍面临许多问题需要进一步完善和改进. 本文通过调研智能程序合成技术使用的评价标准以及分析当前主流智能程序合成技术的评价方法, 分析并完善了智能程序合成技术的评价指标, 结合航天嵌入式软件的特点, 构建了航天嵌入式软件智能合成的层级式评价指标体系, 设计了以动态和静态相结合为主的面向航天控制软件智能合成技术的综合评价方法. 通过实验验证其中动静结合评价方法的有效性, 其能够获得与人类评分相比更高的皮尔逊相关系数.

       

      Abstract: Program synthesis is a technique for automatically generating programs, which derives corresponding program code from given specifications or requirements. With the successful application of artificial intelligence in the field of program synthesis, intelligent program synthesis technology has become a new paradigm for software development. Although there are some evaluation methods for intelligent program synthesis technology, there are still many challenges that need further improvement and refinement. This paper summarizes and refines evaluation indicators for intelligent program synthesis technology by investigating the evaluation criteria used in intelligent program synthesis technology and analyzing the mainstream evaluation methods of intelligent program synthesis technology. Combined with the characteristics of aerospace embedded software, a hierarchical evaluation indicator system for intelligent synthesis of aerospace embedded software is constructed, and a comprehensive evaluation method for intelligent synthesis technology of aerospace control software mainly based on dynamic and static combination is designed. By calculating the Pearson correlation coefficient with ChatGPT3.5 simulating human scores, it is found that the proposed combined dynamic and static evaluation method can obtain a higher correlation coefficient than either dynamic or static evaluation methods alone, and can reflect the improvement of performance after model iteration.

       

    /

    返回文章
    返回