Fixing rust compilation errors using llms

P Deligiannis, A Lal, N Mehrotra, A Rastogi - arXiv preprint arXiv …, 2023 - arxiv.org
… RustAssistant is parametric in the choice of LLM; we show the interactions with GPT-4 in
this section. Figure 2 shows the output of GPT-4. The output contains the suggested fix in text, …

What makes good in-context demonstrations for code intelligence tasks with llms?

S Gao, XC Wen, C Gao, W Wang… - 2023 38th IEEE/ACM …, 2023 - ieeexplore.ieee.org
4), we can also find that the performance with four demonstration examples is good enough,
… , BLEU-4, and CodeBLEU, respectively. Therefore, considering the above trade-off, using …

Hot patching hot fixes: Reflection and perspectives

C Hanna, J Petke - 2023 38th IEEE/ACM International …, 2023 - ieeexplore.ieee.org
With our reliance on software continuously increasing, it is of utmost importance that it be
reliable. However, complete prevention of bugs in live systems is unfortunately an impossible …

Program transformation landscapes for automated program modification using Gin

J Petke, B Alexander, ET Barr, AEI Brownlee… - Empirical Software …, 2023 - Springer
4 is practical, we instantiate it later in Section 5.4 by empirically solving it when \(2^O\) contains
only two sets — … Thus, we answer RQ1–4 in terms of the number of test-passing variants. …

[PDF][PDF] End-to-End Software Construction using Chat-GPT: An Experience Report

M Monteiro, BC Branco, S Silvestre… - arXiv preprint arXiv …, 2023 - researchgate.net
… As a result, we document four categories of prompts that can be used by developers in similar
contexts, including initialization prompts, feature requests, bug-fixing, and layout prompts. …

Search-based Automated Program Repair of CPS Controllers Modeled in Simulink-Stateflow

A Arrieta, P Valle, S Ali - arXiv preprint arXiv:2404.04688, 2024 - arxiv.org
Stateflow models are widely used in the industry to model the high-level control logic of
Cyber-Physical Systems (CPSs) in Simulink--the defacto CPS simulator. Many approaches exist …

Software testing with large language models: Survey, landscape, and vision

J Wang, Y Huang, C Chen, Z Liu… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
… their criteria, the first LLM is T5 released by Google in 2019, followed by GPT-3 released
by OpenAI in 2020, and there are more than thirty LLMs released between 2021 and 2023

Apigen: Generative api method recommendation

Y Chen, C Gao, M Zhu, Q Liao… - 2024 IEEE International …, 2024 - ieeexplore.ieee.org
… We compare APIGen with four existing approaches on two publicly available benchmarks.
Experiments show that APIGen outperforms the best baseline CLEAR by 105.8% in method-…

GI Software with fewer Data Cache Misses

WB Langdon, J Petke, A Blot, D Clark - arXiv preprint arXiv:2304.03235, 2023 - arxiv.org
… At the beginning of 2023 OpenCV’s open source repository on GitHub comprised more
than two million lines of code (mostly C, C++ and XML). Therefore, we selected an important …

RepoFixEval: A Repository-Level Program Repair Benchmark From Issue Discovering to Bug Fixing

T Sun, Y Yang, X Cheng, J Yang, Y Huo, Z Ye, R Yang… - openreview.net
… For the given issues, we primarily assess them from four perspectives: reproducibility Ir,
relevance Iv, explanation Ie and overall Io. Reproducibility (Ir) assesses the presence and quality …