Fixing rust compilation errors using llms
… RustAssistant is parametric in the choice of LLM; we show the interactions with GPT-4 in
this section. Figure 2 shows the output of GPT-4. The output contains the suggested fix in text, …
this section. Figure 2 shows the output of GPT-4. The output contains the suggested fix in text, …
What makes good in-context demonstrations for code intelligence tasks with llms?
… 4), we can also find that the performance with four demonstration examples is good enough,
… , BLEU-4, and CodeBLEU, respectively. Therefore, considering the above trade-off, using …
… , BLEU-4, and CodeBLEU, respectively. Therefore, considering the above trade-off, using …
Hot patching hot fixes: Reflection and perspectives
With our reliance on software continuously increasing, it is of utmost importance that it be
reliable. However, complete prevention of bugs in live systems is unfortunately an impossible …
reliable. However, complete prevention of bugs in live systems is unfortunately an impossible …
Program transformation landscapes for automated program modification using Gin
… 4 is practical, we instantiate it later in Section 5.4 by empirically solving it when \(2^O\) contains
only two sets — … Thus, we answer RQ1–4 in terms of the number of test-passing variants. …
only two sets — … Thus, we answer RQ1–4 in terms of the number of test-passing variants. …
[PDF][PDF] End-to-End Software Construction using Chat-GPT: An Experience Report
M Monteiro, BC Branco, S Silvestre… - arXiv preprint arXiv …, 2023 - researchgate.net
… As a result, we document four categories of prompts that can be used by developers in similar
contexts, including initialization prompts, feature requests, bug-fixing, and layout prompts. …
contexts, including initialization prompts, feature requests, bug-fixing, and layout prompts. …
Search-based Automated Program Repair of CPS Controllers Modeled in Simulink-Stateflow
Stateflow models are widely used in the industry to model the high-level control logic of
Cyber-Physical Systems (CPSs) in Simulink--the defacto CPS simulator. Many approaches exist …
Cyber-Physical Systems (CPSs) in Simulink--the defacto CPS simulator. Many approaches exist …
Software testing with large language models: Survey, landscape, and vision
… their criteria, the first LLM is T5 released by Google in 2019, followed by GPT-3 released
by OpenAI in 2020, and there are more than thirty LLMs released between 2021 and 2023 …
by OpenAI in 2020, and there are more than thirty LLMs released between 2021 and 2023 …
Apigen: Generative api method recommendation
… We compare APIGen with four existing approaches on two publicly available benchmarks.
Experiments show that APIGen outperforms the best baseline CLEAR by 105.8% in method-…
Experiments show that APIGen outperforms the best baseline CLEAR by 105.8% in method-…
GI Software with fewer Data Cache Misses
… At the beginning of 2023 OpenCV’s open source repository on GitHub comprised more
than two million lines of code (mostly C, C++ and XML). Therefore, we selected an important …
than two million lines of code (mostly C, C++ and XML). Therefore, we selected an important …
RepoFixEval: A Repository-Level Program Repair Benchmark From Issue Discovering to Bug Fixing
… For the given issues, we primarily assess them from four perspectives: reproducibility Ir,
relevance Iv, explanation Ie and overall Io. Reproducibility (Ir) assesses the presence and quality …
relevance Iv, explanation Ie and overall Io. Reproducibility (Ir) assesses the presence and quality …