SOTAVerified

OpenAi's GPT4 as coding assistant

2023-09-22Code Available0· sign in to hype

Lefteris Moussiades, George Zografos

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

Lately, Large Language Models have been widely used in code generation. GPT4 is considered the most potent Large Language Model from Openai. In this paper, we examine GPT3.5 and GPT4 as coding assistants. More specifically, we have constructed appropriate tests to check whether the two systems can a) answer typical questions that can arise during the code development, b) produce reliable code, and c) contribute to code debugging. The test results are impressive. The performance of GPT4 is outstanding and signals an increase in the productivity of programmers and the reorganization of software development procedures based on these new tools.

Tasks

Benchmark Results

DatasetModelMetricClaimedVerifiedStatus
TimeQuestionsGPT-4P@130.6Unverified

Reproductions