SOTAVerified

Code Completion

Papers

Showing 51100 of 212 papers

TitleStatusHype
LLMSecEval: A Dataset of Natural Language Prompts for Security EvaluationsCode1
LogQuant: Log-Distributed 2-Bit Quantization of KV Cache with Superior Accuracy PreservationCode1
CoCoMIC: Code Completion By Jointly Modeling In-file and Cross-file ContextCode1
Energy-Based Models for Code Generation under Compilability ConstraintsCode1
Empirical Study of Transformers for Source CodeCode1
Learning Deep Semantics for Test CompletionCode1
How Effective Are Neural Networks for Fixing Security VulnerabilitiesCode1
VersiCode: Towards Version-controllable Code GenerationCode1
Language Models for Code Completion: A Practical EvaluationCode1
CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code CompletionCode1
Security Attacks on LLM-based Code Completion ToolsCode1
IRCoder: Intermediate Representations Make Language Models Robust Multilingual Code GeneratorsCode1
Ada-Instruct: Adapting Instruction Generators for Complex ReasoningCode1
Long Code Arena: a Set of Benchmarks for Long-Context Code ModelsCode1
DataSculpt: Crafting Data Landscapes for Long-Context LLMs through Multi-Objective PartitioningCode1
Dataflow-Guided Retrieval Augmentation for Repository-Level Code CompletionCode1
A Syntax-Guided Edit Decoder for Neural Program RepairCode1
LambdaNet: Probabilistic Type Inference using Graph Neural NetworksCode1
CASTLE: Benchmarking Dataset for Static Code Analyzers and LLMs towards CWE DetectionCode1
Can Large Language Models Write Parallel Code?Code1
ReACC: A Retrieval-Augmented Code Completion FrameworkCode1
How to Get Your LLM to Generate Challenging Problems for EvaluationCode1
Syntax-Aware On-the-Fly Code CompletionCode0
Time-Efficient Code Completion Model for the R Programming LanguageCode0
Breaking the Silence: the Threats of Using LLMs in Software EngineeringCode0
Structural Language Models of CodeCode0
Traces of Memorisation in Large Language Models for CodeCode0
CodeT5+: Open Code Large Language Models for Code Understanding and GenerationCode0
CodeMark: Imperceptible Watermarking for Code Datasets against Neural Code Completion ModelsCode0
CodeKGC: Code Language Model for Generative Knowledge Graph ConstructionCode0
CodeGRU: Context-aware Deep Learning with Gated Recurrent Unit for Source Code ModelingCode0
Automating Code-Related Tasks Through Transformers: The Impact of Pre-trainingCode0
Open Vocabulary Learning on Source Code with a Graph-Structured CacheCode0
Code Completion with Neural Attention and Pointer NetworksCode0
Pythia: AI-assisted Code Completion SystemCode0
On the Embeddings of Variables in Recurrent Neural Networks for Source CodeCode0
Neural Software AnalysisCode0
MERGE: Fast Private Text GenerationCode0
ObscuraCoder: Powering Efficient Code LM Pre-Training Via Obfuscation GroundingCode0
GenCodeSearchNet: A Benchmark Test Suite for Evaluating Generalization in Programming Language UnderstandingCode0
Don't Complete It! Preventing Unhelpful Code Completion for Productive and Sustainable Neural Code Completion SystemsCode0
Enriching Source Code with Contextual Data for Code Completion Models: An Empirical StudyCode0
GREEN-CODE: Learning to Optimize Energy Efficiency in LLM-based Code GenerationCode0
Learning to Execute Programs with Instruction Pointer Attention Graph Neural NetworksCode0
Enhancing High-Quality Code Generation in Large Language Models with Comparative Prefix-TuningCode0
A Transformer-Based Approach for Smart Invocation of Automatic Code CompletionCode0
Eclipse CDT code analysis and unit testingCode0
Can You Really Trust Code Copilots? Evaluating Large Language Models from a Code Security PerspectiveCode0
INSPECT: Intrinsic and Systematic Probing Evaluation for Code TransformersCode0
Investigating the Performance of Language Models for Completing Code in Functional Programming Languages: a Haskell Case StudyCode0
Show:102550
← PrevPage 2 of 5Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1deepseek-coder-33b-baseAverage69.01Unverified
2deepseek-coder-6.7b-baseAverage63.4Unverified
3starcoderbaseAverage55.54Unverified
4gpt-4-1106-previewAverage53.28Unverified
5CodeLlama-13b-hfAverage52.78Unverified
6deepseek-coder-1.3b-baseAverage52.63Unverified
7CodeLlama-34b-hfAverage49.66Unverified
8CodeLlama-7b-hfAverage45Unverified
9gpt-3.5-turbo-0301Average40.86Unverified
10incoder-6BAverage33.79Unverified
#ModelMetricClaimedVerifiedStatus
1CodeGPT-adaptedAccuracy (token-level)77.13Unverified
2CodeT5+ 770MEM (line-level)37.9Unverified
3CodeT5+ 220MEM (line-level)35.17Unverified
#ModelMetricClaimedVerifiedStatus
1CodeGPT-adaptedAccuracy (token-level)75.11Unverified
2CodeT5+ 770MEM (line-level)44.86Unverified
3CodeT5+ 220MEM (line-level)43.42Unverified
#ModelMetricClaimedVerifiedStatus
1SantaCoder-MGDCompilation Rate73.03Unverified
2SantaCoderCompilation Rate59.97Unverified
3SantaCoderCompilation Rate59.79Unverified
#ModelMetricClaimedVerifiedStatus
1RamboCompilation Rate76.47Unverified
2RepoCoderCompilation Rate74.02Unverified
#ModelMetricClaimedVerifiedStatus
1RamboCompilation Rate61.7Unverified
2RepoCoderCompilation Rate58.09Unverified