SOTAVerified

Code Search

The goal of Code Search is to retrieve code fragments from a large code corpus that most closely match a developer’s intent, which is expressed in natural language.

Source: When Deep Learning Met Code Search

Papers

Showing 76100 of 125 papers

TitleStatusHype
CSSAM:Code Search via Attention Matching of Code Semantics and Structures0
Deep Code Search with Naming-Agnostic Contrastive Multi-View Learning0
DeepRTL2: A Versatile Model for RTL-Related Tasks0
Distilling Transformers for Neural Cross-Domain Search0
EDAssistant: Supporting Exploratory Data Analysis in Computational Notebooks with In-Situ Code Search and Recommendation0
Energy-bounded Learning for Robust Models of Code0
CoCoSoDa: Effective Contrastive Learning for Code Search0
Evaluating and Optimizing the Effectiveness of Neural Machine Translation in Supporting Code Retrieval Models: A Study on the CAT Benchmark0
Evaluation of Siamese Networks for Semantic Code Search0
Generating Code with the Help of Retrieved Template Functions and Stack Overflow Answers0
In-the-loop Hyper-Parameter Optimization for LLM-Based Automated Design of Heuristics0
Unveiling Code Pre-Trained Models: Investigating Syntax and Semantics Capacities0
OASIS: Order-Augmented Strategy for Improved Code Search0
MoSE: Hierarchical Self-Distillation Enhances Early Layer Embeddings0
On the Compression of Language Models for Code: An Empirical Study on CodeBERT0
On the Transferability of Pre-trained Language Models for Low-Resource Programming Languages0
OrcaLoca: An LLM Agent Framework for Software Issue Localization0
SCELMo: Source Code Embeddings from Language Models0
Searching by Code: a New SearchBySnippet Dataset and SnippeR Retrieval Model for Searching by Code Snippets0
Semantic approaches to software component retrieval with English queries0
Semantic Code Search for Smart Contracts0
Semantic Source Code Search: A Study of the Past and a Glimpse at the Future0
CodeMatcher: Searching Code Based on Sequential Semantics of Important Query Words0
Toward Exploring the Code Understanding Capabilities of Pre-trained Code Generation Models0
Towards Leveraging Large Language Model Summaries for Topic Modeling in Source Code0
Show:102550
← PrevPage 4 of 5Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1cpt-code MOverall93.5Unverified
2cpt-code SOverall93.4Unverified
3CodeT5+ 770MOverall77.4Unverified
4GraphCodeBERTOverall77.4Unverified
5CodeT5+ 220MOverall77.1Unverified
6CodeBERTOverall76Unverified
#ModelMetricClaimedVerifiedStatus
1Self-attentionTest MRR0.84Unverified
2NBOWTest MRR0.81Unverified
3RNNTest MRR0.77Unverified
#ModelMetricClaimedVerifiedStatus
1CodeT5+ 770MMRR44.7Unverified
2CodeT5+ 220MMRR43.3Unverified
3CodeBERTMRR27.19Unverified
#ModelMetricClaimedVerifiedStatus
1Uni-SBTMRR0.36Unverified
#ModelMetricClaimedVerifiedStatus
1CodeBERTAccuracy47.8Unverified
#ModelMetricClaimedVerifiedStatus
1Voyage-code-002nDCG@1056.26Unverified