SOTAVerified

Code Search

The goal of Code Search is to retrieve code fragments from a large code corpus that most closely match a developer’s intent, which is expressed in natural language.

Source: When Deep Learning Met Code Search

Papers

Showing 51100 of 125 papers

TitleStatusHype
Repository-level Code Search with Neural Retrieval MethodsCode0
CoSQA+: Pioneering the Multi-Choice Code Search Benchmark with Test-Driven AgentsCode0
TransformCode: A Contrastive Learning Framework for Code Embedding via Subtree TransformationCode0
Global Contrastive Batch Sampling via Optimization on Sample PermutationsCode0
Isotropy Matters: Soft-ZCA Whitening of Embeddings for Semantic Code SearchCode0
On the Compression of Language Models for Code: An Empirical Study on CodeBERT0
On the Transferability of Pre-trained Language Models for Low-Resource Programming Languages0
OrcaLoca: An LLM Agent Framework for Software Issue Localization0
SCELMo: Source Code Embeddings from Language Models0
Searching by Code: a New SearchBySnippet Dataset and SnippeR Retrieval Model for Searching by Code Snippets0
Semantic approaches to software component retrieval with English queries0
Semantic Code Search for Smart Contracts0
Semantic Source Code Search: A Study of the Past and a Glimpse at the Future0
CodeMatcher: Searching Code Based on Sequential Semantics of Important Query Words0
Toward Exploring the Code Understanding Capabilities of Pre-trained Code Generation Models0
Towards Leveraging Large Language Model Summaries for Topic Modeling in Source Code0
URECA: The Chain of Two Minimum Set Cover Problems exists behind Adaptation to Shifts in Semantic Code Search0
Variable-Length Hashing0
What Do They Capture? -- A Structural Analysis of Pre-Trained Language Models for Source Code0
You Don't Know Search: Helping Users Find Code by Automatically Evaluating Alternative Queries0
LLM Agents Improve Semantic Code Search0
Accelerating Code Search with Deep Hashing and Code Classification0
ACES: Generating Diverse Programming Puzzles with with Autotelic Generative Models0
A Language-Agnostic Model for Semantic Source Code Labeling0
A Multi-Perspective Architecture for Semantic Code Search0
Analyzing CodeBERT's Performance on Natural Language Code Search0
A New Search Paradigm for Natural Language Code Search0
AstBERT: Enabling Language Model for Financial Code Understanding with Abstract Syntax Trees0
A Study on Mixup-Inspired Augmentation Methods for Software Vulnerability Detection0
A Systematic Review of Automated Query Reformulations in Source Code Search0
Bag-of-Words Baselines for Semantic Code Search0
BERT2Code: Can Pretrained Language Models be Leveraged for Code Search?0
Better Modeling the Programming World with Code Concept Graphs-augmented Multi-modal Learning0
Beyond Natural Language Perplexity: Detecting Dead Code Poisoning in Code Generation Datasets0
Cascaded Fast and Slow Models for Efficient Semantic Code Search0
CCT-Code: Cross-Consistency Training for Multilingual Clone Detection and Code Search0
Clone-Seeker: Effective Code Clone Search Using Annotations0
SynCoBERT: Syntax-Guided Multi-Modal Contrastive Pre-Training for Code Representation0
CodeDSI: Differentiable Code Search0
Code Execution with Pre-trained Language Models0
Code Representation Pre-training with Complements from Program Executions0
CodeRetriever: Unimodal and Bimodal Contrastive Learning for Code Search0
CodeRetriever: Unimodal and Bimodal Contrastive Learning for Code Search0
CoDesc: A Large Code-Description Parallel Dataset0
Code Search Debiasing:Improve Search Results beyond Overall Ranking Performance0
Contrastive Learning of Natural Language and Code Representations for Semantic Code Search0
Contrastive Prompt Learning-based Code Search based on Interaction Matrix0
COSEA: Convolutional Code Search with Layer-wise Attention0
Crowd Sourced Data Analysis: Mapping of Programming Concepts to Syntactical Patterns0
CSSAM:Code Search via Attention Matching of Code Semantics and Structures0
Show:102550
← PrevPage 2 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1cpt-code MOverall93.5Unverified
2cpt-code SOverall93.4Unverified
3CodeT5+ 770MOverall77.4Unverified
4GraphCodeBERTOverall77.4Unverified
5CodeT5+ 220MOverall77.1Unverified
6CodeBERTOverall76Unverified
#ModelMetricClaimedVerifiedStatus
1Self-attentionTest MRR0.84Unverified
2NBOWTest MRR0.81Unverified
3RNNTest MRR0.77Unverified
#ModelMetricClaimedVerifiedStatus
1CodeT5+ 770MMRR44.7Unverified
2CodeT5+ 220MMRR43.3Unverified
3CodeBERTMRR27.19Unverified
#ModelMetricClaimedVerifiedStatus
1Uni-SBTMRR0.36Unverified
#ModelMetricClaimedVerifiedStatus
1CodeBERTAccuracy47.8Unverified
#ModelMetricClaimedVerifiedStatus
1Voyage-code-002nDCG@1056.26Unverified