SOTAVerified

Code Search

The goal of Code Search is to retrieve code fragments from a large code corpus that most closely match a developer’s intent, which is expressed in natural language.

Source: When Deep Learning Met Code Search
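
As a rough illustration of the task defined above, the following sketch ranks a tiny in-memory corpus of code fragments against a natural-language query using bag-of-words cosine similarity. It is a lexical baseline written for illustration only, not the method of any paper listed on this page; the names `tokenize`, `cosine`, `search`, and `CORPUS` are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens.

    camelCase identifiers are split first; snake_case is split
    implicitly because underscores are not matched by the regex.
    """
    text = re.sub(r"([a-z])([A-Z])", r"\1 \2", text)
    return re.findall(r"[a-z]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two token-count vectors (Counters)."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def search(query, corpus, top_k=3):
    """Return the names of the top_k fragments most similar to the query."""
    q = Counter(tokenize(query))
    scored = [(cosine(q, Counter(tokenize(code))), name)
              for name, code in corpus.items()]
    scored.sort(key=lambda pair: -pair[0])  # highest score first
    return [name for _, name in scored[:top_k]]


# Hypothetical toy corpus: fragment name -> source text.
CORPUS = {
    "read_file": "def read_file(path):\n    with open(path) as f:\n        return f.read()",
    "sort_list": "def sort_list(items):\n    return sorted(items)",
    "http_get": "def http_get(url):\n    import urllib.request\n    return urllib.request.urlopen(url).read()",
}
```

Calling `search("read a file from a path", CORPUS)` ranks `read_file` first because it shares the tokens `read`, `file`, and `path` with the query. Neural approaches typically replace the token-count vectors with learned embeddings while keeping the same rank-by-similarity structure.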

Papers

Showing 51–75 of 125 papers

Title | Status | Hype
Repository-level Code Search with Neural Retrieval Methods | Code | 0
CoSQA+: Pioneering the Multi-Choice Code Search Benchmark with Test-Driven Agents | Code | 0
TransformCode: A Contrastive Learning Framework for Code Embedding via Subtree Transformation | Code | 0
Global Contrastive Batch Sampling via Optimization on Sample Permutations | Code | 0
Isotropy Matters: Soft-ZCA Whitening of Embeddings for Semantic Code Search | Code | 0
On the Compression of Language Models for Code: An Empirical Study on CodeBERT | - | 0
On the Transferability of Pre-trained Language Models for Low-Resource Programming Languages | - | 0
OrcaLoca: An LLM Agent Framework for Software Issue Localization | - | 0
SCELMo: Source Code Embeddings from Language Models | - | 0
Searching by Code: a New SearchBySnippet Dataset and SnippeR Retrieval Model for Searching by Code Snippets | - | 0
Semantic approaches to software component retrieval with English queries | - | 0
Semantic Code Search for Smart Contracts | - | 0
Semantic Source Code Search: A Study of the Past and a Glimpse at the Future | - | 0
CodeMatcher: Searching Code Based on Sequential Semantics of Important Query Words | - | 0
Toward Exploring the Code Understanding Capabilities of Pre-trained Code Generation Models | - | 0
Towards Leveraging Large Language Model Summaries for Topic Modeling in Source Code | - | 0
URECA: The Chain of Two Minimum Set Cover Problems exists behind Adaptation to Shifts in Semantic Code Search | - | 0
Variable-Length Hashing | - | 0
What Do They Capture? -- A Structural Analysis of Pre-Trained Language Models for Source Code | - | 0
You Don't Know Search: Helping Users Find Code by Automatically Evaluating Alternative Queries | - | 0
LLM Agents Improve Semantic Code Search | - | 0
Accelerating Code Search with Deep Hashing and Code Classification | - | 0
ACES: Generating Diverse Programming Puzzles with Autotelic Generative Models | - | 0
A Language-Agnostic Model for Semantic Source Code Labeling | - | 0
A Multi-Perspective Architecture for Semantic Code Search | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | cpt-code M | Overall | 93.5 | - | Unverified
2 | cpt-code S | Overall | 93.4 | - | Unverified
3 | CodeT5+ 770M | Overall | 77.4 | - | Unverified
4 | GraphCodeBERT | Overall | 77.4 | - | Unverified
5 | CodeT5+ 220M | Overall | 77.1 | - | Unverified
6 | CodeBERT | Overall | 76 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Self-attention | Test MRR | 0.84 | - | Unverified
2 | NBOW | Test MRR | 0.81 | - | Unverified
3 | RNN | Test MRR | 0.77 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CodeT5+ 770M | MRR | 44.7 | - | Unverified
2 | CodeT5+ 220M | MRR | 43.3 | - | Unverified
3 | CodeBERT | MRR | 27.19 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Uni-SBT | MRR | 0.36 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CodeBERT | Accuracy | 47.8 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Voyage-code-002 | nDCG@10 | 56.26 | - | Unverified
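
The tables above report MRR and nDCG@10 without defining them. The sketch below shows one common way to compute each metric; the function names are hypothetical, and it assumes the ideal ranking for nDCG is built from the same list of relevance grades that is passed in (benchmarks usually compute it over all relevant items for the query). Note that the tables mix scales: some MRR values are on a 0 to 1 scale and others are multiplied by 100.

```python
import math


def mean_reciprocal_rank(first_relevant_ranks):
    """Mean reciprocal rank over a set of queries.

    first_relevant_ranks holds, per query, the 1-based rank of the first
    relevant result, or None if no relevant result was retrieved
    (such a query contributes 0 to the mean).
    """
    total = sum(1.0 / r for r in first_relevant_ranks if r)
    return total / len(first_relevant_ranks)


def ndcg_at_k(relevances, k=10):
    """Normalized discounted cumulative gain over the top k results.

    relevances is the list of graded relevance values in ranked order.
    Assumption: the ideal ordering is the same list sorted descending.
    """
    def dcg(rels):
        # Position i (0-based) is discounted by log2(i + 2).
        return sum(rel / math.log2(i + 2) for i, rel in enumerate(rels[:k]))

    ideal = dcg(sorted(relevances, reverse=True))
    return dcg(relevances) / ideal if ideal else 0.0
```

For example, `mean_reciprocal_rank([1, 2, 4])` averages 1, 1/2, and 1/4, and `ndcg_at_k([1, 0, 0])` is 1.0 because the only relevant result is already ranked first.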