SOTAVerified

Code Search

The goal of Code Search is to retrieve code fragments from a large code corpus that most closely match a developer’s intent, which is expressed in natural language.

Source: When Deep Learning Met Code Search
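
As a rough illustration of the task described above, the following is a minimal sketch that ranks code fragments against a natural-language query by cosine similarity over bag-of-words token counts. It is a toy lexical baseline, not the method of any paper listed here; the corpus, the function names (`tokenize`, `cosine`, `search`), and the tokenization rule are all assumptions made for the example.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase and split on non-alphanumeric characters.

    This also splits snake_case identifiers such as read_file into words.
    """
    return [t for t in re.split(r"[^a-z0-9]+", text.lower()) if t]


def cosine(a, b):
    """Cosine similarity between two token-count vectors (Counters)."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def search(query, corpus, top_k=3):
    """Return up to top_k (name, score) pairs, best match first."""
    q = Counter(tokenize(query))
    scored = [(cosine(q, Counter(tokenize(code))), name)
              for name, code in corpus.items()]
    # Sort by descending score, breaking ties by name for determinism.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [(name, score) for score, name in scored[:top_k]]


# Hypothetical three-fragment corpus, invented for this example.
CORPUS = {
    "read_file": "def read_file(path):\n    with open(path) as f:\n        return f.read()",
    "sort_list": "def sort_list(items):\n    return sorted(items)",
    "http_get": "def http_get(url):\n    import urllib.request\n    return urllib.request.urlopen(url).read()",
}

results = search("read a file from a path", CORPUS)
for name, score in results:
    print(f"{name}: {score:.3f}")
```

Neural approaches of the kind listed below generally replace the token-count vectors with learned embeddings of the query and the code, while the ranking step stays conceptually the same.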

Papers

Showing 51-100 of 125 papers

Title | Status | Hype
Repository-level Code Search with Neural Retrieval Methods | Code | 0
OrcaLoca: An LLM Agent Framework for Software Issue Localization | - | 0
On the Compression of Language Models for Code: An Empirical Study on CodeBERT | - | 0
Isotropy Matters: Soft-ZCA Whitening of Embeddings for Semantic Code Search | Code | 0
In-the-loop Hyper-Parameter Optimization for LLM-Based Automated Design of Heuristics | - | 0
Deep Code Search with Naming-Agnostic Contrastive Multi-View Learning | - | 0
Natural Language Outlines for Code: Literate Programming in the LLM Era | - | 0
LLM Agents Improve Semantic Code Search | - | 0
SpecRover: Code Intent Extraction via LLMs | - | 0
Toward Exploring the Code Understanding Capabilities of Pre-trained Code Generation Models | - | 0
CoSQA+: Pioneering the Multi-Choice Code Search Benchmark with Test-Driven Agents | Code | 0
ProCQA: A Large-scale Community-based Programming Question Answering Dataset for Code Search | Code | 0
Code Search Debiasing: Improve Search Results beyond Overall Ranking Performance | - | 0
GenCodeSearchNet: A Benchmark Test Suite for Evaluating Generalization in Programming Language Understanding | Code | 0
TransformCode: A Contrastive Learning Framework for Code Embedding via Subtree Transformation | Code | 0
Noisy Pair Corrector for Dense Retrieval | - | 0
ACES: Generating Diverse Programming Puzzles with Autotelic Generative Models | - | 0
Contrastive Prompt Learning-based Code Search based on Interaction Matrix | - | 0
Code Representation Pre-training with Complements from Program Executions | - | 0
Laminar: A New Serverless Stream-based Framework with Semantic Code Search and Code Completion | - | 0
MELT: Mining Effective Lightweight Transformations from Pull Requests | Code | 0
Evaluating and Optimizing the Effectiveness of Neural Machine Translation in Supporting Code Retrieval Models: A Study on the CAT Benchmark | - | 0
Constructing Multilingual Code Search Dataset Using Neural Machine Translation | Code | 0
CCT-Code: Cross-Consistency Training for Multilingual Clone Detection and Code Search | - | 0
Searching by Code: a New SearchBySnippet Dataset and SnippeR Retrieval Model for Searching by Code Snippets | - | 0
CodeT5+: Open Code Large Language Models for Code Understanding and Generation | Code | 0
Code Execution with Pre-trained Language Models | Code | 0
REINFOREST: Reinforcing Semantic Code Similarity for Cross-Lingual Code Search Models | Code | 0
Neuro-symbolic Zero-Shot Code Cloning with Cross-Language Intermediate Representation | - | 0
Unveiling Code Pre-Trained Models: Investigating Syntax and Semantics Capacities | - | 0
You Don't Know Search: Helping Users Find Code by Automatically Evaluating Alternative Queries | - | 0
Global Contrastive Batch Sampling via Optimization on Sample Permutations | Code | 0
CodeDSI: Differentiable Code Search | - | 0
CSSAM: Code Search via Attention Matching of Code Semantics and Structures | - | 0
NS3: Neuro-Symbolic Semantic Code Search | Code | 0
CoCoSoDa: Effective Contrastive Learning for Code Search | - | 0
On the Transferability of Pre-trained Language Models for Low-Resource Programming Languages | - | 0
Accelerating Code Search with Deep Hashing and Code Classification | - | 0
What Do They Capture? -- A Structural Analysis of Pre-Trained Language Models for Source Code | Code | 0
CodeRetriever: Unimodal and Bimodal Contrastive Learning for Code Search | Code | 0
Generating Clarifying Questions for Query Refinement in Source Code Search | Code | 0
AstBERT: Enabling Language Model for Financial Code Understanding with Abstract Syntax Trees | - | 0
Analyzing CodeBERT's Performance on Natural Language Code Search | - | 0
Better Modeling the Programming World with Code Concept Graphs-augmented Multi-modal Learning | - | 0
Energy-bounded Learning for Robust Models of Code | - | 0
EDAssistant: Supporting Exploratory Data Analysis in Computational Notebooks with In-Situ Code Search and Recommendation | - | 0
Semantic Code Search for Smart Contracts | - | 0
A New Search Paradigm for Natural Language Code Search | - | 0
CodeRetriever: Unimodal and Bimodal Contrastive Learning for Code Search | - | 0
Cascaded Fast and Slow Models for Efficient Semantic Code Search | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | cpt-code M | Overall | 93.5 | - | Unverified
2 | cpt-code S | Overall | 93.4 | - | Unverified
3 | CodeT5+ 770M | Overall | 77.4 | - | Unverified
4 | GraphCodeBERT | Overall | 77.4 | - | Unverified
5 | CodeT5+ 220M | Overall | 77.1 | - | Unverified
6 | CodeBERT | Overall | 76 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Self-attention | Test MRR | 0.84 | - | Unverified
2 | NBOW | Test MRR | 0.81 | - | Unverified
3 | RNN | Test MRR | 0.77 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CodeT5+ 770M | MRR | 44.7 | - | Unverified
2 | CodeT5+ 220M | MRR | 43.3 | - | Unverified
3 | CodeBERT | MRR | 27.19 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Uni-SBT | MRR | 0.36 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CodeBERT | Accuracy | 47.8 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Voyage-code-002 | nDCG@10 | 56.26 | - | Unverified
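
Several of the results above are reported as MRR (mean reciprocal rank): for each query, take the reciprocal of the rank at which the correct code fragment appears, then average over all queries. The sketch below shows that computation under the assumption of exactly one relevant fragment per query; the function name and the sample rankings are hypothetical and are not taken from any benchmark listed here. Values such as 0.84 and 44.7 appear to be the same kind of score on different scales (fraction versus percentage), so they should not be compared directly across tables.

```python
def mean_reciprocal_rank(rankings, relevant):
    """Average of 1/rank of the single relevant item for each query.

    rankings: dict mapping query id -> list of candidate ids, best first.
    relevant: dict mapping query id -> the one correct candidate id.
    A query whose correct candidate is absent contributes 0.
    """
    total = 0.0
    for query_id, ranking in rankings.items():
        target = relevant[query_id]
        if target in ranking:
            total += 1.0 / (ranking.index(target) + 1)  # ranks are 1-based
    return total / len(rankings)


# Hypothetical rankings for three queries whose correct answer is "a".
rankings = {
    "q1": ["a", "b", "c"],  # correct answer at rank 1 -> 1
    "q2": ["b", "a", "c"],  # correct answer at rank 2 -> 1/2
    "q3": ["c", "b", "a"],  # correct answer at rank 3 -> 1/3
}
gold = {"q1": "a", "q2": "a", "q3": "a"}

mrr = mean_reciprocal_rank(rankings, gold)
print(f"MRR = {mrr:.4f} ({100 * mrr:.2f} as a percentage)")
```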