SOTAVerified

4k

Papers

Showing 151200 of 367 papers

TitleStatusHype
Global-Local Stepwise Generative Network for Ultra High-Resolution Image RestorationCode0
GThinker: Towards General Multimodal Reasoning via Cue-Guided RethinkingCode0
GUDN: A novel guide network with label reinforcement strategy for extreme multi-label text classificationCode0
High Quality Segmentation for Ultra High-resolution ImagesCode0
How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source SuitesCode0
InstructRetro: Instruction Tuning post Retrieval-Augmented PretrainingCode0
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HDCode0
Joint Super-Resolution and Inverse Tone-Mapping: A Feature Decomposition Aggregation Network and A New BenchmarkCode0
KL3M Tokenizers: A Family of Domain-Specific and Character-Level Tokenizers for Legal, Financial, and Preprocessing ApplicationsCode0
KOLOMVERSE: Korea open large-scale image dataset for object detection in the maritime universeCode0
Leveraging Vision-Language Models for Visual Grounding and Analysis of Automotive UICode0
Likelihood Ratios and Generative Classifiers for Unsupervised Out-of-Domain Detection In Task Oriented DialogCode0
Measuring and Addressing Indexical Bias in Information RetrievalCode0
Multimodal Collaboration Networks for Geospatial Vehicle Detection in Dense, Occluded, and Large-Scale EventsCode0
PKU-AIGIQA-4K: A Perceptual Quality Assessment Database for Both Text-to-Image and Image-to-Image AI-Generated ImagesCode0
Progressive Feature Fusion Network for Realistic Image DehazingCode0
REDAffectiveLM: Leveraging Affect Enriched Embedding and Transformer-based Neural Language Model for Readers' Emotion DetectionCode0
Restoring Extremely Dark Images in Real TimeCode0
Robust and Scalable Gaussian Process Regression and Its ApplicationsCode0
Semantics through Time: Semi-supervised Segmentation of Aerial Videos with Iterative Label PropagationCode0
S-BDT: Distributed Differentially Private Boosted Decision TreesCode0
Long Context Compression with Activation BeaconCode0
Solution Concepts in Hierarchical Games under Bounded Rationality with Applications to Autonomous DrivingCode0
Subjective assessment of the impact of a content adaptive optimiser for compressing 4K HDR content with AV1Code0
UAVid: A Semantic Segmentation Dataset for UAV ImageryCode0
UHD-IQA Benchmark Database: Pushing the Boundaries of Blind Photo Quality AssessmentCode0
UIO-LLMs: Unbiased Incremental Optimization for Long-Context LLMsCode0
VeriFastScore: Speeding up long-form factuality evaluationCode0
ViralVectors: Compact and Scalable Alignment-free Virome Feature GenerationCode0
Zoomed In, Diffused Out: Towards Local Degradation-Aware Multi-Diffusion for Extreme Image Super-ResolutionCode0
Good Semi-supervised VAE Requires Tighter Evidence Lower Bound0
Lightweight Hardware Transform Design for the Versatile Video Coding 4K ASIC Decoders0
Lightweight hardware implementation of VVC transform block for ASIC decoder0
Lightweight Portrait Matting via Regional Attention and Refinement0
USTC-TD: A Test Dataset and Benchmark for Image and Video Coding in 2020s0
Global Priors Guided Modulation Network for Joint Super-Resolution and Inverse Tone-Mapping0
GAEA: A Geolocation Aware Conversational Model0
FuseSR: Super Resolution for Real-time Rendering through Efficient Multi-resolution Fusion0
From Informal to Formal -- Incorporating and Evaluating LLMs on Natural Language Requirements to Verifiable Formal Proofs0
LoLA: Low-Rank Linear Attention With Sparse Caching0
Finding a Needle in a Haystack: Tiny Flying Object Detection in 4K Videos using a Joint Detection-and-Tracking Approach0
LongFin: A Multimodal Document Understanding Model for Long Financial Domain Documents0
LongIns: A Challenging Long-context Instruction-based Exam for LLMs0
Fewshot learning on global multimodal embeddings for earth observation tasks0
LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs0
FERV39k: A Large-Scale Multi-Scene Dataset for Facial Expression Recognition in Videos0
LUT-GCE: Lookup Table Global Curve Estimation for Fast Low-light Image Enhancement0
Feature-level Rating System using Customer Reviews and Review Votes0
Machine Translation for Ge'ez Language0
Exploring Generalizable Pre-training for Real-world Change Detection via Geometric Estimation0
Show:102550
← PrevPage 4 of 8Next →

No leaderboard results yet.