当前位置: 首页 > article >正文

CVPR 2024 图像、视频处理总汇(视频字幕、图像超分辨率、图像分类和压缩等)

1、Image/Video Captioning(图像/视频字幕)

  • Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation
  • Polos: Multimodal Metric Learning from Human Feedback for Image Captioning
    ⭐code
    🏠project
  • Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
    ⭐code
  • MeaCap: Memory-Augmented Zero-shot Image Captioning
    ⭐code
  • Sieve: Multimodal Dataset Pruning using Image Captioning Models
  • [EVCap: Retrieval-Augmented Image Captioning with External Visual--Name Memory for Open-World Comprehension]
  • EVCap: Retrieval-Augmented Image Captioning with External Visual-Name Memory for Open-World Comprehension
  • 视频描述/字幕
    • Streaming Dense Video Captioning
      ⭐code
      ⭐code
    • Video ReCap: Recursive Captioning of Hour-Long Videos
      ⭐code
      🏠project
      🌻dataset
    • Do You Remember? Dense Video Captioning with Cross-Modal Memory Retrieval
    • VideoCon: Robust Video-Language Alignment via Contrast Captions
      ⭐code
      🏠project
    • Retrieval-Augmented Egocentric Video Captioning
  • 密集字幕
    • A Picture is Worth More Than 77 Text Tokens: Evaluating CLIP-Style Models on Dense Captions
    • DIBS: Enhancing Dense Video Captioning with Unlabeled Videos via Pseudo Boundary Enrichment and Online Refinement
  • 生成图解说明
    • Generating Illustrated Instructions
      ⭐code
      🏠project

2、Image/Video Compression(图像/视频压缩)

  • 视频压缩
    • Neural Video Compression with Feature Modulation
      ⭐code
    • C3: High-Performance and Low-Complexity Neural Compression from a Single Image or Video
      ⭐code
      🏠project
    • Task-Aware Encoder Control for Deep Video Compression
  • 图像压缩
    • Towards Backward-Compatible Continual Learning of Image Compression
      ⭐code
    • Generative Latent Coding for Ultra-Low Bitrate Image Compression
    • Dual Prior Unfolding for Snapshot Compressive Imaging
    • Enhancing Quality of Compressed Images by Mitigating Enhancement Bias Towards Compression Domain
    • SCINeRF: Neural Radiance Fields from a Snapshot Compressive Image
      ⭐code
    • JDEC: JPEG Decoding via Enhanced Continuous Cosine CoefficientsJPEG 解码
    • Learned Lossless Image Compression based on Bit Plane Slicing

3、Image/Video Super-Resolution(图像超分辨率)

  • Image Processing GNN: Breaking Rigidity in Super-Resolution
  • Learning Large-Factor EM Image Super-Resolution with Generative Priors
  • Super-Resolution Reconstruction from Bayer-Pattern Spike Streams
  • Continuous Optical Zooming: A Benchmark for Arbitrary-Scale Image Super-Resolution in Real World
  • Transcending the Limit of Local Window: Advanced Super-Resolution Transformer with Adaptive Token Dictionary
    ⭐code
  • Learning Coupled Dictionaries from Unpaired Data for Image Super-Resolution
  • SinSR: Diffusion-Based Image Super-Resolution in a Single Step
    ⭐code
  • CAMixerSR: Only Details Need More "Attention"
  • Text-guided Explorable Image Super-resolution
  • CFAT: Unleashing Triangular Windows for Image Super-resolution
  • SeD: Semantic-Aware Discriminator for Image Super-Resolution
  • Training Generative Image Super-Resolution Models by Wavelet-Domain Losses Enables Better Control of Artifacts
  • Boosting Flow-based Generative Super-Resolution Models via Learned Prior
    ⭐code
  • Beyond Image Super-Resolution for Image Recognition with Task-Driven Perceptual Loss
    ⭐code
  • AdaBM: On-the-Fly Adaptive Bit Mapping for Image Super-Resolution
    ⭐code
  • Uncertainty-Aware Source-Free Adaptive Image Super-Resolution with Wavelet Augmentation Transformer
  • DiSR-NeRF: Diffusion-Guided View-Consistent Super-Resolution NeRF超分辨率
  • Neural Super-Resolution for Real-time Rendering with Radiance Demodulation
  • Self-Adaptive Reality-Guided Diffusion for Artifact-Free Super-Resolution
    ⭐code
  • Low-Res Leads the Way: Improving Generalization for Super-Resolution by Self-Supervised Learning
  • CoSeR: Bridging Image and Language for Cognitive Super-Resolution
    ⭐code
    🏠project
  • Navigating Beyond Dropout: An Intriguing Solution towards Generalizable Image Super Resolution
  • Bilateral Event Mining and Complementary for Event Stream Super-Resolution
  • 盲图像超分辨率
    • CDFormer: When Degradation Prediction Embraces Diffusion Model for Blind Image Super-Resolution
    • A Dynamic Kernel Prior Model for Unsupervised Blind Image Super-Resolution
  • 真实世界超分辨率 Universal Robustness via Median Randomized Smoothing for Real-World Super-Resolution
    • APISR: Anime Production Inspired Real-World Anime Super-Resolution
    • SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution
  • VSR
    • Learning Spatial Adaptation and Temporal Coherence in Diffusion Models for Video Super-Resolution
    • Enhancing Video Super-Resolution via Implicit Resampling-based Alignment
    • Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution
      🏠project
    • Video Super-Resolution Transformer with Masked Inter&Intra-Frame Attention
      ⭐code
  • 文本图像超分
    • Diffusion-based Blind Text Image Super-Resolution

4、Image Classification(图像分类)

  • Fair-VPT: Fair Visual Prompt Tuning for Image Classification
  • Logarithmic Lenses: Exploring Log RGB Data for Image Classification
  • SLICE: Stabilized LIME for Consistent Explanations for Image Classification
  • Classes Are Not Equal: An Empirical Study on Image Recognition Fairness
  • MCPNet: An Interpretable Classifier via Multi-Level Concept Prototypes
  • SURE: SUrvey REcipes for building reliable and robust deep networks
    ⭐code
  • A Bayesian Approach to OOD Robustness in Image Classification
  • Fourier-basis Functions to Bridge Augmentation Gap: Rethinking Frequency Augmentation in Image Classification
  • Hyperspherical Classification with Dynamic Label-to-Prototype Assignment
    ⭐code
  • Discover and Mitigate Multiple Biased Subgroups in Image Classifiers
    ⭐code
  • Deep Imbalanced Regression via Hierarchical Classification Adjustment
  • Large Language Models are Good Prompt Learners for Low-Shot Image Classification
    ⭐code
  • Modeling Collaborator: Enabling Subjective Vision Classification With Minimal Human Effort via LLM Tool-Use
  • Bayesian Exploration of Pre-trained Models for Low-shot Image Classification
  • Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model
  • Leveraging Cross-Modal Neighbor Representation for Improved CLIP Classification
    ⭐code
  • In-distribution Public Data Synthesis with Diffusion Models for Differentially Private Image Classification
  • 域泛化图像分类
    • Leveraging Vision-Language Models for Improving Domain Generalization in Image Classification
      🏠project
  • 长尾识别
    • LTGC: Long-tail Recognition via Leveraging LLMs-driven Generated Content
  • 小样本图像分类
    • Frozen Feature Augmentation for Few-Shot Image Classification
  • 零样本分类
    • Label Propagation for Zero-shot Classification with Vision-Language Models
      ⭐code
    • CARZero: Cross-Attention Alignment for Radiology Zero-Shot Classification
      ⭐code
    • Improved Zero-Shot Classification by Adapting VLMs with Text Descriptions
      ⭐code零样本分类
  • 细粒度
    • Fine-grained Bipartite Concept Factorization for Clustering
    • Novel Class Discovery for Ultra-Fine-Grained Visual Categorization
      ⭐code
  • 开集分类
    • ProTeCt: Prompt Tuning for Taxonomic Open Set Classification
  • 小样本识别
    • Instance-based Max-margin for Practical Few-shot Recognition
  • GCD(广义类别发现)
    • Federated Generalized Category Discovery
    • Active Generalized Category Discovery
      ⭐code
    • Contrastive Mean-Shift Learning for Generalized Category Discovery
    • Solving the Catastrophic Forgetting Problem in Generalized Category Discovery

http://www.kler.cn/a/512778.html

相关文章:

  • EDI安全:2025年数据保护与隐私威胁应对策略
  • JavaScript语言的多线程编程
  • ubuntu系统文件查找、关键字搜索
  • python学opencv|读取图像(三十九 )阈值处理Otsu方法
  • 支持向量机SVM的应用案例
  • Mysql 主从复制原理及其工作过程,配置一主两从实验
  • HotSpot JVM中的两种模式
  • 大华Java开发面试题及参考答案 (上)
  • Java中List集合的面试试题及答案解析
  • Flask:后端框架使用
  • 【Linux】Linux命令:curl
  • 论文笔记-NeruIPS2024-LLM-ESR
  • JavaEE:多线程进阶
  • vue3 hooks例子
  • Go语言-学习一
  • 网络安全:信息时代的守护者
  • JWT(JSON Web Token)
  • ChemLLM化学大模型再升级,AI助力化学研究
  • 【Python使用】嘿马头条项目从到完整开发教程第10篇:APScheduler定时任务,1. 什么是RPC【附代码文档】
  • 【2024年华为OD机试】(A卷,100分)- 完美走位 (Java JS PythonC/C++)
  • 周末总结(2024/01/18)
  • 面试--你的数据库中密码是如何存储的?
  • 《offer 来了:Java 面试核心知识点精讲 -- 框架篇》(附资源)
  • 【Elasticsearch】分片与副本机制:优化数据存储与查询性能
  • 在Windows/Linux/MacOS C++程序中打印崩溃调用栈和局部变量信息
  • C/C++ 时间复杂度(On)