墙裂推荐:想获取更多前沿论文算法优化idea冲击顶会或发表专利,包含目标检测目标跟踪图像分割视频分割Visual Grounding可见光红外融合多任务学习多模态基础模型文生图自动驾驶BEV占用预测具身智能VLA深度估计动作识别表情识别三维重建、点云3D检测医学图像分割医学图像目标检测医学大模型缺陷检测异常检测遥感图像分割遥感图像变化检测数字人知识蒸馏、视频理解、3D生成、姿态估计、图像增强、人群/目标计数、视频编辑、图像去雨等众多主题,请参考:https://qcno08je5sgu.feishu.cn/

1.【图像融合】UniFusion: A Unified Image Fusion Framework with Robust Representation and Source-Aware Preservation

2.【多模态大模型】UAVBench and UAVIT-1M: Benchmarking and Enhancing MLLMs for Low-Altitude UAV Vision-Language Understanding

3.【多模态大模型】Deeper Thought, Weaker Aim: Understanding and Mitigating Perceptual Impairment during Reasoning in Multimodal Large Language Models

4.【医学大模型】(ICLR2026)How Do Medical MLLMs Fail? A Study on Visual Grounding in Medical Images

5.【行人重识别】(CVPR2026)BIT: Matching-based Bi-directional Interaction Transformation Network for Visible-Infrared Person Re-Identification

6.【数字人】AvatarForcing: One-Step Streaming Talking Avatars via Local-Future Sliding-Window Denoising

7.【视觉语言导航】AerialVLA: A Vision-Language-Action Model for UAV Navigation via Minimalist End-to-End Control

8.【视觉语言导航】(ICLR2026)All-day Multi-scenes Lifelong Vision-and-Language Navigation with Tucker Adaptation

9.【文生图】Fair Benchmarking of Emerging One-Step Generative Models Against Multistep Diffusion and Flow Models

10.【文生视频】Early Failure Detection and Intervention in Video Diffusion Models

11.【文生视频】Seeking Physics in Diffusion Noise

12.【图像生成】Representation Alignment for Just Image Transformers is not Easier than You Think

群内包含目标检测、图像分割、目标跟踪、Transformer、多模态、NeRF、GAN、缺陷检测、显著目标检测、关键点检测、超分辨率重建、SLAM、人脸、OCR、生物医学图像、三维重建、姿态估计、自动驾驶感知、深度估计、视频理解、行为识别、图像去雾、图像去雨、图像修复、图像检索、车道线检测、点云目标检测、点云分割、图像压缩、运动预测、神经网络量化、网络部署等多个领域的大佬,不定期分享技术知识、面试技巧和内推招聘信息

Logo

AtomGit 是由开放原子开源基金会联合 CSDN 等生态伙伴共同推出的新一代开源与人工智能协作平台。平台坚持“开放、中立、公益”的理念,把代码托管、模型共享、数据集托管、智能体开发体验和算力服务整合在一起,为开发者提供从开发、训练到部署的一站式体验。

更多推荐