daily_AIOR_report

2026-05-20

WavFlow: Audio Generation in Waveform Space

Symmetry-Compatible Principle for Optimizer Design: Embeddings, LM Heads, SwiGLU MLPs, and MoE Routers

DexHoldem: Playing Texas Hold’em with Dexterous Embodied System

TopoPrimer: The Missing Topological Context in Forecasting Models

SafeDiffusion-R1: Online Reward Steering for Safe Diffusion Post-Training

AgentKernelArena: Generalization-Aware Benchmarking of GPU Kernel Optimization Agents

AtlasVA: Self-Evolving Visual Skill Memory for Teacher-Free VLM Agents

Code as Agent Harness

Lance: Unified Multimodal Modeling by Multi-Task Synergy

KVPO: ODE-Native GRPO for Autoregressive Video Alignment via KV Semantic Exploration

Auditing Agent Harness Safety

MetaAgent-X: Breaking the Ceiling of Automatic Multi-Agent Systems via End-to-End Reinforcement Learning

CM-EVS: Sparse Panoramic RGB-D-Pose Data for Complete Scene Coverage

PhysBrain 1.0 Technical Report

From Plans to Pixels: Learning to Plan and Orchestrate for Open-Ended Image Editing

Learning from Failures: Correction-Oriented Policy Optimization with Verifiable Rewards

Nudging Beyond the Comfort Zone: Efficient Strategy-Guided Exploration for RLVR

MMSkills: Towards Multimodal Skills for General Visual Agents

FFAvatar: Few-Shot, Feed-Forward, and Generalizable Avatar Reconstruction

Flash-GRPO: Efficient Alignment for Video Diffusion via One-Step Policy Optimization

HodgeCover: Higher-Order Topological Coverage Drives Compression of Sparse Mixture-of-Experts

No new OR-related research today.

📅 2026-05-17

No new OR-related research today.

📅 2026-05-16

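After analyzing the provided paper list under the mandatory screening criteria, the following high-quality studies in the "AI + operations research" interdisciplinary area were selected: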


Learning Local Communication in Large-Scale Multi-Agent Path Planning