共计 22 篇文章
2025
广义优势估计(Generalized Advantage Estimation,GAE)
    
  
    
    
      
      6D连续旋转表示(On the Continuity of Rotation Representations in Neural Networks)
    
  
    
    
      
      记录一个在运行isaac gym时遇到的bug和修bug过程:cudaExternamMemoryGetMappedBuffer failed on…
    
  
    
    
      
      通过SSH克隆Github仓库时报错The authenticity of host 'github.com' can't be established.
    
  
    
    
      
      强化学习中回报(Return)、价值(Value)、动作价值(Action-Value)和优势(Advantage)的联系
    
  
    
      
      2024
Introduction to Robotics-Stanford 笔记 LEC4
    
  
    
    
      
      多智能体强化学习(MARL)值函数分解——从VDN到QMIX
    
  
    
    
      
      DDPM(Denoising Diffusion Probabilistic Models)论文阅读笔记
    
  
    
    
      
      Introduction to Robotics-Stanford 笔记 LEC3
    
  
    
    
      
      Introduction to Robotics-Stanford 笔记 LEC2