RL/ ├── common/ │ ├── networks.py # MLP, Q-net, categorical/Gaussian/squashed actors, critics │ ├── buffers.py # ReplayBuffer (off-policy ...
1 All authors are with Machine Vision and Autonomous System Laboratory, Shanghai Jiao Tong University, Shanghai, China. 2 Chengyuan Luo is with SJTU-Paris Elite Institute of Technology, Shanghai Jiao ...