arxiv:2111.08819
Shengyi Costa Huang
vwxyzjn
Research interests
None yet
Organizations
Papers
2
models
27
vwxyzjn/train_policy_accelerate__None__seed1__1695136188
Text Generation
•
Updated
vwxyzjn/train_policy_accelerate-None-seed1
Text Generation
•
Updated
vwxyzjn/testyes4
Text Generation
•
Updated
vwxyzjn/testyes2
Text Generation
•
Updated
vwxyzjn/starcoderbase-triviaqa
Text Generation
•
Updated
•
384
vwxyzjn/starcoderbase-triviaqa1
Text Generation
•
Updated
•
1
vwxyzjn/starcoderbase_1_0_triviaqa
Text Generation
•
Updated
vwxyzjn/Breakout-v5-cleanba_impala_envpool_machado_atari_wrapper-seed1
Reinforcement Learning
•
Updated
vwxyzjn/Breakout-v5-cleanba_ppo_envpool_impala_atari_wrapper-seed1
Reinforcement Learning
•
Updated
vwxyzjn/BigfishHard-v0-cleanba_ppo_envpool_procgen-seed1
Reinforcement Learning
•
Updated