Reinforcement Learning related models
Davide Buoso
lambdavi
AI & ML interests
PhD Student @ VANDAL (Polytechnic University of Turin).
Interested in the intersection of Robotics and Generative AI.
Organizations
None yet
models
18
lambdavi/span-marker-luke-legal
Token Classification
•
0.3B
•
Updated
•
8
•
3
lambdavi/legal-luke-base-ner
Token Classification
•
0.3B
•
Updated
•
2
•
1
lambdavi/Reinforce-Pixelcopter-PLE-v0
Reinforcement Learning
•
Updated
lambdavi/ppo-Pyramids
Reinforcement Learning
•
Updated
•
5
lambdavi/a2c-PandaReachDense-v3
Reinforcement Learning
•
Updated
•
2
lambdavi/ddpg-PandaReach-v3
Reinforcement Learning
•
Updated
lambdavi/ppo-SnowballTarget
Reinforcement Learning
•
Updated
lambdavi/Reinforce-CartPole-v1
Reinforcement Learning
•
Updated
lambdavi/span-marker-luke-base-conll2003
Token Classification
•
0.3B
•
Updated
•
3
•
2
lambdavi/luke-base_finetuned_conll2003
Token Classification
•
0.3B
•
Updated
•
2
datasets
0
None public yet