Models

72,229

Full-text search

Active filters: reinforcement-learning

nvidia/GEAR-SONIC

Reinforcement Learning • Updated 7 days ago • 29

Adilbai/stock-trading-rl-agent

Reinforcement Learning • Updated Jan 8 • 121 • 136

nvidia/EGM-8B

Image-Text-to-Text • 9B • Updated 8 days ago • 191 • 5

nvidia/EGM-4B

Image-Text-to-Text • 5B • Updated 8 days ago • 594 • 6

zai-org/GLM-TTS

Text-to-Speech • Updated Jan 12 • 2.35k • 334

exla-ai/openpie-0.6

Robotics • Updated Feb 4 • 156 • 19

ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-Q8

Reinforcement Learning • 8B • Updated Mar 28, 2025 • 24.6k • 202

NousResearch/DeepHermes-ToolCalling-Specialist-Atropos

Reinforcement Learning • 8B • Updated Apr 28, 2025 • 37 • 16

One-RL-to-See-Them-All/Orsta-7B

Image-Text-to-Text • 8B • Updated Jun 4, 2025 • 17 • 12

ValueFX9507/Tifa-DeepsexV3-14b-GGUF-Q6

Reinforcement Learning • 15B • Updated Jul 1, 2025 • 15.6k • 42

PhysicsWallahAI/Aryabhata-1.0

Text Generation • 8B • Updated Aug 13, 2025 • 276 • 111

InfiX-ai/InfiGUI-G1-7B

Image-Text-to-Text • 8B • Updated Aug 12, 2025 • 39 • 11

PRIME-RL/P1-235B-A22B

Text Generation • 235B • Updated Oct 24, 2025 • 33 • 20

chaseungjoon/wildfire-prediction-A3C-LSTM

Reinforcement Learning • Updated Dec 8, 2025 • 3 • 1

Maincode/Maincoder-1B-ONNX

Text Generation • Updated Dec 30, 2025 • 13 • 4

PrimeIntellect/INTELLECT-3.1

Text Generation • 107B • Updated Feb 18 • 261 • 42

AQ-MedAI/PulseMind-72B

Image-Text-to-Text • 73B • Updated Jan 30 • 15 • 1

bengusu80/humanoid-light-switch-policy-model

Reinforcement Learning • Updated Feb 26 • 1

XunmeiLiu/VFIG-4B

Reinforcement Learning • 4B • Updated 22 days ago • 257 • 5

AaryanK/ModelGate

Text Classification • 2B • Updated 26 days ago • 99 • 4

batteryphil/mamba-2.8b-latent

Text Generation • 3B • Updated 4 days ago • 1.33k • 3

JosedelaPepe/ppo-LunarLander-v2

Reinforcement Learning • Updated about 13 hours ago • 19 • 1

vivekvish2004/openenv-customer-support

Reinforcement Learning • Updated 5 days ago • 1

Accio-Lab/Metis-8B-RL

Image-Text-to-Text • 9B • Updated 7 days ago • 179 • 1

hongli-zhan/MINT-empathy-Qwen3-1.7B

Text Generation • 2B • Updated 1 day ago • 798 • 1

hongli-zhan/MINT-empathy-Qwen3-4B

Text Generation • 4B • Updated 1 day ago • 807 • 1

yssnn04/ppo-LunarLander-v3

Reinforcement Learning • Updated 6 days ago • 50 • 1

Lingyu-Lingluo/ppo-LunarLander-v2

Reinforcement Learning • Updated 6 days ago • 38 • 1

mradermacher/Vero-MiMo-7B-i1-GGUF

Reinforcement Learning • 8B • Updated 5 days ago • 2.7k • 1

eddy1111111/Qwen3.5-122B-A10B-ReActsft-nvfp4

Text Generation • 62B • Updated 5 days ago • 234 • 1