Nek's picture

1 11 9

Nek

Rob1234567

·

AI & ML interests

None yet

Recent Activity

new activity about 3 hours ago

nvidia/Nemotron-Agentic-v1:Inquiry regarding Banking Domain data mentioned in Nemotron 3 Nano Paper (arXiv:2512.20848, p. 17)

liked a model 27 days ago

allenai/Olmo-3-32B-Think

upvoted a collection 27 days ago

View all activity

Organizations

None yet

New activity in nvidia/Nemotron-Agentic-v1 about 3 hours ago

Inquiry regarding Banking Domain data mentioned in Nemotron 3 Nano Paper (arXiv:2512.20848, p. 17)

#4 opened about 3 hours ago by

liked a model 27 days ago

allenai/Olmo-3-32B-Think

Text Generation • 1.05M • Updated 2 days ago • 7.55k • • 164

upvoted a collection 27 days ago

Olmo 3

Artifacts for the Olmo 3 release. • 9 items • Updated 15 days ago • 158

upvoted a paper about 1 month ago

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

Paper • 2511.22570 • Published Nov 27, 2025 • 86

upvoted 2 articles about 1 month ago

Article

What makes good reasoning data

Oct 30, 2025

•

43

Article

Aligning to What? Rethinking Agent Generalization in MiniMax M2

Oct 30, 2025

•

41

upvoted a collection about 2 months ago

Gemma 3 Release

28 items • Updated Aug 11, 2025 • 580

upvoted a collection 4 months ago

Qwen3Guard

7 items • Updated 7 days ago • 60

liked a model 5 months ago

openai/gpt-oss-120b

Text Generation • 120B • Updated Aug 26, 2025 • 3.43M • • 4.32k

liked a model 8 months ago

ai-sage/GigaChat-20B-A3B-instruct

Text Generation • 21B • Updated Jun 25, 2025 • 659 • 49

upvoted a paper 8 months ago

Chain-of-Model Learning for Language Model

Paper • 2505.11820 • Published May 17, 2025 • 121

liked a dataset 8 months ago

logicreasoning/logi_glue

Viewer • Updated Oct 31, 2023 • 356k • 957 • 4

liked a model 8 months ago

microsoft/bitnet-b1.58-2B-4T

Text Generation • 0.8B • Updated 21 days ago • 5.69k • 1.24k

upvoted a collection 8 months ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated 7 days ago • 673

liked a model 8 months ago

Qwen/Qwen3-0.6B

Text Generation • 0.8B • Updated Jul 26, 2025 • 8.06M • • 954

upvoted a collection 9 months ago

LiveBench

Datasets for LiveBench • 8 items • Updated Mar 31, 2025 • 13

upvoted a paper 10 months ago

I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders

Paper • 2503.18878 • Published Mar 24, 2025 • 119

liked a Space 10 months ago

Agora Demo

A simple demo showcasing Agora

upvoted a collection 11 months ago

DeepSeek-R1

10 items • Updated Nov 27, 2025 • 826

liked a model 11 months ago

Qwen/Qwen2.5-1.5B-Instruct

Text Generation • 2B • Updated Sep 25, 2024 • 5.84M • • 581