Allan Victor's picture

53 310

Allan Victor

BecomeAllan

·

https://becomeallan.github.io/webportfolio/

AI & ML interests

Deep Learning

Recent Activity

liked a model 3 days ago

openbmb/MiniCPM4-0.5B

liked a model 5 days ago

LiquidAI/LFM2-350M-ENJP-MT-GGUF

liked a model 5 days ago

YannQi/R-4B

View all activity

Organizations

upvoted a paper 6 days ago

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Paper • 2509.02544 • Published 7 days ago • 112

upvoted a collection 12 days ago

TimesFM Release

TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting. • 4 items • Updated Jul 10 • 19

upvoted a paper 19 days ago

LongSplat: Robust Unposed 3D Gaussian Splatting for Casual Long Videos

Paper • 2508.14041 • Published 21 days ago • 57

upvoted a collection about 2 months ago

OpenReasoning-Nemotron

Collection of models for OpenReasoning-Nemotron which are trained on 5M reasoning traces for Math, Code and Science. • 6 items • Updated 7 days ago • 42

upvoted a paper about 2 months ago

AI Flow: Perspectives, Scenarios, and Approaches

Paper • 2506.12479 • Published Jun 14 • 2

upvoted a collection 2 months ago

MiniCPM4

MiniCPM4: Ultra-Efficient LLMs on End Devices • 29 items • Updated 2 days ago • 75

upvoted an article 2 months ago

Article

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

By

and 8 others •

Jun 3

• 243

upvoted a collection 3 months ago

Jan-nano

5 items • Updated Jul 1 • 23

upvoted an article 3 months ago

Article

Enhance Your Models in 5 Minutes with the Hugging Face Kernel Hub

By

and 6 others •

Jun 12

• 133

upvoted a collection 3 months ago

🌞 May 2025 - Open works from the Chinese community

43 items • Updated 8 days ago • 9

upvoted 2 collections 4 months ago

Any-to-Any Models, Datasets, Spaces

18 items • Updated Jun 20 • 24

Releases 23 May

34 items • Updated May 26 • 8

upvoted a paper 4 months ago

ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Paper • 2505.04588 • Published May 7 • 66

upvoted 3 collections 5 months ago

Pleias-RAG

New generation of small reasoning models for RAG, search, and source summarization. • 4 items • Updated Apr 24 • 27

InternVL3

34 items • Updated Apr 20 • 81

Gemma 3 QAT

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated Jul 10 • 209

upvoted a collection 6 months ago

Open-RS

Model weights & datasets in the paper "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn’t" • 8 items • Updated Mar 21 • 12

upvoted an article 6 months ago

Article

Open R1: Update #3

By

and 9 others •

Mar 11

• 295

upvoted a paper 7 months ago

Distributed Inference and Fine-tuning of Large Language Models Over The Internet

Paper • 2312.08361 • Published Dec 13, 2023 • 28

upvoted a paper 8 months ago

Turbo Sparse: Achieving LLM SOTA Performance with Minimal Activated Parameters

Paper • 2406.05955 • Published Jun 10, 2024 • 28