Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
SoraWatermarkRemover
Log In
Sign Up
InternRobotics
/
InternVLA-M1
like
25
Follow
Intern Robotics
214
Robotics
Transformers
Safetensors
qwen2_5_vl
image-to-text
vision-language-action-model
vision-language-model
text-generation-inference
License:
cc-by-nc-sa-4.0
Model card
Files
Files and versions
xet
Community
3
Deploy
Use this model
New discussion
New pull request
Resources
PR & discussions documentation
Code of Conduct
Hub documentation
All
Discussions
Pull requests
View closed (1)
Sort: Recently created
Spatial Grounding Pre-training included?
#3 opened 1 day ago by
Jarry2020
Improve model card: Add pipeline tag, paper link, abstract, and sample usage
#2 opened about 2 months ago by
nielsr