what is the recommended method to start up the vllm server engine for inferencing for InternVL3_5-8B, getting 2 qps?
#3 opened about 3 hours ago
by
Rupasai
Fine tuning?
2
#2 opened 8 days ago
by
s1ngularutyy
