How do I serve a model in the original folder as bf16 in VLLM?
4
#60 opened about 1 month ago
by
bakch92
Model Performance
😔
🤗
5
1
#59 opened about 1 month ago
by
Joe1998
Disgusting, maximally censored model!
👍
33
16
#56 opened about 1 month ago
by
Lord-Kvento

Llama, Mistral, Gemma… and now OpenAI enters the hunger games. 🐎⚔️
🤗
🔥
2
2
#54 opened about 1 month ago
by
Stephen555

Request: FP8 / BF16 version of model?
👍
2
1
#53 opened about 1 month ago
by
Epliz
how to disable the reasoning mode?
👍
10
7
#50 opened about 1 month ago
by
szzzzz
Enterprise AI factory OS
#47 opened about 1 month ago
by
DavidSteinbauer
How to use different reasoning effort in the example?
🔥
1
2
#45 opened about 1 month ago
by
TianheWu

Structured Outputs
1
#44 opened about 1 month ago
by
DykeF
Problems with Metal
3
#42 opened about 1 month ago
by
Thalesian
[v1 engine][flash_attn backend] TypeError: flash_attn_varlen_func() got an unexpected keyword argument 's_aux' when running gpt-oss-120b on H200
👍
7
13
#41 opened about 1 month ago
by
RekklesAI

openai_harmony.HarmonyError: error downloading or loading vocab file: failed to download or load vocab file
11
#39 opened about 1 month ago
by
rsullenbLL
Cooooool !大模型本地部署,能力评测,幻觉评测微信交流群,欢迎感兴趣的朋友加入
#38 opened about 1 month ago
by
jakyer
I feel unsafe
🤗
5
9
#37 opened about 1 month ago
by
lmganon123
Deploy gpt-oss models in your own AWS account using vLLM and Tensorfuse
🔥
2
3
#36 opened about 1 month ago
by
agam30

Local Installation Video and Testing - Step by Step
#34 opened about 1 month ago
by
fahdmirzac

vLLM FlashAttention3 with A6000
👍
16
19
#33 opened about 1 month ago
by
YieumYoon
Request: 4-bit GPTQ or AWQ quantized version of openai/gpt-oss-20b
❤️
🚀
12
12
#32 opened about 1 month ago
by
powtac

Benchmaxed with no world knowledge or intuition. (ex. Wheelies on a motorcycle do not use front brakes to adjust the height b/c the front wheel is already off the ground duh)
👍
8
#31 opened about 1 month ago
by
TroyDoesAI

Streaming Issue
1
#29 opened about 1 month ago
by
seanliu96
Model claims to be GPT4-Turbo - any explanation behind this?
3
#28 opened about 1 month ago
by
marksverdhei

Web browsing (using built-in browsing tools)
1
#27 opened about 1 month ago
by
eryk-mazus

Knowledge limitations
👍
2
5
#25 opened about 1 month ago
by
hexess
VLLM - Flash-attn 3
12
#23 opened about 1 month ago
by
chriswritescode

From 🤗 to 🤯 — The Evolution
❤️
🔥
9
2
#22 opened about 1 month ago
by
gokularaman

Recommended Sampling Parameters
👍
1
4
#21 opened about 1 month ago
by
YunfanZhang42
Promise kept, excellent work. Thank you
❤️
1
#20 opened about 1 month ago
by
TestregX

poor multi language support
😔
👍
5
8
#19 opened about 1 month ago
by
devops724
OpenAI
❤️
2
1
#18 opened about 1 month ago
by
nanowell

Function calling doesn't work
7
#17 opened about 1 month ago
by
patrickvonplaten

Finally OpenAI
🚀
2
1
#16 opened about 1 month ago
by
nitinprajwal
gpt-oss not supporting OpenAI Agents SDK tools
3
#13 opened about 1 month ago
by
davidduyun

how to finetune this model?
👍
1
3
#11 opened about 1 month ago
by
daiwk
Witnessing History
#8 opened about 1 month ago
by
Lazycuber

OpenAI is finally open.
🤗
🔥
13
#7 opened about 1 month ago
by
anupbhat
GG WP
#6 opened about 1 month ago
by
alexcarter1
I was there
🤝
1
1
#4 opened about 1 month ago
by
Fotachu

OSS FTW!
🚀
😔
3
2
#2 opened about 1 month ago
by
nanowell

note: to expensive to run
🔥
🤗
2
11
#1 opened about 1 month ago
by
MichaelBoll
