I wanted to verify this for myself, so I set up a small test harness on my production server. It ran 360 chat completions across a range of models, cancelling each request immediately after the first token was received. A rough sketch of the harness is shown below, followed by the resulting first-token latency measurements.
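For reference, here is a minimal sketch of how such a harness might look, assuming an OpenAI-compatible streaming endpoint accessed through the official Python SDK; the model names, prompt, and run counts are placeholders rather than the actual benchmark configuration:

```python
import time
from openai import OpenAI  # assumes an OpenAI-compatible endpoint

client = OpenAI()  # API key and base URL taken from the environment

def first_token_latency(model: str, prompt: str) -> float:
    """Seconds from sending the request until the first content token streams back."""
    start = time.perf_counter()
    stream = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": prompt}],
        stream=True,
    )
    for chunk in stream:
        # The first chunk carrying actual content marks the first token.
        if chunk.choices and chunk.choices[0].delta.content:
            latency = time.perf_counter() - start
            stream.close()  # cancel the rest of the completion
            return latency
    raise RuntimeError("stream ended before any content arrived")

# Placeholder models and prompt; 3 models x 120 runs = 360 completions.
models = ["model-a", "model-b", "model-c"]
results = {m: [first_token_latency(m, "Say hello.") for _ in range(120)] for m in models}
```

Closing the stream after the first content chunk is what cancels the remainder of the generation, so each run costs only a single token of output.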