Proton's biased article on Deepseek

JOMusic@lemmy.ml · 23 hours ago

Proton's biased article on Deepseek

morrowind@lemmy.ml · edit-2 27 minutes ago

Have you compared it with the regular qwen? It was also very good

☆ Yσɠƚԋσʂ ☆@lemmy.ml · 7 hours ago

The main difference is speed and memory usage. Qwen is a full-sized, high-parameter model while qwen-distill is a smaller model created using knowledge distillation to mimic qwen’s outputs. If you have the resources to run qwen fast then I’d just go with that.

morrowind@lemmy.ml · 26 minutes ago

I think you’re confusing the two. I’m talking about the regular qwen before it was finetuned by deep seek, not the regular deepseek