The similarities are way also good to disregard. They possibly trained the model over a synthetic dataset generated by GPT-4o. Did High-Flyer misrepresent its usage of GPUs to create DeepSeek appear to be extra efficient than it in fact is? Was DeepSeek’s unexpected community launch timed to drive down Nvidia’s https://x.com/kidtsang/status/1884008035535782292