Last week, I made a video about DeepSeek-V3, and it caused an enormous stir in the global AI community.
Two days ago, the Chinese company DeepSeek released its reasoning-focused large language model, “DeepSeek-R1,” as open source!
It’s said to perform just as well as OpenAI’s most accurate reasoning model, “o1.” What’s even more impressive is that it shatters existing pricing, with an API price of less than 1/25 of OpenAI o1’s. On top of that, it’s been open-sourced under the highly permissive MIT license, so anyone can download and use the model.
As soon as R1 came out, it not only refuted the earlier claim that DeepSeek had distilled OpenAI o1, but the company also said outright that R1 is an open-source model that can tie with o1.
It’s worth mentioning that R1 breaks with previous model training methods and doesn’t use any SFT (supervised fine-tuning) data at all. It trains the model purely through RL (reinforcement learning). This suggests that R1 has learned to think through problems on its own, which is actually more in line with how humans think.