
    SambaNova Reports Fastest DeepSeek-R1 671B with High Efficiency

    By Team_AIBS News | February 18, 2025 | 4 Mins Read


    Palo Alto, CA – Generative AI company SambaNova announced last week that DeepSeek-R1 671B is running today on SambaNova Cloud at 198 tokens per second (t/s), “achieving speeds and efficiency that no other platform can match,” the company said.

    DeepSeek-R1 has reduced AI training costs by 10X, but until now its widespread adoption has been hindered by high inference costs and inefficiencies, according to the company. “SambaNova has removed this barrier, unlocking real-time, cost-effective inference at scale for developers and enterprises,” the company said.

    “Powered by the SN40L RDU chip, SambaNova is the fastest platform running DeepSeek at 198 tokens per second per user,” said Rodrigo Liang, CEO and co-founder of SambaNova. “This will increase to 5X faster than the latest GPU speed on a single rack, and by year end, we will offer 100X the capacity for DeepSeek-R1.”

    “Being able to run the full DeepSeek-R1 671B model, not a distilled version, at SambaNova’s blazingly fast speed is a game changer for developers. Reasoning models like R1 need to generate a lot of reasoning tokens to come up with a superior output, which makes them take longer than traditional LLMs. This makes speeding them up especially important,” said Dr. Andrew Ng, Founder of DeepLearning.AI, Managing General Partner at AI Fund, and an Adjunct Professor at Stanford University’s Computer Science Department.

    “Artificial Analysis has independently benchmarked SambaNova’s cloud deployment of the full 671 billion parameter DeepSeek-R1 Mixture of Experts model at over 195 output tokens/s, the fastest output speed we have ever measured for DeepSeek-R1. High output speeds are particularly important for reasoning models, as these models use reasoning output tokens to improve the quality of their responses. SambaNova’s high output speeds will support the use of reasoning models in latency-sensitive use cases,” said George Cameron, Co-Founder, Artificial Analysis.
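
To make the latency point concrete, here is a minimal sketch of how output speed translates into wait time for a reasoning model. Only the 198 t/s figure comes from the announcement. The reasoning-token count and the slower comparison speed are hypothetical values chosen for illustration, and the calculation assumes a constant output speed and ignores time to first token.

```python
def generation_seconds(num_tokens: int, tokens_per_second: float) -> float:
    """Return the time needed to emit num_tokens at a constant output speed."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return num_tokens / tokens_per_second


# Reported in the announcement: per-user output speed on SambaNova Cloud.
REPORTED_TOKENS_PER_SECOND = 198.0

# Hypothetical values, NOT from the announcement, used only for illustration.
ASSUMED_REASONING_TOKENS = 5_000
ASSUMED_SLOWER_TOKENS_PER_SECOND = 40.0

if __name__ == "__main__":
    fast = generation_seconds(ASSUMED_REASONING_TOKENS, REPORTED_TOKENS_PER_SECOND)
    slow = generation_seconds(ASSUMED_REASONING_TOKENS, ASSUMED_SLOWER_TOKENS_PER_SECOND)
    print(f"At {REPORTED_TOKENS_PER_SECOND:.0f} t/s: {fast:.1f} s")
    print(f"At {ASSUMED_SLOWER_TOKENS_PER_SECOND:.0f} t/s: {slow:.1f} s")
```

Because generation time scales inversely with output speed, any speedup matters more as the number of reasoning tokens grows.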

    DeepSeek-R1 has revolutionized AI by cutting training costs tenfold. However, widespread adoption has stalled because DeepSeek-R1’s reasoning capabilities require significantly more compute for inference, making AI production more expensive. In fact, the inefficiency of GPU-based inference has kept DeepSeek-R1 out of reach for most developers.

    SambaNova has solved this problem. With a proprietary dataflow architecture and three-tier memory design, SambaNova’s SN40L Reconfigurable Dataflow Unit (RDU) chips collapse the hardware requirements to run DeepSeek-R1 671B efficiently from 40 racks (320 of the latest GPUs) down to 1 rack (16 RDUs), unlocking cost-effective inference at unmatched efficiency.
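
The hardware figures quoted above imply a few simple ratios. The sketch below only restates the arithmetic on the company's stated numbers (40 racks with 320 GPUs versus 1 rack with 16 RDUs). The function and variable names are illustrative, and the result says nothing about per-chip performance, power, or cost.

```python
def reduction_factor(before: float, after: float) -> float:
    """Return how many times smaller `after` is than `before`."""
    if after <= 0:
        raise ValueError("after must be positive")
    return before / after


# Figures as stated in the announcement.
GPU_RACKS, GPU_CHIPS = 40, 320
RDU_RACKS, RDU_CHIPS = 1, 16

gpus_per_rack = GPU_CHIPS / GPU_RACKS                   # chips per GPU rack
rdus_per_rack = RDU_CHIPS / RDU_RACKS                   # chips per RDU rack
rack_reduction = reduction_factor(GPU_RACKS, RDU_RACKS)  # fewer racks
chip_reduction = reduction_factor(GPU_CHIPS, RDU_CHIPS)  # fewer chips

if __name__ == "__main__":
    print(f"GPUs per rack: {gpus_per_rack:.0f}, RDUs per rack: {rdus_per_rack:.0f}")
    print(f"Rack count reduced {rack_reduction:.0f}x, chip count reduced {chip_reduction:.0f}x")
```

On these numbers, the claimed setup uses 40 times fewer racks and 20 times fewer chips, with each RDU rack holding twice as many chips as each GPU rack.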

    “DeepSeek-R1 is one of the most advanced frontier AI models available, but its full potential has been limited by the inefficiency of GPUs,” said Rodrigo Liang, CEO of SambaNova. “That changes today. We’re bringing the next major breakthrough, collapsing inference costs and reducing hardware requirements from 40 racks to just one, to offer DeepSeek-R1 at the fastest speeds, efficiently.”

    “More than 10 million users and engineering teams at Fortune 500 companies rely on Blackbox AI to transform how they write code and build products. Our partnership with SambaNova plays a critical role in accelerating our autonomous coding agent workflows. SambaNova’s chip capabilities are unmatched for serving the full DeepSeek-R1 671B model, which provides much better accuracy than any of the distilled versions. We couldn’t ask for a better partner to work with to serve millions of users,” said Robert Rizk, CEO of Blackbox AI.

    Sumti Jairath, Chief Architect, SambaNova, explained: “DeepSeek-R1 is the perfect match for SambaNova’s three-tier memory architecture. With 671 billion parameters, R1 is the largest open source large language model released to date, which means it needs a lot of memory to run. GPUs are memory constrained, but SambaNova’s unique dataflow architecture means we can run the model efficiently to achieve 20,000 tokens/s of total rack throughput in the near future, which is unprecedented efficiency compared with GPUs, given their inherent memory and data communication bottlenecks.”

    SambaNova is rapidly scaling its capacity to meet anticipated demand, and by the end of the year will offer more than 100x the current global capacity for DeepSeek-R1. This makes its RDUs the most efficient enterprise solution for reasoning models.

    The full DeepSeek-R1 671B model is available now on SambaNova Cloud for all users to try, and to select users via API.




