
    Cerebras Reports Fastest DeepSeek R1 Distill Llama 70B Inference

    By Team_AIBS News, February 3, 2025


    Cerebras Systems today announced what it said is record-breaking performance for DeepSeek-R1-Distill-Llama-70B inference, achieving more than 1,500 tokens per second, 57 times faster than GPU-based solutions.

    Cerebras said this speed enables instant reasoning capabilities for one of the industry’s most sophisticated open-weight models, running entirely on U.S.-based AI infrastructure with zero data retention.

    “DeepSeek R1 represents a new frontier in AI reasoning capabilities, and today we’re making it available at the industry’s fastest speeds,” said Hagay Lupesko, SVP of AI Cloud, Cerebras. “By achieving more than 1,500 tokens per second on our Cerebras Inference platform, we’re transforming minutes-long reasoning processes into near-instantaneous responses, fundamentally changing how developers and enterprises can leverage advanced AI models.”

    Powered by the Cerebras Wafer Scale Engine, the platform demonstrates real-world performance improvements. A standard coding prompt that takes 22 seconds on competitive platforms completes in just 1.5 seconds on Cerebras, a 15x improvement in time to result. This breakthrough enables practical deployment of sophisticated reasoning models that traditionally require extensive computation time.
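As a quick check on the arithmetic behind these figures, the short Python sketch below recomputes the time-to-result ratio from the 22-second and 1.5-second timings quoted above, and derives the GPU throughput implied by the "57 times faster" claim. All inputs are the vendor-reported numbers from this article, not independent measurements. The variable names are illustrative, and the implied GPU baseline is an inference from the quoted ratio, not a figure Cerebras published here.

```python
# Figures quoted in the article (vendor-reported, not independently measured).
competitor_seconds = 22.0          # standard coding prompt on competing platforms
cerebras_seconds = 1.5             # same prompt on Cerebras
cerebras_tokens_per_second = 1500  # "more than 1,500", treated here as a lower bound
claimed_throughput_ratio = 57      # "57 times faster than GPU-based solutions"


def speedup(baseline_seconds: float, new_seconds: float) -> float:
    """Return how many times faster the new timing is than the baseline."""
    return baseline_seconds / new_seconds


latency_speedup = speedup(competitor_seconds, cerebras_seconds)

# Assumption: the 57x claim compares tokens per second, so dividing gives
# the GPU throughput it implies. The article does not state this baseline.
implied_gpu_tokens_per_second = cerebras_tokens_per_second / claimed_throughput_ratio

print(f"Time-to-result speedup: {latency_speedup:.1f}x")
print(f"Implied GPU baseline: about {implied_gpu_tokens_per_second:.0f} tokens/s")
```

The first ratio works out to roughly 14.7, which the article rounds to 15x. The two multipliers are likely different because they measure different things: the 15x figure is end-to-end time for a single prompt, while the 57x figure appears to refer to token throughput.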

    DeepSeek-R1-Distill-Llama-70B combines the advanced reasoning capabilities of DeepSeek’s 671B parameter Mixture of Experts (MoE) model with Meta’s widely-supported Llama architecture. Despite its efficient 70B parameter size, the model demonstrates superior performance on complex mathematics and coding tasks compared to larger models.

    “Security and privacy are paramount for enterprise AI deployment,” continued Lupesko. “By processing all inference requests in U.S.-based data centers with zero data retention, we’re ensuring that organizations can leverage cutting-edge AI capabilities while maintaining strict data governance standards. Data stays in the U.S. 100% of the time and belongs solely to the customer.”

    The DeepSeek-R1-Distill-Llama-70B model is available immediately through Cerebras Inference, with API access available to select customers through a developer preview program. For more information about accessing instant reasoning capabilities for applications, visit www.cerebras.ai/contact-us.




