Close Menu
    Trending
    • Credit Risk Scoring for BNPL Customers at Bati Bank | by Sumeya sirmula | Jul, 2025
    • The New Career Crisis: AI Is Breaking the Entry-Level Path for Gen Z
    • Musk’s X appoints ‘king of virality’ in bid to boost growth
    • Why Entrepreneurs Should Stop Obsessing Over Growth
    • Implementing IBCS rules in Power BI
    • What comes next for AI copyright lawsuits?
    • Why PDF Extraction Still Feels LikeHack
    • GenAI Will Fuel People’s Jobs, Not Replace Them. Here’s Why
    AIBS News
    • Home
    • Artificial Intelligence
    • Machine Learning
    • AI Technology
    • Data Science
    • More
      • Technology
      • Business
    AIBS News
    Home»Artificial Intelligence»Virtualization & Containers for Data Science Newbies
    Artificial Intelligence

    Virtualization & Containers for Data Science Newbies

    Team_AIBS NewsBy Team_AIBS NewsFebruary 12, 2025No Comments13 Mins Read
    Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
    Share
    Facebook Twitter LinkedIn Pinterest Email

    Virtualization makes it doable to run a number of digital machines (VMs) on a single piece of bodily {hardware}. These VMs behave like unbiased computer systems, however share the identical bodily computing energy. A pc inside a pc, so to talk.

    Many cloud companies depend on virtualization. However different applied sciences, reminiscent of containerization and serverless computing, have grow to be more and more vital.

    With out virtualization, lots of the digital companies we use on daily basis wouldn’t be doable. After all, it is a simplification, as some cloud companies additionally use bare-metal infrastructures.

    On this article, you’ll discover ways to arrange your individual digital machine in your laptop computer in just some minutes — even in case you have by no means heard of Cloud Computing or containers earlier than.

    Desk of Contents
    1 — The Origins of Cloud Computing: From Mainframes to Serverless Architecture
    2 — Understanding Virtualization: Why it’s the Basis of Cloud Computing
    3 — Create a Virtual Machine with VirtualBox
    Final Thoughts
    Where can you continue learning?

    1 — The Origins of Cloud Computing: From Mainframes to Serverless Structure

    Cloud computing has basically modified the IT panorama — however its roots return a lot additional than many individuals assume. Actually, the historical past of the cloud started again within the Nineteen Fifties with enormous mainframes and so-called dumb terminals.

    • The period of mainframes within the Nineteen Fifties: Corporations used mainframes in order that a number of customers may entry them concurrently by way of dumb terminals. The central mainframes had been designed for high-volume, business-critical knowledge processing. Large companies nonetheless use them right this moment, even when cloud companies have lowered their relevance.
    • Time-sharing and virtualization: Within the subsequent decade (Nineteen Sixties), time-sharing made it doable for a number of customers to entry the identical computing energy concurrently — an early mannequin of right this moment’s cloud. Across the identical time, IBM pioneered virtualization, permitting a number of digital machines to run on a single piece of {hardware}.
    • The beginning of the web and web-based functions within the Nineteen Nineties: Six years earlier than I used to be born, Tim Berners-Lee developed the World Wide Web, which revolutionized on-line communication and our whole working and residing atmosphere. Are you able to think about our lives right this moment with out web? On the identical time, PCs had been turning into more and more standard. In 1999, Salesforce revolutionized the software program trade with Software program as a Service (SaaS), permitting companies to make use of CRM options over the web with out native installations.
    • The massive breakthrough of cloud computing within the 2010s:
      The fashionable cloud period started in 2006 with Amazon Web Services (AWS): Corporations had been in a position to flexibly lease infrastructure with S3 (storage) and EC2 (digital servers) as an alternative of shopping for their very own servers. Microsoft Azure and Google Cloud adopted with PaaS and IaaS companies.
    • The fashionable cloud-native period: This was adopted by the subsequent innovation with containerization. Docker made Containers standard in 2013, adopted by Kubernetes in 2014 to simplify the orchestration of containers. Subsequent got here serverless computing with AWS Lambda and Google Cloud Capabilities, which enabled builders to jot down code that mechanically responds to occasions. The infrastructure is absolutely managed by the cloud supplier.

    Cloud computing is extra the results of a long time of innovation than a single new expertise. From time-sharing to virtualization to serverless architectures, the IT panorama has constantly developed. At the moment, cloud computing is the muse for streaming companies like Netflix, AI functions like ChatGPT and international platforms like Salesforce.

    2 — Understanding Virtualization: Why Virtualization is the Foundation of Cloud Computing

    Virtualization means abstracting bodily {hardware}, reminiscent of servers, storage or networks, into a number of digital cases.

    A number of unbiased methods may be operated on the identical bodily infrastructure. As a substitute of dedicating a complete server to a single software, virtualization permits a number of workloads to share assets effectively. For instance, Home windows, Linux or one other atmosphere may be run concurrently on a single laptop computer — every in an remoted digital machine.

    This protects prices and assets.

    Much more vital, nonetheless, is the scalability: Infrastructure may be flexibly tailored to altering necessities.

    Earlier than cloud computing grew to become extensively out there, corporations typically needed to keep devoted servers for various functions, resulting in excessive infrastructure prices and restricted scalability. If extra efficiency was all of a sudden required, for instance as a result of webshop site visitors elevated, new {hardware} was wanted. The corporate had so as to add extra servers (horizontal scaling) or improve present ones (vertical scaling).

    That is totally different with virtualization: For instance, I can merely improve my digital Linux machine from 8 GB to 16 GB RAM or assign 4 cores as an alternative of two. After all, provided that the underlying infrastructure helps this. Extra on this later.

    And that is precisely what cloud computing makes doable: The cloud consists of big knowledge facilities that use virtualization to offer versatile computing energy — precisely when it’s wanted. So, virtualization is a elementary expertise behind cloud computing.

    How does serverless computing work?

    What when you didn’t even need to handle virtual machines anymore?

    Serverless computing goes one step additional than Virtualization and containerization. The cloud supplier handles most infrastructure duties — together with scaling, upkeep and useful resource allocation. Builders ought to deal with writing and deploying code.

    However does serverless actually imply that there aren’t any extra servers?

    After all not. The servers are there, however they’re invisible for the person. Builders not have to fret about them. As a substitute of manually provisioning a digital machine or container, you merely deploy your code, and the cloud mechanically executes it in a managed atmosphere. Assets are solely supplied when the code is working. For instance, you should utilize AWS Lambda, Google Cloud Capabilities or Azure Capabilities.

    What are some great benefits of serverless?

    As a developer, you don’t have to fret about scaling or upkeep. Which means that if there may be much more site visitors at a specific occasion, the assets are mechanically adjusted. Serverless computing may be cost-efficient, particularly in Perform-as-a-Service (FaaS) fashions. If nothing is working, you pay nothing. Nevertheless, some serverless companies have baseline prices (e.g. Firestore).

    Are there any disadvantages?

    You may have a lot much less management over the infrastructure and no direct entry to the servers. There may be additionally a danger of vendor lock-in. The functions are strongly tied to a cloud supplier.

    A concrete instance of serverless: API with out your individual server

    Think about you could have a web site with an API that gives customers with the present climate. Usually, a server runs across the clock — even at instances when nobody is utilizing the API.

    With AWS Lambda, issues work otherwise: A person enters ‘Mexico Metropolis’ in your web site and clicks on ‘Get climate’. This request triggers a Lambda perform within the background, which retrieves the climate knowledge and sends it again. The perform is then stopped mechanically. This implies you don’t have a completely working server and no pointless prices — you solely pay when the code is executed.

    3 — What Information Scientists ought to Find out about Containers and VMs — What’s the Distinction?

    You’ve in all probability heard of containers. However what’s the distinction to digital machines — and what’s significantly related as an information scientist?

    Each containers and digital machines are virtualization applied sciences.

    Each make it doable to run functions in isolation.

    Each supply benefits relying on the use case: Whereas VMs present sturdy safety, containers excel in pace and effectivity.

    The primary distinction lies within the structure:

    • Digital machines virtualize the whole {hardware} — together with the working system. Every VM has its personal operational system (OS). This in flip requires extra reminiscence and assets.
    • Containers, then again, share the host working system and solely virtualize the applying layer. This makes them considerably lighter and sooner.

    Put merely, digital machines simulate whole computer systems, whereas containers solely encapsulate functions.

    Why is that this vital for knowledge scientists?

    Since as an information scientist you’ll come into contact with machine studying, knowledge engineering or knowledge pipelines, additionally it is vital to know one thing about containers and digital machines. Positive, you don’t must have in-depth information of it like a DevOps Engineer or a Website Reliability Engineer (SRE).

    Digital machines are utilized in knowledge science, for instance, when an entire working system atmosphere is required — reminiscent of a Home windows VM on a Linux host. Information science tasks typically want particular environments. With a VM, it’s doable to offer precisely the identical atmosphere — no matter which host system is on the market.

    A VM can be wanted when coaching deep studying fashions with GPUs within the cloud. With cloud VMs reminiscent of AWS EC2 or Azure Digital Machines, you could have the choice of coaching the fashions with GPUs. VMs additionally fully separate totally different workloads from one another to make sure efficiency and safety.

    Containers are utilized in knowledge science for knowledge pipelines, for instance, the place instruments reminiscent of Apache Airflow run particular person processing steps in Docker containers. Which means that every step may be executed in isolation and independently of one another — no matter whether or not it entails loading, remodeling or saving knowledge. Even if you wish to deploy machine studying fashions by way of Flask / FastAPI, a container ensures that all the pieces your mannequin wants (e.g. Python libraries, framework variations) runs precisely because it ought to. This makes it tremendous simple to deploy the mannequin on a server or within the cloud.

    3 — Create a Digital Machine with VirtualBox

    Let’s make this a bit extra concrete and create an Ubuntu VM. 🚀

    I take advantage of the VirtualBox software program with my Home windows Lenovo laptop computer. The digital machine runs in isolation out of your essential working system in order that no modifications are made to your precise system. When you’ve got Home windows Professional Version, you can too allow Hyper-V (pre-installed by default, however disabled). With an Intel Mac, you also needs to be capable of use VirtualBox. With an Apple Silicon, Parallels Desktop or UTM is outwardly the higher different (not examined myself).

    1) Set up Digital Field

    Step one is to obtain the set up file for VirtualBox from the official Virtual Box website and set up VirtualBox. VirtualBox is put in together with all obligatory drivers.

    You possibly can ignore the notice about lacking dependencies Python Core / win32api so long as you don’t want to automate VirtualBox with Python scripts.

    Then we begin the Oracle VirtualBox Supervisor:

    2) Obtain the Ubuntu ISO file

    Subsequent, we obtain the Ubuntu ISO file from the Ubuntu website. An ISO Ubuntu file is a compressed picture file of the Ubuntu working system. Which means that it comprises an entire copy of the set up knowledge. I obtain the LTS model as a result of this model receives safety and upkeep updates for five years (Lengthy Time period Help). Be aware the placement of the .iso file as we are going to use it later in VirtualBox.

    Screenshot taken by the writer

    3) Create a digital machine in VirtualBox

    Subsequent, we create a brand new digital machine within the VirtualBox Supervisor and provides it the identify Ubuntu VM 2025. Right here we choose Linux as the kind and Ubuntu (64-bit) because the model. We additionally choose the beforehand downloaded ISO file from Ubuntu because the ISO picture. It will even be doable so as to add the ISO file later within the mass storage menu.

    Screenshot taken by the writer

    Subsequent, we choose a person identify vboxuser2025 and a password for entry to the Ubuntu system. The hostname is the identify of the digital machine inside the community or system. It should not comprise any areas. The area identify is non-obligatory and could be used if the community has a number of gadgets.

    We then assign the suitable assets to the digital machine. I select 8 GB (8192 MB) RAM, as my host system has 64 GB RAM. I like to recommend 4GB (4096) at the least. I assign 2 processors, as my host system has 8 cores and 16 logical processors. It will even be doable to assign 4 cores, however this fashion I’ve sufficient assets for my host system. Yow will discover out what number of cores your host system has by opening the Job Supervisor in Home windows and searching on the variety of cores beneath the Efficiency tab beneath CPU.

    Screenshot taken by the writer

    Subsequent, we click on on ‘Create a digital arduous disk now’ to create a digital arduous disk. A VM requires its personal digital arduous disk to put in the OS (e.g. Ubuntu, Home windows). All applications, information and configurations of the VM are saved on it — identical to on a bodily arduous disk. The default worth is 25 GB. If you wish to use a VM for machine studying or knowledge science, extra cupboard space (e.g. 50–100 GB) could be helpful to have room for giant knowledge units and fashions. I maintain the default setting.

    We are able to then see that the digital machine has been created and can be utilized:

    Screenshot taken by the writer

    4) Use Ubuntu VM

    We are able to now use the newly created digital machine like a traditional separate working system. The VM is totally remoted from the host system. This implies you’ll be able to experiment in it with out altering or jeopardizing your essential system.

    In case you are new to Linux, you’ll be able to check out fundamental instructions like ls, cd, mkdir or sudo to get to know the terminal. As an information scientist, you’ll be able to arrange your individual growth environments, set up Python with Pandas and Scikit-learn to develop knowledge evaluation and machine studying fashions. Or you’ll be able to set up PostgreSQL and run SQL queries with out having to arrange an area database in your essential system. You can too use Docker to create containerized functions.

    Last Ideas

    For the reason that VM is remoted, we will set up applications, experiment and even destroy the system with out affecting the host system.

    Let’s see if digital machines stay related within the coming years. As corporations more and more use microservice architectures (as an alternative of monoliths), containers with Docker and Kubernetes will definitely grow to be much more vital. However understanding the right way to arrange a digital machine and what it’s used for is definitely helpful.

    I simplify tech for curious minds. Should you get pleasure from my tech insights on Python, knowledge science, knowledge engineering, machine studying and AI, think about subscribing to my substack.

    The place Can You Proceed Studying?



    Source link
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Previous ArticleThe Future Of Transportation: Autonomous Vehicles and ML | by Arya College | Feb, 2025
    Next Article Fired Meta Workers Say They Have Records of Good Performance
    Team_AIBS News
    • Website

    Related Posts

    Artificial Intelligence

    Implementing IBCS rules in Power BI

    July 1, 2025
    Artificial Intelligence

    Become a Better Data Scientist with These Prompt Engineering Tips and Tricks

    July 1, 2025
    Artificial Intelligence

    Lessons Learned After 6.5 Years Of Machine Learning

    July 1, 2025
    Add A Comment
    Leave A Reply Cancel Reply

    Top Posts

    Credit Risk Scoring for BNPL Customers at Bati Bank | by Sumeya sirmula | Jul, 2025

    July 1, 2025

    I Tried Buying a Car Through Amazon: Here Are the Pros, Cons

    December 10, 2024

    Amazon and eBay to pay ‘fair share’ for e-waste recycling

    December 10, 2024

    Artificial Intelligence Concerns & Predictions For 2025

    December 10, 2024

    Barbara Corcoran: Entrepreneurs Must ‘Embrace Change’

    December 10, 2024
    Categories
    • AI Technology
    • Artificial Intelligence
    • Business
    • Data Science
    • Machine Learning
    • Technology
    Most Popular

    Windows 11 Pro for $20: Built for Business Owners Who Do It All

    February 6, 2025

    How a Grandfather Started a $500,000 Side Hustle on Amazon

    January 4, 2025

    Apple Plans to Bring Live Translation to AirPods: Report

    March 16, 2025
    Our Picks

    Credit Risk Scoring for BNPL Customers at Bati Bank | by Sumeya sirmula | Jul, 2025

    July 1, 2025

    The New Career Crisis: AI Is Breaking the Entry-Level Path for Gen Z

    July 1, 2025

    Musk’s X appoints ‘king of virality’ in bid to boost growth

    July 1, 2025
    Categories
    • AI Technology
    • Artificial Intelligence
    • Business
    • Data Science
    • Machine Learning
    • Technology
    • Privacy Policy
    • Disclaimer
    • Terms and Conditions
    • About us
    • Contact us
    Copyright © 2024 Aibsnews.comAll Rights Reserved.

    Type above and press Enter to search. Press Esc to cancel.