Close Menu
    Trending
    • Credit Risk Scoring for BNPL Customers at Bati Bank | by Sumeya sirmula | Jul, 2025
    • The New Career Crisis: AI Is Breaking the Entry-Level Path for Gen Z
    • Musk’s X appoints ‘king of virality’ in bid to boost growth
    • Why Entrepreneurs Should Stop Obsessing Over Growth
    • Implementing IBCS rules in Power BI
    • What comes next for AI copyright lawsuits?
    • Why PDF Extraction Still Feels LikeHack
    • GenAI Will Fuel People’s Jobs, Not Replace Them. Here’s Why
    AIBS News
    • Home
    • Artificial Intelligence
    • Machine Learning
    • AI Technology
    • Data Science
    • More
      • Technology
      • Business
    AIBS News
    Home»Machine Learning»02. The ML Engineering Journey: From Theory to Practice — Setting Up Our Development Environment | by Vitor Yago | Feb, 2025
    Machine Learning

    02. The ML Engineering Journey: From Theory to Practice — Setting Up Our Development Environment | by Vitor Yago | Feb, 2025

    Team_AIBS NewsBy Team_AIBS NewsFebruary 15, 2025No Comments5 Mins Read
    Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
    Share
    Facebook Twitter LinkedIn Pinterest Email


    Bear in mind how in our final publish we talked concerning the instruments we’ll be utilizing on our ML Engineering journey? Properly, it’s time to roll up our sleeves and begin constructing. At present, we’re going to arrange our improvement surroundings — and belief me, this isn’t your typical “simply set up these packages and also you’re good to go” tutorial.

    Vital to say, every part I’ll present right here, relies on Home windows, sorry guys.

    Earlier than we write a single line of code, let’s discuss one thing that took me embarrassingly lengthy to study: correct mission construction issues. Rather a lot. You see, ML initiatives have this sneaky method of rising from “only a fast experiment” into “okay, that is really going into manufacturing” sooner than you may say “jupyter pocket book.”

    Let me share one thing private: I’ve had the privilege of working with two of essentially the most organized individuals I’ve ever met. These two people achieved one thing outstanding — they constructed a complete division that will usually require a crew of 10 folks. Their secret? Pure group and construction. They proved that being organized isn’t nearly preserving issues tidy; it’s about effectivity and scalability. I discovered from them that if you’re actually organized, you may develop initiatives rapidly and effectively, doing the work of a complete crew with only a fraction of the sources.

    Consider your mission construction like the inspiration of a home. Certain, you would throw collectively a tent fairly rapidly, however you wouldn’t wish to construct a skyscraper on that very same basis. Let me present you the best way to construct one thing that may scale along with your ambitions, utilizing the rules I discovered from these engineering wizards.

    Earlier than we dive into creating our mission construction, we’d like a approach to observe our adjustments and retailer our code safely. That is the place GitHub is available in. I gained’t spend an excessive amount of time explaining the fundamentals of Git and GitHub — there are wonderful sources already out there. Should you’re new to Git and GitHub, I extremely suggest:

    1. GitHub’s personal tutorial movies on YouTube
    2. The “Git & GitHub Crash Course” by Traversy Media
    3. “Git and GitHub for Inexperienced persons” by freeCodeCamp

    You gained’t see me educating you some issues you may simply entry on YouTube.

    Upon getting GitHub arrange (it ought to take about half-hour following any of these tutorials), create a brand new repository known as “ML-Engineering-Journey”. Make sure that to:

    • Initialize it with a README file
    • Add a .gitignore file for Python
    • Select the MIT license (it’s beginner-friendly and permissive)

    After creating your repository, you’ll get a URL that appears one thing like:

    https://github.com/vitoryago/ML-Engineering-Journey.git

    In case you are new in GitHub, bear in mind, you NEED to study the immediate instructions.

    Let’s skip now to what issues on this part: How can we arrange our mission? Right here is the construction we’ll comply with>

    ML-Engineering-Journey/
    ├── knowledge/ # For storing datasets
    │ ├── uncooked/ # Unique, immutable knowledge
    │ └── processed/ # Cleaned and processed knowledge
    ├── fashions/ # The place skilled fashions stay
    ├── notebooks/ # For experimentation
    ├── src/ # Supply code
    │ ├── knowledge/ # Knowledge processing scripts
    │ ├── fashions/ # Mannequin-related code
    │ └── utils/ # Utility features
    ├── assessments/ # For making certain code high quality
    └── README.md # Challenge documentation

    This mission construction would possibly appear to be overkill at first — sort of like organizing your sock drawer by colour, materials, AND event. However belief me, there’s methodology to this insanity!

    Consider it like organising your kitchen. You wouldn’t retailer uncooked rooster subsequent to your ready-to-eat salad (except you’re a fan of meals poisoning), proper? That’s precisely why we separate our uncooked and processed knowledge. It’s like having a “deal with with care” part to your pristine, authentic datasets, safely away from the “chopped, diced, and ready-to-serve” processed knowledge.

    To create our workspace, we’ll want Python’s digital surroundings — consider it as creaating a clear room the place we management precisely what instruments now we have out there:

    # Create and activate digital surroundings

    python -m venv venv
    venvScriptsactivate

    When the digital surroundings prompts, your command immediate adjustments to indicate (venv) firstly. This small change signifies one thing essential: you’re now working in an remoted surroundings the place you may set up precisely the packages you want with out affecting different initiatives.

    Subsequent, we set up our important instruments:

    pip set up pandas numpy scikit-learn torch
    pip set up transformers
    pip set up python-dotenv pytest jupyter
    pip set up langchain langchain-experimental

    Every of those packages serves a selected objective in our ML Engineering toolkit. Pandas and NumPy will deal with our knowledge processing, scikit-learn and PyTorch present our machine studying capabilities, and instruments like pytest guarantee our code works reliably.

    Thus far, we’ve laid down a stable basis for our MLE Journey. Sure, possibly this isn’t the enjoyable half you’re anticipating, nevertheless it’s actually crucial.

    In our subsequent publish, we’ll dive into one thing that revolutionized how we deploy ML techniques: Docker containers. We’ll learn to:

    • Create your first Dockerfile
    • Construct and run containers for ML improvement
    • Perceive why ML containers are typically bigger than typical software program containers
    • Arrange your mission to be cloud-ready from the beginning

    And right here’s slightly teaser (for the publish #5): we’ll additionally discover some methods for preserving your Docker pictures environment friendly with out sacrificing performance. As a result of whereas your ML fashions would possibly must be massive, your containers don’t need to be!

    See you within the subsequent publish! Bear in mind, let’s keep linked:

    LinkedIn: Vitor Carvalho



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Previous ArticleOpenAI Rejects Elon Musk’s Bid to Gain Control of the Company
    Next Article Want to Make Money With AI? Here Are Easy Steps to Unlock Explosive Profits in 2025
    Team_AIBS News
    • Website

    Related Posts

    Machine Learning

    Credit Risk Scoring for BNPL Customers at Bati Bank | by Sumeya sirmula | Jul, 2025

    July 1, 2025
    Machine Learning

    Why PDF Extraction Still Feels LikeHack

    July 1, 2025
    Machine Learning

    🚗 Predicting Car Purchase Amounts with Neural Networks in Keras (with Code & Dataset) | by Smruti Ranjan Nayak | Jul, 2025

    July 1, 2025
    Add A Comment
    Leave A Reply Cancel Reply

    Top Posts

    Credit Risk Scoring for BNPL Customers at Bati Bank | by Sumeya sirmula | Jul, 2025

    July 1, 2025

    I Tried Buying a Car Through Amazon: Here Are the Pros, Cons

    December 10, 2024

    Amazon and eBay to pay ‘fair share’ for e-waste recycling

    December 10, 2024

    Artificial Intelligence Concerns & Predictions For 2025

    December 10, 2024

    Barbara Corcoran: Entrepreneurs Must ‘Embrace Change’

    December 10, 2024
    Categories
    • AI Technology
    • Artificial Intelligence
    • Business
    • Data Science
    • Machine Learning
    • Technology
    Most Popular

    LLM for Data Visualization: How AI Shapes the Future of Analytics | by Chengzhi Zhao | Jan, 2025

    January 18, 2025

    Apple and Android Appear Powerless Against Toll Scam Texts

    March 14, 2025

    How to Survive High-Demand Seasons Without Losing Customers

    January 4, 2025
    Our Picks

    Credit Risk Scoring for BNPL Customers at Bati Bank | by Sumeya sirmula | Jul, 2025

    July 1, 2025

    The New Career Crisis: AI Is Breaking the Entry-Level Path for Gen Z

    July 1, 2025

    Musk’s X appoints ‘king of virality’ in bid to boost growth

    July 1, 2025
    Categories
    • AI Technology
    • Artificial Intelligence
    • Business
    • Data Science
    • Machine Learning
    • Technology
    • Privacy Policy
    • Disclaimer
    • Terms and Conditions
    • About us
    • Contact us
    Copyright © 2024 Aibsnews.comAll Rights Reserved.

    Type above and press Enter to search. Press Esc to cancel.