Hide-and-Seek: Forcing a Network to be Meticulous for Weakly-supervised Object and Action Localization | by Zubair

Cover-and-Search is a coaching technique developed to reinforce weakly-supervised object and motion localization by encouraging convolutional neural networks to be taught extra complete visible representations. Proposed by Krishna Kumar Singh and Yong Jae Lee, this methodology addresses a typical downside in weakly-supervised studying: neural networks are inclined to focus solely on probably the most discriminative components of an object or motion, equivalent to an individual’s face or a single body in an motion video, whereas ignoring the much less outstanding however nonetheless informative areas. Cover-and-Search introduces a novel but easy strategy to mitigate this subject by randomly hiding patches of the enter throughout coaching, successfully forcing the community to hunt various cues for studying. This method improves localization efficiency whereas remaining environment friendly and straightforward to combine into current deep studying pipelines.

Introduction
Cover-and-Search is particularly designed to function in weakly-supervised settings, the place solely image-level or video-level class labels can be found for coaching, and fine-grained annotations equivalent to bounding containers or temporal labels are absent. The tactic was impressed by the necessity to make neural networks extra meticulous in how they attend to visible knowledge. In conventional coaching, a CNN rapidly learns to latch onto the obvious visible characteristic that correlates with a category label. Whereas this works effectively for classification, it results in poor localization for the reason that realized consideration maps typically spotlight solely a small a part of the item. Cover-and-Search addresses this by modifying the coaching knowledge slightly than the community itself. By hiding random patches of the enter picture or video body, it ensures that the mannequin should discover a number of areas to succeed at classification, thereby studying a extra distributed and full illustration of the item or motion.

Structure and Mechanism
The Cover-and-Search technique is easy to implement and doesn’t require any modification to the mannequin structure or loss operate. Throughout coaching, the enter picture or body is split right into a grid of patches. A set proportion of those patches are randomly chosen to be hidden. These hidden patches are changed with a impartial worth, usually the dataset’s imply pixel worth, in order that they don’t contribute any class-specific info. The mannequin is educated utilizing these partially obscured photos in the identical approach it…

Source link

Data Analysis Lecture 2 : Getting Started with Pandas | by Yogi Code | Coding Nexus | Aug, 2025

Current Landscape of Artificial Intelligence Threats | by Kosiyae Yussuf | CodeToDeploy : The Tech Digest | Aug, 2025

Optimizing ML Costs with Azure Machine Learning | by Joshua Fox | Aug, 2025

Bots Are Taking Over the Internet—And They’re Not Asking for Permission

I Tried Buying a Car Through Amazon: Here Are the Pros, Cons

Amazon and eBay to pay ‘fair share’ for e-waste recycling

Artificial Intelligence Concerns & Predictions For 2025

Barbara Corcoran: Entrepreneurs Must ‘Embrace Change’

Most Popular

Exploring Text-to-Speech Technology for Video Game Narration

Reinforcement Learning in the Age of Modern AI | by @pramodchandrayan | Jul, 2025

The Misunderstood Intelligence: Addressing Public Concerns About AI | by Professor Lucius | May, 2025

Our Picks

Bots Are Taking Over the Internet—And They’re Not Asking for Permission

Data Analysis Lecture 2 : Getting Started with Pandas | by Yogi Code | Coding Nexus | Aug, 2025

TikTok to lay off hundreds of UK content moderators

Hide-and-Seek: Forcing a Network to be Meticulous for Weakly-supervised Object and Action Localization | by Zubair | Apr, 2025

Related Posts