Enhancing Deep Learning Representations
A major challenge in deep learning is building strong representations without extensive retraining or large amounts of labeled data. Many applications rely on pre-trained models, but their frozen features often miss the task-specific details needed for top performance. Retraining can be impractical, especially in fields such as medical diagnostics and remote sensing, where compute and labeled data are limited. A method that improves fixed representations without retraining would therefore benefit a wide range of tasks and domains.
Current Approaches and Their Limitations
Self-supervised learning (SSL) methods such as SimCLR and DINO, typically built on Vision Transformer (ViT) backbones and often evaluated with k-nearest-neighbor (kNN) classifiers, have made progress in exploiting unlabeled data. However, these approaches often require specific architectures, heavy fine-tuning, or substantial labeled data, which limits their generalizability. Moreover, most SSL pipelines discard gradient information that could make the learned representations more adaptable to downstream applications.
Introducing FUNGI
Researchers from the University of Amsterdam and valeo.ai have developed a new method called FUNGI (Features from UNsupervised GradIents). This method enhances frozen embeddings by using gradient information from self-supervised learning objectives. FUNGI is adaptable and can be applied to any pre-trained model without changing its parameters, making it both flexible and efficient.
How FUNGI Works
FUNGI operates in three main stages (a code sketch follows the list):
- Gradient Extraction: Per-sample gradients of self-supervised objectives are computed with respect to the final layers of the frozen Vision Transformer backbone, capturing complementary features.
- Dimensionality Reduction: High-dimensional gradients are downsampled to match a target size using binary random projection.
- Concatenation: The downsampled gradients are combined with the embeddings and further compressed using PCA, resulting in efficient and informative feature sets.
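The sketch below illustrates this three-stage pipeline under stated assumptions; it is not the authors' implementation. The function name `fungi_features`, the `model.blocks[-1]` parameter selection, and the `ssl_loss` callable are hypothetical placeholders for a frozen ViT backbone and any self-supervised objective (e.g., a SimCLR- or DINO-style loss). Note that a dense random projection matrix over a full transformer block's gradient can be memory-heavy; this version is purely illustrative.

```python
import torch
from sklearn.decomposition import PCA

def fungi_features(model, ssl_loss, images, target_dim=768, seed=0):
    """Build FUNGI-style features: frozen embeddings + projected SSL gradients (sketch)."""
    # Assumed ViT layout: take gradients w.r.t. the final block only.
    final_params = list(model.blocks[-1].parameters())
    for p in final_params:
        p.requires_grad_(True)  # gradients are read, never applied: the model stays frozen

    proj = None
    embeddings, grad_feats = [], []
    for x in images:
        x = x.unsqueeze(0)
        emb = model(x)                          # frozen embedding
        loss = ssl_loss(model, x)               # self-supervised objective on this sample
        grads = torch.autograd.grad(loss, final_params)
        flat = torch.cat([g.flatten() for g in grads])

        if proj is None:
            # Binary {-1, +1} random projection, fixed by a seed, to downsample the gradient.
            gen = torch.Generator().manual_seed(seed)
            proj = torch.randint(0, 2, (flat.numel(), target_dim), generator=gen).float() * 2 - 1

        grad_feats.append((flat @ proj) / target_dim ** 0.5)
        embeddings.append(emb.detach().squeeze(0))

    # Concatenate embeddings with projected gradients, then compress with PCA.
    feats = torch.cat([torch.stack(embeddings), torch.stack(grad_feats)], dim=1).numpy()
    n_components = min(target_dim, feats.shape[0])
    return PCA(n_components=n_components).fit_transform(feats)
```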
Performance Improvements
FUNGI significantly enhances performance across visual, text, and audio benchmarks. In kNN classification, it delivers an average improvement of 4.4% across the ViT backbones evaluated, with the largest gains on datasets such as Flowers and CIFAR-100. In low-data settings, FUNGI improves accuracy by 2.8%, demonstrating its value when labeled examples are scarce. It also improves accuracy in retrieval-based semantic segmentation on Pascal VOC by up to 17%.
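As a usage illustration, features produced by a function like the hypothetical `fungi_features` above can be scored with a standard kNN classifier. This is a generic evaluation sketch (cosine-distance kNN via scikit-learn), not the authors' benchmark code, and the choice of k is an assumption.

```python
from sklearn.neighbors import KNeighborsClassifier

def knn_accuracy(train_feats, train_labels, test_feats, test_labels, k=20):
    # Cosine-distance kNN over the concatenated embedding + gradient features.
    clf = KNeighborsClassifier(n_neighbors=k, metric="cosine")
    clf.fit(train_feats, train_labels)
    return clf.score(test_feats, test_labels)
```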
Conclusion and Value of FUNGI
In summary, FUNGI efficiently enhances pre-trained model embeddings by utilizing unsupervised gradients from SSL objectives. It improves frozen model representations without retraining, offering adaptability and efficiency, especially in low-data environments. This advancement is crucial for applying AI in practical scenarios with limited labeled data and computational resources.
For more information, see the Paper and the GitHub page.