Request for Recommendations on Incremental Learning Segmentation Papers Based on Vision-Language Models?

More Jiahui Shi's questions See All

What is the principle/mechanism behind aging of carbon (graphite) containing refractory mix for isostatically pressed refractories?

The resin bonded carbon containing refractories are aged before use. How time of aging is determined? And mechanism behind aging.

23 July 2024 3,205 0 View

When I am trying to distill triethyl amine while drying it over calcium hydride. How do I design the setup to let hydrogen escape without losing NEt3?

I am new to research. The boiling point of triethyl amine (NEt3) is 89 degrees centigrade. I am skeptical that when I will let the hydrogen escape which is getting generated in situ, I will also...

21 July 2024 7,284 4 View

Why my gel electrophoresis have shadow bands? Please see the attached picture for the gel electrophoresis ?

Sometimes I see the shadow like bands and its not true band. I want to know that what's the reason for it. I am using 2% gel for running genotyping samples I have uploaded the gel picture in both...

19 July 2024 148 6 View

What is the future scope of acoustic emission?

17 July 2024 1,510 1 View

Is the protecting group boc of the amino group stable at 37°C?

I have a small molecule reagent with a boc-protected amino group. Now the reaction needs to be reacted at 37°C for 30 h. Is this protection group stable?

12 July 2024 3,745 2 View

Why can't I detect the plasmon resonance angle with water?

I am trying to measure the plasmon resonance angle of gold film and pure water using the Kretschmann configuration and a 633nm laser. Without flowing water over the gold, I can detect a clear...

10 July 2024 4,719 3 View

How can we generate topology file for water and helium system?

Hello! I am facing a problem, I tried using pdb2gmx to generate topology file but it shows error HEA residue not found then I tried copying the forcefield file and edited accordingly but again...

24 June 2024 1,242 0 View

What is the difference between creep deformation and stress relaxation ?

I understand the basic definition for both. I found a literature to describe both of them. In the paper, they states: If a stress is applied to a concrete body, the body experiences an elastic...

19 June 2024 525 3 View

Difference between creep deformation and stress relaxation deformation ?

I understand the basic definitions of creep and pressure relaxation. However, I am confused about the exact difference between them. See attached photo. Because a compression force is applied,...

17 June 2024 5,344 2 View

Best Tools for Wrist PPG Signal Analysis: Your Recommendations?

I'm conducting research on photoplethysmography (PPG) signals obtained from smartwatches worn on the wrist. The main goal is to analyze the PPG waveform and extract key fiducial points and...

13 June 2024 3,286 0 View

How to generate a citation of my paper from ResearchGate?

How we can cite the papers from ResearchGate. I am trying to create citations for this article, Quantum Machine Learning Algorithms for Optimization Problems: Theory, Implementation, and...

08 August 2024 6,690 3 View

Is there anyone with experience in TEM analysis who can assist with a manuscript for an upcoming journal?

Hello dear colleagues, We have prepared a manuscript on NiTi-based alloys and are seeking a second opinion on our current TEM results. If you are a Ph.D. holder with experience in TEM and have...

07 August 2024 9,563 0 View

How to get links for copyrights for papers?

how to get links for copyrights for papers?

06 August 2024 7,410 1 View

How to determine positive-stained cells in FACS? Use isotype or unstained control?

To compare positive and negative cell populations in flow cytometry, should I compare unstained cells with antibody stained cells? Or with the isotype control? Most papers show comparison with...

06 August 2024 6,728 6 View

Weak DAPI staining after immunohistochemistry - how to improve?

After immunohistochemistry of previously fixed in PFA and EtOH and then frozen 20 μm sections of zebrafish brain, DAPI staining is very weak (right) compared to the same sections stained without...

05 August 2024 9,637 2 View

Dirty and clean?

Hi everyone I need a file with a dirty and clean potato image

04 August 2024 7,199 4 View

Hello, regarding Mxene 2D titanium carbide?

I fabricated Ti3C2Tx using concentrated HF 40%, I plot an XRD as attached image below.. please let to know if I obtained it or not.

02 August 2024 6,789 4 View

How do i get an account to upload my published papers?

need to open an account to upload my published papers

01 August 2024 9,255 1 View

CHO-K1 suspension adaptation protocol?

I am trying to adapt adherent CHO-K1 cells in F12K+10%FBS to suspension culture in only F12K, but in vain. I have followed Mirus bio as well as Thermo protocols, please suggest any alternative....

31 July 2024 3,756 0 View

What do you consider to be the most relevant elements of EEG for studying cognitive biases?

I've seen articles that primarily focus on alpha and beta activity in the frontal regions, but these studies often compare healthy subjects with those having various pathologies. I haven't seen a...

31 July 2024 7,259 1 View

Saisuman Singamsetty

You're absolutely right—incremental segmentation with VLMs is an emerging area with limited but growing literature. I’d recommend looking into:

Yu et al., CVPR 2023 – "Foundation Model Drives Weakly Incremental Learning for Semantic Segmentation"

They leverage CLIP for class-incremental segmentation under weak supervision.

"Learning from the Web: Language-Driven Weakly-Supervised Incremental Learning" (ECCV 2024)

This uses vision-language data (like image captions) to add new segmentation classes without full supervision.

Both address catastrophic forgetting while incrementally expanding class coverage using VLMs.

On a related note, in my own recent work titled "Advanced Crop Recommendation System: Leveraging Deep Learning and Fuzzy Logic for Precision Farming", we explore domain-adaptive image segmentation under evolving agricultural categories, where incremental learning for multi-class expansion is essential. While not vision-language driven, the framework bridges segmentation with evolving class labels—an adjacent challenge to your focus.

Please read more on my work titled "Advanced Crop Recommendation System: Leveraging Deep Learning and Fuzzy Logic for Precision Farming" :-

Article ADVANCED CROP RECOMMENDATION SYSTEM: LEVERAGING DEEP LEARNIN...

Stéphane Breton

As is often the case on the web, search results are largely dependent on the language vocabulary (keywords) used...

In artificial intelligence, continuous (or online) learning almost exclusively implements the mathematical concepts of recurrence (recurrent) and recursion (recursivity). Also, a new token has recently been introduced into the machine learning literature: “continual learning”.

By combining these terms with VLM, vision, language and segmentation, you will access a whole range of publications, including (but not limited to):

Yu et al., "Boosting Continual Learning of Vision-Language Models via Mixture-of-Experts Adapters", 2024 - https://arxiv.org/pdf/2403.11549
Hou et al., "VisualRWKV: Exploring Recurrent Neural Networks for Visual Language Models", 2025 - https://aclanthology.org/2025.coling-main.694.pdf
L. Pellegrini, "Continual Learning for Computer Vision Applications", 2022 - https://amsdottorato.unibo.it/id/eprint/10401/1/Lorenzo%20Pellegrini%20-%20PhD%20Thesis.pdf
Tang et al., "Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models", 2024 -
Preprint Mind the Interference: Retaining Pre-trained Knowledge in Pa...
Sun et al., "CLIP as RNN:Segment Countless Visual Concepts without Training Endeavor", 2024 - https://openaccess.thecvf.com/content/CVPR2024/papers/Sun_CLIP_as_RNN_Segment_Countless_Visual_Concepts_without_Training_Endeavor_CVPR_2024_paper.pdf
Sokar et al., "Continual Learning in Vision-Language Models via Aligned Model Merging", 2025 - https://arxiv.org/pdf/2506.03189
Zang et al., "Continual Learning of Image Classes with Language Guidance from a Vision-Language Model", 2024 - https://openreview.net/pdf?id=Z4OpKd7wOD
Hannan et al., "ReVisionLLM: Recursive Vision-Language Model for Temporal Grounding in Hour-Long Videos", 2024 -
Preprint ReVisionLLM: Recursive Vision-Language Model for Temporal Gr...

And so on...

Shafagat Mahmudova

Dear Jiahui Shi ,

Several papers recommend exploring incremental learning for segmentation based on Vision-Language Models (VLMs). These papers often address the challenges of catastrophic forgetting and the need for efficient learning of new classes with limited data.

Incremental learning addresses this challenge by enabling models to adapt continually to new and nonoverlapping tasks, while ensuring the maximum retention of knowledge from previous tasks to facilitate real-time inference.

https://arxiv.org/pdf/2405.01040?

Regards,

Shafagat