Poor quality in feature-guided diffusion models: mismatched features between generated images and input targets?

17 August 2025

Research Context

Objective:

To verify whether a diffusion model can generate images that match target features when conditioned on two paired input features (initially simplified by using features extracted from the same source image).

• Ideal outcome: The model should generate images whose features align with the original image's features.
• Current setup: In real-world scenarios, the correspondence between features and original images is unknown. This experiment uses paired features as a simplified proxy.

Methodology

• Input: Two paired features extracted from the same image (e.g., Feature A + Feature B).
• Control: Constrain the generated image’s features to match the target feature (e.g., Feature A) via similarity metrics (L2 loss or feature-space distance).
• Model: Modified DDPM architecture with dual-feature conditioning.
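One way the feature constraint above could be applied during training is to compute it on the model's estimate of the clean image rather than on the noisy sample. The sketch below assumes the standard DDPM forward process, x_t = sqrt(ᾱ_t)·x_0 + sqrt(1 − ᾱ_t)·ε, and a noise-prediction network. The names `extract_features`, `combined_loss`, and `lambda_feat` are hypothetical and are not taken from the setup described in this post; the toy extractor only stands in for a real one.

```python
import numpy as np

def predict_x0(x_t, eps_pred, alpha_bar_t):
    """Invert the DDPM forward process to estimate the clean image x_0."""
    return (x_t - np.sqrt(1.0 - alpha_bar_t) * eps_pred) / np.sqrt(alpha_bar_t)

def extract_features(img):
    """Hypothetical stand-in for a real extractor: per-channel means of a (C, H, W) image."""
    return img.mean(axis=(1, 2))

def combined_loss(x_t, eps, eps_pred, alpha_bar_t, target_feat, lambda_feat=0.1):
    """Noise-prediction MSE plus an L2 feature-matching term on the x_0 estimate."""
    noise_loss = np.mean((eps - eps_pred) ** 2)
    x0_hat = predict_x0(x_t, eps_pred, alpha_bar_t)
    feat_loss = np.mean((extract_features(x0_hat) - target_feat) ** 2)
    return noise_loss + lambda_feat * feat_loss, noise_loss, feat_loss

# Toy check: with a perfect noise prediction, both terms vanish.
rng = np.random.default_rng(0)
x0 = rng.normal(size=(3, 8, 8))
eps = rng.normal(size=x0.shape)
alpha_bar_t = 0.5
x_t = np.sqrt(alpha_bar_t) * x0 + np.sqrt(1.0 - alpha_bar_t) * eps
total, noise_loss, feat_loss = combined_loss(
    x_t, eps, eps, alpha_bar_t, extract_features(x0)
)
```

Because the x_0 estimate divides by sqrt(ᾱ_t), any error in the predicted noise is amplified at high-noise timesteps, so the feature term may be dominated by those steps. Down-weighting or skipping it there is one option worth testing, not an established fix for this setup.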

Observations

1. Pixel-space mismatch: Generated images show no visual resemblance to the original.

2. Feature-space discrepancy: Re-extracted features from generated images exhibit low similarity to targets (e.g., cosine similarity ≈ 0.3).
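To interpret a cosine similarity of about 0.3, it may help to compare it against a baseline between features of unrelated images. The sketch below uses synthetic non-negative vectors as an assumed stand-in for post-ReLU features: such vectors have a positive cosine similarity even when unrelated, and subtracting a dataset mean removes that offset. The helper name and the synthetic data are illustrative only.

```python
import numpy as np

def cosine_similarity(a, b, eps=1e-8):
    """Cosine similarity between two flattened feature vectors."""
    a = np.asarray(a, dtype=np.float64).ravel()
    b = np.asarray(b, dtype=np.float64).ravel()
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + eps))

# Synthetic non-negative "features" for 100 unrelated images (assumption).
rng = np.random.default_rng(0)
feats = np.abs(rng.normal(size=(100, 64)))
mean_feat = feats.mean(axis=0)

# Baseline similarity between two unrelated samples, before and after centering.
raw = cosine_similarity(feats[0], feats[1])
centered = cosine_similarity(feats[0] - mean_feat, feats[1] - mean_feat)
```

If the baseline between unrelated real images turns out to be near or above 0.3, the generated images are carrying essentially no information about the target features.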

Attempted Solutions

• Adjusted loss weights (feature-matching loss vs. diffusion noise).
• Tested different noise schedules (linear vs. cosine).
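For reference, the two schedules mentioned above can be compared through their cumulative signal level ᾱ_t. This is a minimal sketch assuming the commonly used defaults: a linear β schedule from 1e-4 to 0.02 over T = 1000 steps, and the cosine schedule with offset s = 0.008. The values actually used in this experiment are not stated in the post.

```python
import numpy as np

def linear_alpha_bar(T, beta_start=1e-4, beta_end=0.02):
    """Cumulative product of (1 - beta_t) for a linear beta schedule."""
    betas = np.linspace(beta_start, beta_end, T)
    return np.cumprod(1.0 - betas)

def cosine_alpha_bar(T, s=0.008):
    """alpha_bar_t = f(t) / f(0), with f(t) = cos^2(((t / T + s) / (1 + s)) * pi / 2)."""
    steps = np.arange(T + 1) / T
    f = np.cos((steps + s) / (1.0 + s) * np.pi / 2.0) ** 2
    return (f / f[0])[1:]

T = 1000
lin = linear_alpha_bar(T)
cos = cosine_alpha_bar(T)

# Signal-to-noise ratio per timestep, useful when weighting a feature loss by t.
snr_lin = lin / (1.0 - lin)
snr_cos = cos / (1.0 - cos)
```

The cosine schedule keeps more signal at intermediate timesteps than the linear one, which changes how many timesteps yield a usable clean-image estimate for a feature-matching term.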

Key Questions

Model Selection

1. Is this task suitable for diffusion models?

2. If proceeding with diffusion, are specialized conditioning mechanisms needed (e.g., cross-attention instead of FiLM)?
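To make the FiLM versus cross-attention question concrete, here is a minimal NumPy sketch of both mechanisms applied to one feature map. All shapes, weight matrices, and function names are illustrative assumptions with random weights; this is not the architecture used in the experiment. FiLM receives the two features as one concatenated vector, while cross-attention receives them as two separate tokens.

```python
import numpy as np

rng = np.random.default_rng(0)

def film(h, cond, w_gamma, w_beta):
    """FiLM: per-channel scale and shift predicted from one condition vector.
    h: (C, H, W), cond: (D,), w_gamma and w_beta: (D, C)."""
    gamma = cond @ w_gamma
    beta = cond @ w_beta
    return (1.0 + gamma)[:, None, None] * h + beta[:, None, None]

def cross_attention(h, tokens, w_q, w_k, w_v):
    """Single-head cross-attention: each spatial location attends over condition tokens.
    h: (C, H, W), tokens: (N, D), w_q: (C, A), w_k: (D, A), w_v: (D, C)."""
    C, H, W = h.shape
    q = h.reshape(C, H * W).T @ w_q            # (HW, A)
    k = tokens @ w_k                           # (N, A)
    v = tokens @ w_v                           # (N, C)
    logits = q @ k.T / np.sqrt(q.shape[-1])    # (HW, N)
    logits -= logits.max(axis=-1, keepdims=True)  # numerical stability
    attn = np.exp(logits)
    attn /= attn.sum(axis=-1, keepdims=True)
    out = attn @ v                             # (HW, C)
    return h + out.T.reshape(C, H, W), attn    # residual connection

C, H, W, D = 4, 8, 8, 16
h = rng.normal(size=(C, H, W))
feat_a, feat_b = rng.normal(size=D), rng.normal(size=D)

# FiLM: both features concatenated into one global condition vector.
cond = np.concatenate([feat_a, feat_b])
h_film = film(h, cond, 0.1 * rng.normal(size=(2 * D, C)), 0.1 * rng.normal(size=(2 * D, C)))

# Cross-attention: the two features kept as separate tokens.
tokens = np.stack([feat_a, feat_b])
h_attn, attn = cross_attention(
    h, tokens, rng.normal(size=(C, 8)), rng.normal(size=(D, 8)), rng.normal(size=(D, C))
)
```

With only two pooled vectors as tokens, cross-attention reduces to a per-location mixture of the two features. Its advantage over FiLM is likely to show mainly when the conditioning keeps spatial structure, for example feature maps flattened into many tokens.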

Feature Representation

1. Could feature-space disparities (e.g., scale mismatches between EfficientNet and ResNet features) hinder convergence?
• Would feature normalization/disentanglement help?
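A simple form of the normalization in question is to standardize each feature dimension over the dataset and then L2-normalize each vector, so that both extractors feed the conditioning pathway at a comparable scale. The sketch below uses synthetic data. The dimensions 1280 and 2048 match the pooled outputs of EfficientNet-B0 and ResNet-50 but are only an assumption about the extractors used here, and the scale factors are invented for illustration.

```python
import numpy as np

def standardize(feats, eps=1e-6):
    """Zero mean and unit variance per feature dimension, computed over the dataset axis."""
    mean = feats.mean(axis=0, keepdims=True)
    std = feats.std(axis=0, keepdims=True)
    return (feats - mean) / (std + eps)

def l2_normalize(feats, eps=1e-8):
    """Scale each feature vector (row) to unit Euclidean norm."""
    return feats / (np.linalg.norm(feats, axis=1, keepdims=True) + eps)

# Synthetic features with deliberately mismatched scales (assumption).
rng = np.random.default_rng(0)
feat_a = 50.0 * np.abs(rng.normal(size=(256, 1280)))
feat_b = 0.5 * np.abs(rng.normal(size=(256, 2048)))

norm_a = l2_normalize(standardize(feat_a))
norm_b = l2_normalize(standardize(feat_b))

# Average vector norm before normalization, to show the scale gap.
raw_scale_a = np.linalg.norm(feat_a, axis=1).mean()
raw_scale_b = np.linalg.norm(feat_b, axis=1).mean()
```

In practice the mean and standard deviation would be computed once on the training set and reused at sampling time, and the same normalization would be applied to the targets of any feature-matching loss.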

Requests for Advice

1. Literature: Are there papers on diffusion models generating images from non-image features (e.g., paired embeddings)?

2. Technical suggestions:
• Loss function design (e.g., hybrid pixel/feature losses).
• Architectural modifications (e.g., cross-attention, feature fusion strategies).
