I'm working with a large dataset (P = 13, N ≈ 4500) that has a lot of nonlinearity. I'm interested in reducing its dimensionality prior to clustering.
Kernel PCA seems like a good approach. I'm wondering whether it makes sense to judge the rough quality of each fit by comparing the sum of the first few eigenvalues (equivalent to the total explained variance) across multiple values of sigma. Working with a Gaussian kernel (for now, at least), I'm fitting a kPCA several times, varying sigma over several orders of magnitude, from 0.0002 to 20, and retaining the first 10 features of each fit. My hope is to identify the best of my current test values for sigma, then contract the search range and home in on it iteratively.
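For concreteness, this is roughly what I'm doing now (using kernlab's kpca; `my_data` is a placeholder for my data frame, and the grid endpoints and the choice of 10 features are just my current guesses):

```r
## Rough sketch of my current approach (kernlab's kpca; "my_data" is a
## placeholder for my data frame; the sigma grid and the choice of 10
## features are just my current guesses, nothing principled)
library(kernlab)

X <- as.matrix(my_data)

## note: kernlab's rbfdot kernel is exp(-sigma * ||x - x'||^2), so sigma here
## is an inverse kernel width
sigmas <- 10^seq(log10(2e-4), log10(20), length.out = 6)

eig_sums <- sapply(sigmas, function(s) {
  fit <- kpca(X, kernel = "rbfdot", kpar = list(sigma = s), features = 10)
  sum(eig(fit))   # sum of the first 10 eigenvalues for this sigma
})

cbind(sigma = sigmas, eig_sum = eig_sums)
```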
I haven't found much guidance on how to tune sigma that isn't, to be honest, over my head. Reconstruction error seems like a promising alternative metric, but I'm not sure how to implement it in R.
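The only version I can actually picture computing is a feature-space one, where the error is just the eigenvalue mass that falls outside the retained components, something like the sketch below (`my_data` and the function name are again placeholders). I don't know whether that's the quantity people usually mean, or whether a proper pre-image reconstruction back in the original 13 variables is needed; that's the part that's over my head.

```r
## My best guess at a reconstruction-error proxy: eigendecompose the centered
## kernel matrix by hand and treat the eigenvalue mass outside the first q
## components as the (feature-space) reconstruction error
library(kernlab)

feature_space_recon_error <- function(X, sigma, q = 10) {
  K  <- kernelMatrix(rbfdot(sigma = sigma), as.matrix(X))
  n  <- nrow(K)
  H  <- diag(n) - matrix(1 / n, n, n)    # centering matrix
  Kc <- H %*% K %*% H                    # centered kernel matrix
  ev <- eigen(Kc, symmetric = TRUE, only.values = TRUE)$values
  ev <- pmax(ev, 0)                      # clip tiny negative values from rounding
  sum(ev[-(1:q)]) / sum(ev)              # fraction of feature-space variance lost
}

## slow-ish for N ~ 4500 (full 4500 x 4500 eigendecomposition), but it runs
feature_space_recon_error(my_data, sigma = 0.1)
```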
I'm grateful for any thoughts or suggestions.