It is related to neural networks, and basically I have to train the network. There are 14 electrodes and 70,000 features, so I need to reduce the features without losing information.
You can apply one of the common techniques for dimensionality reduction. Have a look at singular value decomposition (SVD) or principal component analysis (PCA). The latter may need more time to be trained.
You can easily experiment with these methods in Python's sklearn library. Both SVD and PCA reduce your input significantly, provided that some of the features are correlated. They also allow you to check the remaining variance, which gives you an idea of how much information you lose.
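If it helps, here is a minimal sklearn sketch of both options; the array shape and the number of components below are just assumptions, not values taken from the question.

    # Minimal sketch, assuming your data sits in a NumPy array X of shape
    # (n_samples, n_features); the random X below is only a stand-in.
    import numpy as np
    from sklearn.decomposition import PCA, TruncatedSVD

    X = np.random.rand(100, 70000)       # replace with your real feature matrix

    # PCA centers the data first; n_components cannot exceed min(n_samples, n_features)
    pca = PCA(n_components=10)
    X_pca = pca.fit_transform(X)
    print("variance kept by PCA:", pca.explained_variance_ratio_.sum())

    # TruncatedSVD skips the centering step and also works on sparse matrices
    svd = TruncatedSVD(n_components=10)
    X_svd = svd.fit_transform(X)
    print("variance kept by SVD:", svd.explained_variance_ratio_.sum())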
Another idea is to reduce the dimension with a variational or neural-network-based autoencoder, which can be a little trickier to optimize.
Let us know if you need more references or tool suggestions. I would start with SVD and see how it goes.
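For the autoencoder route, a rough sketch could look like the following; PyTorch, the layer sizes, and the latent dimension are my own assumptions, not something fixed by the thread.

    # Rough sketch of a dense autoencoder; the bottleneck output is the reduced
    # representation. All sizes here are arbitrary choices, not tuned values.
    import torch
    import torch.nn as nn

    n_features, n_latent = 70000, 64

    class AutoEncoder(nn.Module):
        def __init__(self):
            super().__init__()
            self.encoder = nn.Sequential(
                nn.Linear(n_features, 256), nn.ReLU(),
                nn.Linear(256, n_latent),
            )
            self.decoder = nn.Sequential(
                nn.Linear(n_latent, 256), nn.ReLU(),
                nn.Linear(256, n_features),
            )

        def forward(self, x):
            z = self.encoder(x)
            return self.decoder(z), z

    X = torch.randn(14, n_features)          # placeholder; substitute your real data
    model = AutoEncoder()
    optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
    loss_fn = nn.MSELoss()

    for epoch in range(50):                  # reconstruction training loop
        optimizer.zero_grad()
        reconstruction, code = model(X)
        loss = loss_fn(reconstruction, X)
        loss.backward()
        optimizer.step()

    with torch.no_grad():
        compressed = model.encoder(X)        # shape (14, n_latent): the reduced features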
There is a library for that, so there is no need to implement it yourself: https://code.google.com/archive/p/redsvd/
As a matter of advice, I'd still recommend prototyping in sklearn, because it is nice for analysis. Once you have reduced your feature vectors, nothing stops you from dumping them to a file and then running your C++ code on the compressed features.
I used MATLAB to generate the random matrix in Eq. 4 and plugged it into Eq. 1. Two lines of code. It worked very nicely for my application, which had very sparse data. It is also very fast.
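For reference, the same general idea (multiplying the data by a random projection matrix) is available off the shelf in scikit-learn. I can't see the Eq. 1 / Eq. 4 being referenced here, so this sketch is just the generic random-projection recipe with made-up sizes.

    # Generic random-projection sketch: X_reduced = X @ R for a sparse random R.
    import numpy as np
    from sklearn.random_projection import SparseRandomProjection

    X = np.random.rand(14, 70000)            # placeholder for the real data matrix

    proj = SparseRandomProjection(n_components=1000, random_state=0)
    X_reduced = proj.fit_transform(X)
    print(X_reduced.shape)                   # (14, 1000)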
Calculate the weight, sum, or some other measure of the length of each row and column; if you find that a row or column has little effect on the main result, delete one row or one column at a time. That will reduce the dimension of the matrix.
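As a rough NumPy illustration of that idea (the threshold and the column-only, all-at-once pruning are my own simplifications):

    # Drop the columns (features) whose overall magnitude is negligible.
    import numpy as np

    X = np.random.rand(14, 70000)                    # placeholder for the real matrix

    col_norms = np.linalg.norm(X, axis=0)            # one "length" per column
    keep = col_norms > 0.01 * col_norms.max()        # arbitrary example threshold
    X_pruned = X[:, keep]
    print(X_pruned.shape)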
If you have faith in our nested-algorithm technology (a different set of coauthors) for information and financial processing (check my website on RG, especially the paper on math. stat. pricing), you could divide your stack into groups of 7000 using some criterion and check whether repeated estimation gives stable results. Or else take averages. The proof of this is slightly complex, but it exists. SKM, NH, SM
I think that is not a big matrix: 14*70000 = 980,000 entries, roughly 980 KB at one byte per entry, which is very little space for a PC with one GB of memory. You can manage this in C, which is adaptable to almost any package.
I am not familiar with the time requirements for working with neural nets as you mention, but in general the size of the matrix can matter for efficient processing, and aggregation/disaggregation techniques may be employed. The size can be quite consequential if doing, for example, integer or nonlinear optimization. Search the Operations Research journal, 1991, for a framework on aggregation/disaggregation.
Simple cluster analysis can be quite effective at determining which of the 70,000 features are identical or close to each other, and that may be the "easiest" way. As previously mentioned, factor/principal component analysis can be used, but other methods such as discriminant analysis also exist. In general, I would think cluster analysis with Ward's method or average linkage would be indicated. How far you need to cluster (the level of clustering) is a question: 50,000? 35,000? 10,000? In general, more accuracy is lost the more you cluster.
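scikit-learn's FeatureAgglomeration does this kind of feature clustering with Ward linkage directly; a sketch with made-up sizes follows. For the full 70,000 features the pairwise linkage becomes memory-hungry, so a connectivity constraint or a pre-reduction step may be needed.

    # Cluster the features themselves and replace each cluster by its mean.
    import numpy as np
    from sklearn.cluster import FeatureAgglomeration

    X = np.random.rand(14, 2000)              # small placeholder; your matrix is 14 x 70000

    agglo = FeatureAgglomeration(n_clusters=500, linkage="ward")
    X_reduced = agglo.fit_transform(X)         # each new column is the mean of one cluster
    print(X_reduced.shape)                     # (14, 500)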
The answer by Ali Mansoor doesn't concern the reduction of the dimension, but how to store a sparse matrix in dense form. This was not the question.
Btw.: Which matrix do you want to look at?
Is it really 14 electrodes giving just one signal, i.e. a vector of dimension 14, which the network should use to identify 70000 features? Or is it a sequence of signals of dimension 14 over time? That would of course give much more information.
If the task really is to identify such a big number of features on the basis of just 14 given values: who determined that number 14 beforehand?
You want to reduce the number of features classified by the network - in other words, you want to solve the problem of how the network should react to data before training it.
You could see this, e.g., as a question of statistical analysis of the data (regarding the features): What are the covariances? Are there dependencies between the features in the real data?
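One quick way to probe such dependencies is to look at correlations on a random subset of the features (the full 70,000 x 70,000 correlation matrix would be impractical); the sizes below are placeholders.

    # Correlation check on a random subset of the features.
    import numpy as np

    X = np.random.rand(14, 70000)                            # placeholder data
    idx = np.random.choice(X.shape[1], size=500, replace=False)
    corr = np.corrcoef(X[:, idx], rowvar=False)              # 500 x 500 correlation matrix
    off_diag = np.abs(corr[np.triu_indices_from(corr, k=1)])
    print("mean absolute correlation:", off_diag.mean())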
It simply says that there are alternatives to using a neural network.
I do not know what the 14*70000 matrix represents. However, Lanczos reduction methods are very good at keeping the important parts of a matrix at a greatly reduced size.
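For what it's worth, SciPy's truncated SVD (scipy.sparse.linalg.svds, backed by ARPACK's Lanczos-type iteration) is an easy way to try this; the sizes below are placeholders.

    # Keep only the k dominant singular triplets of the matrix.
    import numpy as np
    from scipy.sparse.linalg import svds

    A = np.random.rand(14, 70000)            # placeholder for the 14 x 70000 matrix

    k = 10                                   # must be smaller than min(A.shape)
    U, s, Vt = svds(A, k=k)                  # singular values come back in ascending order

    A_k = U @ np.diag(s) @ Vt                # rank-k approximation of A
    print(s[::-1])                           # largest singular values first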