I am doing research on feature selection for extra-high-dimensional data (big data). Does anyone have good advice about finding redundant features? Please don't mention PCA, decision trees or information gain :(
You could go old school and compute an intercorrelation matrix, i.e. the pairwise correlation coefficients of all features taken two at a time. Then remove a feature if (1) it has very low variance (e.g., less than 0.01) or (2) its pairwise correlation coefficient with another feature is greater than some threshold, for example 0.99.
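A minimal sketch of that two-step filter in Python with pandas, using the illustrative thresholds from the answer above (0.01 for variance, 0.99 for correlation); the function name and defaults are my own, not from any particular library:

```python
import numpy as np
import pandas as pd

def filter_redundant(X, var_thresh=0.01, corr_thresh=0.99):
    """Drop near-constant features, then one of each highly correlated pair.

    X: pandas DataFrame of numeric features. Thresholds are the
    illustrative values from the answer above, not universal defaults.
    """
    # (1) remove features with very low variance
    X = X.loc[:, X.var() > var_thresh]
    # (2) pairwise absolute correlation matrix
    corr = X.corr().abs()
    # keep only the upper triangle so each pair is tested once
    upper = corr.where(np.triu(np.ones(corr.shape, dtype=bool), k=1))
    # drop a column if it is near-duplicated by an earlier column
    to_drop = [c for c in upper.columns if (upper[c] > corr_thresh).any()]
    return X.drop(columns=to_drop)
```

Note that for truly extra-high-dimensional data the full p-by-p correlation matrix may not fit in memory, so in practice you would compute it in column blocks.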
Please see Figure 4 from the full text of the link provided below.
Hope this helps.
Best regards,
Chanin
Article Unraveling the origin of splice switching activity of hemogl...
See EC-FS (Feature Selection via Eigenvector Centrality); Matlab code is available in the Feature Selection Library: https://it.mathworks.com/matlabcentral/fileexchange/56937-feature-selection-library?requestedDomain=www.mathworks.com
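To illustrate the idea behind eigenvector-centrality feature selection, here is a heavily simplified Python sketch: build a graph whose nodes are features, weighted here by absolute Pearson correlation (the actual EC-FS method uses a richer, supervised adjacency kernel), and score each feature by its entry in the leading eigenvector of that graph:

```python
import numpy as np

def ec_rank(X):
    """Rank features by eigenvector centrality of a feature-similarity graph.

    X: array of shape (n_samples, n_features). Returns feature indices
    ordered from most to least central. This is a toy sketch of the
    concept, not the EC-FS algorithm from the toolbox above.
    """
    # feature-by-feature adjacency: absolute correlation, no self-loops
    A = np.abs(np.corrcoef(X, rowvar=False))
    np.fill_diagonal(A, 0.0)
    # A is symmetric, so use eigh; the last column is the leading eigenvector
    vals, vecs = np.linalg.eigh(A)
    centrality = np.abs(vecs[:, -1])
    return np.argsort(centrality)[::-1]
```

Highly intercorrelated (i.e., redundant) features form dense subgraphs and come out with high centrality, so the ranking also gives you a direct handle on redundancy.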