Which algorithms can identify linear pattern from random data points?

More Mohammad Kazemi-Beydokhti's questions See All

Is (Are) there any valid corpus containing geographical events queries?

Hi all, I'm currently looking for a question corpus to have a plausible number of spatial events queries. I found MS MARCO dataset in which I could extract a reasonable number of geographical...

08 December 2020 8,093 2 View

How to learn more about SPSS and its Application?

I would like to learn more about SPSS and Its application especially in regards to data analysis. Please suggest me how I can learn more about it. Thank you so much.

11 August 2024 9,101 4 View

Handling Missing Data and Building a Predictive Model with Incomplete Information ?

I am developing a predictive model for a water supply network that involves 20 influencing points. However, I only have historical data for 10 out of these 20 points. I would like to know how to...

10 August 2024 4,005 2 View

Has anyone applied Python in the field of textile engineering for data analysis, automation, or smart textiles?

I'm currently exploring the application of Python in textile engineering, specifically in areas like data analysis, process automation, and the development of smart textiles. I'm interested in...

10 August 2024 7,429 2 View

Do you know best mines of western part of Afghanistan?

I want to know more about Mn deposits in west of Afghanistan.

07 August 2024 3,427 1 View

Is Galaxy.org good to use for research for analyzing data and for publication?

Hello all, I wanted to know, can I use galaxy (USA, Europe or Australia) platform for analyzing the shotgun data, and can it be used for publication purpose as well? Thanks :)

06 August 2024 6,610 4 View

Do experts have journals in the field of artificial intelligence and big data that are not indexed by SCI or EI?

05 August 2024 8,836 2 View

What are possible strategies can be used to analyze data under sequential explanatory mixed method approach?

Better ways to analyze the qualitative and quantitative data in a sequential explanatory mixed method approaches

04 August 2024 2,703 6 View

How can I interpret the data without the need of solving it manually?

How can I interpret the data gathered without solving?

03 August 2024 9,054 3 View

Why can't academics earn the money they deserve?

Only Journals make money from the articles we have worked on for years. Academics do not earn money from their refereeing. Then shouldn't the solution be a system in which academics can earn...

01 August 2024 6,469 6 View

Conjugation of PEG-Amine to an Amino Acid Using EDC?

I am attempting to conjugate PEG to an amino acid at the C-terminus, for the purposes of producing nanoparticles. I have been told that PEG modified with amine groups can be used for this purpose,...

31 July 2024 2,033 1 View

Mohammad Kazemi-Beydokhti

Thank you dear Yuriy...

Do you know any other methods except linear regression for this purpose?

Tarik A. Rashid

Dear Mohamad,

You can use Linear SVM or ANN with linear function at the output layer.

Warm regards

Tarik

Thank you so much Yuriy and Tarik...

Navjyotsinh Jadeja

SVM could be a better approach to achieve it

Patricia Ryser-Welch

I would try a form of GP to complete some symbolic regression.

Dear Patricia, what do you mean about GP ? can you explain more. thanks.

Genetic Programming uses an evolutionary algorithm to evolve mathematical expression or algorithms. The search space focuses on finding a solver that solves well a problem. In genetic programming, many researchers evolves some programmes or mathematical expression then apply them to solve the problem. They are interested in good problem solutions. Some other researchers (like me) prefers studying the algorithms or mathematical expressions and solves other instances of a type of problems. These results give us an idea how general is the generated solver.

You may find these papers useful. I would research some work from these authors: Koza, Banzhaf, Langdon, J.F.Miller and L. Spector.

Patricia

Conference Paper GECCO 2013 tutorial: Cartesian genetic programming

Conference Paper Generating Human-readable Algorithms for the Travelling Sale...

Rob Podolski

Douglas Peucker algorithm is great for extracting curves from many points. Perhaps a start-point?

Diego C. F. Queiroz

If your aim is to find the best line that represents those data points, there are numerous ways in which one can be found. One of the simplest ways is using a linear regression model.

In the following link check the first two answers. First one gives a general idea of the methods already implemented in software (an example written in R) and the second one of the algebra behind it.

http://stats.stackexchange.com/questions/1829/what-algorithm-is-used-in-linear-regression

Thank you Rob and Diego...but i found SVR(support vector regression) is better than linear regression due to it's lower RMSE... As it mentioned in the following link a comparison between linear regression and SVR is applied for the same dataset as input. The results declare the higher accuracy for SVR which have a better fitness to sample points.

http://www.svm-tutorial.com/2014/10/support-vector-regression-r/

Jack H Hiller

I wonder if you are joking? The SVR provides a curvilinear fit, not a linear fit, and must do so at the sacrifice of degrees of freedom. Any standard linear or multi-linear calculation program will produce the fit with the MSE minimized-- that's how it works to set the line parameters (b and c), and calculate the R, or R-squared for proportion of variance accounted for..

A curvilinear fit ought not be attempted, unless you are testing a theory that calls for a curvilinear model, and then you ought to use a non-linear modeling program. When the SVR obtains a better fit than a linear model, you run the risk of having optimized on error in the original data, which begs for empirical cross validation, and experience implies the cross validation R squared will be disappointing. You might then run a standard linear model, from which you can apply any of the shrinkage estimates w/o having to run an empirical cross validation, although such makes for the best reliability of results.

Ramesh Chandra Bagadi

Dear Mohammed

One can use y=mx+c where y is the y ordinate, x is the x co-ordinate and m is the slope of the strainght line y=mx+c. You need to first find the Statistical Centroid of the data points and let this line pass through it. Now simply revolving it and linearly translating it forward and/ or backward can get you the best linear relationship in the x,y scatter. Actually, in Microsoft Excel and also in MATLAB, there is a facilitation to find and plot such x,y scatter relationship in a linear form.

Hard to know exactly what you're after, but did you think about the Hough Transform?

SVM is a great tool to handle large data patterns.