In machine learning, gradient descent is used to find the best line matching the training set, but, as far as I know, in statistics specific formulas are used to compute the regression parameters. Is there a difference in the accuracy of these methods?
Let me share my take, and let's discuss this together.
Perhaps the similarity between the methods used in statistical modeling and machine learning makes people think they are the same thing. That is understandable, but it is not actually the case.
The most obvious example is linear regression, which is probably the main source of this misconception. Linear regression is a method with which we can both train a linear regressor (machine learning) and fit a statistical regression model by least squares (statistics).
In this case, what the former does is called "training" the model: it uses only a subset of the data, and the performance of the trained model can be known only after it is evaluated on another subset of the data, the test set. In this example, the ultimate goal of machine learning is to achieve the best performance on the test set.
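This train/test workflow can be sketched in a few lines. The data here are invented for illustration (a linear trend with Gaussian noise); the point is only that the model is fit on the training subset and judged on the held-out test subset:

```python
import numpy as np

# Hypothetical data: a linear trend with noise (values invented for illustration)
rng = np.random.default_rng(1)
x = rng.uniform(0, 10, 100)
y = 3.0 * x - 2.0 + rng.normal(0, 1.0, 100)

# Machine-learning workflow: fit on a training subset, evaluate on a held-out test set
idx = rng.permutation(len(x))
train, test = idx[:80], idx[80:]

X = np.column_stack([np.ones_like(x), x])  # design matrix with intercept column
beta = np.linalg.lstsq(X[train], y[train], rcond=None)[0]  # fit on training data only

test_mse = np.mean((X[test] @ beta - y[test]) ** 2)  # performance judged on unseen data
print(f"test MSE: {test_mse:.3f}")
```

Libraries such as scikit-learn wrap this split-fit-score loop in utilities like `train_test_split`, but the logic is the same.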
For the latter, we assume in advance that the data follow a linear relationship with Gaussian noise and then try to find the line that minimizes the mean squared error over all the data. No training or test set is required, and in many cases, especially in research (as in the sensor example below), the purpose of modeling is to describe the relationship between the data and the output variables, rather than to make predictions about future data. We refer to this process as statistical inference, not prediction. Although we can use such a model to make predictions, which may be what you want, the way to evaluate the model is no longer test-set performance but the significance and robustness of the model parameters.
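On the accuracy question from the original post: for linear regression both routes minimize the same mean-squared-error objective, so gradient descent (run to convergence with a suitable learning rate) and the closed-form least-squares solution arrive at essentially the same line. A small sketch on synthetic data (the learning rate and iteration count here are illustrative choices, not universal settings):

```python
import numpy as np

# Synthetic data: y = 2x + 1 plus Gaussian noise (illustrative values)
rng = np.random.default_rng(0)
x = rng.uniform(0, 10, 200)
y = 2.0 * x + 1.0 + rng.normal(0, 0.5, 200)
X = np.column_stack([np.ones_like(x), x])  # design matrix with intercept

# Closed-form least squares (normal equations): beta = (X^T X)^{-1} X^T y
beta_closed = np.linalg.solve(X.T @ X, X.T @ y)

# Batch gradient descent on the same mean-squared-error objective
beta_gd = np.zeros(2)
lr = 0.01
for _ in range(20000):
    grad = 2 / len(y) * X.T @ (X @ beta_gd - y)
    beta_gd -= lr * grad

# Both methods minimize the same convex objective, so they converge to the same line
print(beta_closed)
print(beta_gd)
```

Any remaining difference comes from the optimizer not having fully converged (or from numerical conditioning), not from gradient descent being an inherently less accurate estimator.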
The goal of machine learning (here, specifically supervised learning) is to obtain a model that predicts well on new data. We usually do not care whether the model is interpretable; machine learning cares only about results, much as your value to a company might be measured only by your performance. Statistical modeling, on the other hand, is more about finding relationships between variables and testing the significance of those relationships, which happens also to serve prediction.
Regression analysis is regression analysis either way. However, machine learning can significantly simplify some complex regression problems. Two examples:
1. The Cubist program (called M5 in Weka) divides the data set into subgroups and fits a regression model to each subgroup individually. In my experience this can improve the analysis enormously, with a large jump in R².
2. The Cox proportional hazards model is the standard regression model for survival analysis and for identifying variables with predictive value. Survival trees (a machine-learning method for survival analysis) are almost always superior to Cox regression and easier to interpret.
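The subgroup idea behind Cubist/M5 can be illustrated with a toy example. This is not the Cubist algorithm itself (which learns the splits from the data); here the split point at x = 5 is assumed known, purely to show why fitting separate linear models per subgroup can lift R² when the data are piecewise linear:

```python
import numpy as np

# Piecewise-linear data: two regimes with different slopes (synthetic, for illustration)
rng = np.random.default_rng(2)
x = rng.uniform(0, 10, 300)
y = np.where(x < 5, 1.0 * x, 5.0 + 4.0 * (x - 5)) + rng.normal(0, 0.3, 300)

def r2(y_true, y_pred):
    ss_res = np.sum((y_true - y_pred) ** 2)
    ss_tot = np.sum((y_true - y_true.mean()) ** 2)
    return 1 - ss_res / ss_tot

X = np.column_stack([np.ones_like(x), x])

# Single global regression over all the data
beta = np.linalg.lstsq(X, y, rcond=None)[0]
r2_global = r2(y, X @ beta)

# Cubist-style idea: split into subgroups and fit a separate model in each
# (the split at x = 5 is assumed known here; Cubist/M5 learns such splits)
pred = np.empty_like(y)
for mask in (x < 5, x >= 5):
    b = np.linalg.lstsq(X[mask], y[mask], rcond=None)[0]
    pred[mask] = X[mask] @ b
r2_split = r2(y, pred)

print(f"global R^2: {r2_global:.3f}, per-subgroup R^2: {r2_split:.3f}")
```

The per-subgroup fit captures the two regimes that a single global line cannot, which is the kind of R² improvement described above.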