LDOS-CoMoDa dataset for Non-Linear Regression Analysis ?

22 August 2024 1 2K Report

Hi everyone,

Hope you're all okay !

I'm working on feature's importance and selection on the LDOS-CoMoDa dataset which a reference in CARS(Contextual-Aware Recommender System), a dataset that contains 12 contextual features and others which are static related to items and users, for movie ratings and recommendation.

My issue is the dataset is completely numerical (all the features are numerical values, even the categorical ones), and because it's a tabular dataset, I'm applying tree-based and ensemble methods like random Forest, XGBoost and other algorithm to assess features importance and selection, but, the results are mediocre (R2 square = 0.36 with optimization using Optuna).

My question is: how can I increase this result to higher performance ? I'm still focusing on features preprocessing and engineering, but i'm getting lost, I don't understand the problem ?

If someone has already worked with this dataset and performed such kind of analysis, please provide me with more explanations.

The csv file related to the dataset is joined to this message below.

Thank you !

Hao Wang

I used LDOS-CoMoDa a lot in the past 4 years. Check my publications by searching for "Hao Wang" and "Ratidar". Not sure if any of my ideas could help you, but it's worth try.

Badges
Science topic

More Nassim Lateb's questions See All

Multi-Task Learning Architecture for Inductive Learning ability ?

Hi folks, I'm a computer scientist PhD student, and I'm working on implementing Multi-Task Learning architecture for a better generalization aims, it will be throughout a Deep Learning model. I...

21 May 2024 8,589 1 View

How to define a step in the design interval using an optimization algorithm?

Hello experts This question is shared with one of my research team. We are dealing with an optimization problem in which the algorithm will choose the cross-section of the column (RC structure)...

26 March 2023 6,328 4 View

Comparing Mike 11 and SWAT?

I need a water quality-quantity model for improving the water quality of the Amirkabir dam, and I don’t know considering the limitations and advantages of both models: SWAT and MIKE 11 which one...

03 May 2022 4,293 6 View

Fitch Connect Database ?

Is there any one on the network who can provide access to the Fitch Connect Database regarding the banking metrics ?

22 April 2020 4,510 0 View

Why value "S" and "U" (shell parameters) is limited to 4 ?

Hi evrybody, when wa calculate stress du to radial local load (Pr), and/or moment (M), on a spherical sherical shell or head Value S to find stresses at distance x from centerline in the...

04 March 2020 3,629 1 View

What is the thermal behaviour of SiGe HBTs?

My name is nassim aliouche, I am a final year student in microtechnology at Marne la Vallée University , i am working on a presentation about the bipolar Transistor SiGe So , for For SiGe HBTs...

11 December 2019 6,578 1 View

Are there any vehicle models or tool boxes for testing fuel (pulse width) and ignition maps?

I'm currently doing a Master's project for mapping an ECU and was wondering whether I'd be able test those maps, using a model, as though I was running the vehicle on a dyno, or even testing the...

26 March 2019 5,704 1 View

How to protect the transmitted information by channel coding?

I have secondary informatins to protected and must be coded in the transmission channel. I'm looking for ideas or matlab codes that explain how to encode this informations.

25 March 2019 4,601 6 View

Can we write a function to define upper and lower bounds of the genetic algorithm?

Dear researchers I need to write a script to define the upper and the lower bounds for a genetic optimization using functions scripts. Each function (for Upper and Low bound) we be handled in...

10 February 2019 2,596 7 View

In a researcher proposal, should we outline contribution to theory throughout the literature review or separately in the conclusion of the intent?

To explain contribution to theory of a research study in a research intent, should we do it all along the document or in the conclusion?

06 January 2019 339 5 View

Could you recommend some articles on Urban Transportation System optimization and Innovation?

13 August 2024 2,595 3 View

How to learn more about SPSS and its Application?

I would like to learn more about SPSS and Its application especially in regards to data analysis. Please suggest me how I can learn more about it. Thank you so much.

11 August 2024 9,101 4 View

Can I base on reverse DNA sequences to perform alignment, convert to amino acids and GenBank submission?

I have reverse sequences (AB1 format), can I base on reverse DNA sequences to perform nucleotide alignment, convert nucleotides to amino acids and deposit the sequence in GenBank database?

11 August 2024 5,138 1 View

Baseline drift in HPLC? What causes this?

Hello, Why do i see this baseline drift when i compare my blank (black) to the sample (blue)? Any suggestions as to why this happened? Thank you!

11 August 2024 3,770 4 View

Text-Communication from the M1 Hand Area using BCI—and then there is Elon Musk?

Willett, Shenoy et al. (2021) have developed a brain computer interface (BCI) that used neural signal collected from the hand area of the motor cortex (area M1) of a paralyzed patient. The...

10 August 2024 7,180 0 View

Has anyone applied Python in the field of textile engineering for data analysis, automation, or smart textiles?

I'm currently exploring the application of Python in textile engineering, specifically in areas like data analysis, process automation, and the development of smart textiles. I'm interested in...

10 August 2024 7,429 2 View

How can I use the cif data obtained from rietveld refinement extracted via gsas2, for microstructural analysis using ETEX software?

09 August 2024 7,718 0 View

Ethylene glycol is newtonian or non newtonian fluid?

Some sources say it is treated as Newtonian as ethylene glycol has a higher viscosity than water, which affects the flow characteristics in simulations. But some also say it is non-Newtonian.

09 August 2024 2,111 2 View

How are iso-frequency contours plotted?

Let's say we have a standard, regular hexagonal honeycomb with a 3-arm primitive unit cell (something like the figure attached; the figure is only representative and not drawn to scale). The...

07 August 2024 1,937 1 View

How to prepare the nanoparticle treated fungal sample for Environmental SEM analysis?

A fungal strain was treated with nanoparticles. We want to do an environmental SEM analysis. So could anyone share your views on preparing the sample? Thank you.

07 August 2024 5,307 1 View