Dear all,

I have a dataset thet consists of 43 Msc. educational students records, which have the following attributes:

  • < Student ID, Students Bachelor AGPA, Student Age, Student Bachelor specialization, Msc Course ID, Msc Course Instructors, and Msc. Course Grade>

The problem that i am facing is that students are from differant batches, and for the latest 7 students there was a changes that led to decrease the number of taken Msc courses from 8 to 6 courses.

I arrange the attributes in 2 differant ways, as you can see in the attached 1234.png screenshot, and my questions are:

  • Which theme is better for analysis, with more attributes (2) or with less attributes (1) ? [Noting that i want to predict 2 things students MSc AGPA and Each Msc. Course Grade]
  • For the last 7 students, since they dont have the last 2 courses records, I am planning to make all the 43 records with six cources and i will delete the last 2 courses of the old batches. Whats your advice?

Another question, in substituting missing records, what is better for small datasets, i.e. to substituion missings with mean or median? or what is the most reliable way to substitute with for smal size dataset? [Link to a papers related to that will be much appreciated]

I know that there is no classifiers that could be suggested for a certain datasets, but since i am going to predict ordinal and continues numbers, and based on your experiance which classification algorithims are willing to handle small size datasets? and which descriptive statistics or visualizationtechnique that can tell me this classification algorithim is best for your case?

[links to papers that can clarify that will be much appreciated]

Last, for the scores of the prediction accuracy, how to judge if its good acuracy rate or not? some are using the probabilities of occurance of that grade number, but what if the number is continuase not ordinal ?

[links to papers that can clarify that will be much appreciated]

Hope to learn more from your experiance.

Best Regards,

Lubna

Similar questions and discussions