My goal is to use large-scale assessment data (answers from the respondents to the items and the respective Q-matrix) to verify whether it is possible to infer prerequisite relationships among the skills assessed in a multi-year application of a national large-scale assesment (like NAEP or SAT) using perfomance data.
I researched some psychometric and data mining methods like Tatsuoka's Rule-Space which could be used to determine those relations. They have been used in intelligent tutoring systems, but in a much greater granularity.
Few skills and items are applied each year, but considering multiple applications, I might have a better picture with sufficient coverage, and, as a summative assessment, is a snapshot disregarding the learning present in the process.
Is there any study in the literature that claims such relationships from the flat model used in these assessments (although it is quite)? Is this a valid path?