Download Microdata User's Guide - Microdata Analysis and Subsetting with SDA
Transcript
The mean deviation among the RP80s was 4.7, with a standard deviation of 15.3. This means that the average variation among the RP80s for the literacy tasks was 4.7 points on a 500-point scale, or less than 10 per cent of the 50 points making up a particular literacy level. In addition, a small number of items had large deviations, accounting for a significant percentage of this variation. Only 2 per cent of the deviations observed account for about 35 per cent of the average deviation. In other words, 98 per cent of the deviations have a mean of 3.0, or a 35 per cent reduction from the average of 4.7. Table 4.4 shows the average deviation of the RP80s for each of the 24 country or language groups; the average is seen to range from a low of 1.1 for the French-speaking Swiss to 10.3 for Hungary. Table 4.4 Average deviation of RP80 values by country or language group Australia Belgium (Flanders) Canada (English) Canada (French) Chile Czech Republic Denmark Finland 7.6 5.8 3.6 3.2 3.5 3.7 3.2 3.6 Germany Great Britain Hungary Ireland Italy Netherlands New Zealand Northern Ireland 5.3 5.2 10.3 4.5 3.8 3.4 7.2 6.9 Norway (Bokmål) Poland Slovenia Sweden Switzerland (French) Switzerland (German) Switzerland (Italian) United States 2.7 5.4 5.3 5.2 1.1 4.0 6.0 2.0 4.7 Conclusion One of the goals of large-scale surveys is to provide information that can help policy makers during the decision-making process. Presenting that information in a way that will enhance understanding of what has been measured and the conclusions to be drawn from the data is important to reaching this goal. This guide has offered a framework for understanding the consistency of task responses demonstrated by adults from a number of countries. The framework identifies a set of variables that have been shown to underlie successful performance on a broad array of literacy tasks. Collectively, they provide a means for moving away from interpreting survey results in terms of discrete tasks or a single number, and towards identifying levels of performance sufficiently generalised to have validity across assessments and groups. The concept of test design is evolving. Frameworks such as the one presented here can assist in that evolution. No longer should testing stop at assigning a numerical value; it should assign meaning to that number. And, as concern ceases to centre on discrete behaviours or isolated observations and focus is more on providing a meaningful score, a higher level of measurement is reached (Messick, 1989). 39 Culture, Tourism and the Centre for Education Statistics