0 Bewertungen0% fanden dieses Dokument nützlich (0 Abstimmungen)
296 Ansichten31 Seiten
Tertiary education requires applicants to pass the screening process set by schools. Aptitude tests are used to measure one's fundamental intellectual abilities. Schools use the Aptitude Test to predict future outcomes of students' performance.
Tertiary education requires applicants to pass the screening process set by schools. Aptitude tests are used to measure one's fundamental intellectual abilities. Schools use the Aptitude Test to predict future outcomes of students' performance.
Copyright:
Attribution Non-Commercial (BY-NC)
Verfügbare Formate
Als PDF, TXT herunterladen oder online auf Scribd lesen
Tertiary education requires applicants to pass the screening process set by schools. Aptitude tests are used to measure one's fundamental intellectual abilities. Schools use the Aptitude Test to predict future outcomes of students' performance.
Copyright:
Attribution Non-Commercial (BY-NC)
Verfügbare Formate
Als PDF, TXT herunterladen oder online auf Scribd lesen
Adelaida de Perio De La Salle University - Manila Background Admission to tertiary education requires applicants to pass the screening process set by schools. One of the assessment tools used to select their potential applicants is the use of aptitude tests. Aptitude tests are used to measure one’s fundamental intellectual abilities. Background The Abstract Reasoning is a non-verbal test measuring one’s ability to identify patterns in a series. Numerical ability on the other hand, measure’s one’s ability in solving mathematical problems. Verbal Reasoning measures one’s ability to understand analogies and covers areas in English language. The spatial ability measures one’s ability to manipulate shapes. Background Mechanical reasoning measures one’s knowledge of physical and mechanical principles. Lastly, spelling measures one’s ability to detect errors in grammar punctuation and capitalization (Magno & Quano, 2010). Review of Related Literature Because many studies link aptitude with academic performance, schools use the aptitude test to predict future outcomes of students’ performance. Long standing key predictors of academic success is students’ abilities measured by SAT or ACT, or high school GPA in predicting academic success (Covington, 1992; Lavin, 1965; Willingham, Lewis, Morgan, & Ramist, 1990). Review of Related Literature Garavila, Gredler & Margaret (1997) examined the extent to which college students’ learning strategies, prior achievement and aptitude predicted course achievement. Analyses showed that each of the predictor was significantly correlated with achievement. These variables accounted for 45% of the variance in course achievement. Review of Related Literature Garcia (1997) found the same results in his study examining the relations of motivation, attitude, and aptitude on second language achievement. The findings of the study revealed that aptitude (β=.43) Motivation (β =.41) and Ethnic Membership (β =.14) explained more than 50% of the variance in language achievement. Review of Related Literature In secondary education, little has been done to screen in students before entering the high school. This is the reason why some students lack the necessary skills and come unprepared to meet the demands and expectations of high school education. The use of an aptitude test therefore will not only serve as a screening tool but moreover, it will provide teachers with information on the areas students have to improve on. Objective The present study therefore aims to compare CTT and IRT results in evaluating the Aptitude Test developed for High School in terms of item difficulty, and item discrimination. Review of CTT and IRT The CTT model, also called the “True Score Theory” espouses the idea that responses of examinees are only due to variation of the examinee’s ability.
In CTT, item difficulty is indicated by the frequency of
responses; item discrimination is indicated by item total correlation; and frequency of responses is used to examine distracters (Impara & Plake, 1997). Review of CTT and IRT Traditionally, CTT has been used as a method of analysis in evaluating test although it has several limitations.
First, the person statistic or the observed score is item
dependent. Second, item statistics or the difficulty level and item discrimination are examinee dependent. The Item Response Theory answers these major limitations of the CTT. Review of CTT and IRT The Rasch model, which is also referred to as the IRT, estimates the probability of a correct response to an item as a function of the person’s ability and difficulty of the item. In IRT, each item in a test has its own characteristic curve which describes the probability of getting the item correctly or depending on the test taker’s ability (Kaplan & Saccuzzo, 1997). Review of CTT and IRT IRT asserts that the easier question, the more likely a student will be able to respond to it correctly, and the more able the student, the more likely he or she will be able to answer the question correctly as compared to a student who is less able. Rasch model is based on the assumption that guessing and item differences in discrimination are negligible (Anastasi and Urbina (2002). Method Participants A total of 63 incoming 1st year High School students, both male and female participated in the study. The participants in the study were composed of grade 6 students from different elementary schools in Manila. The participants have finished the grade 6 level and were applying in a Science High School. Age ranges from 11-13 years old. Method Instrument The Aptitude Test for High School was developed to measure fundamental intellectual abilities in abstract reasoning, verbal reasoning and quantitative reasoning. The instrument consists of a total of 100 multiple choice items. The AHP consists of 30 items for abstract reasoning; 30 items for numerical reasoning, and 40 items for verbal reasoning. Method Psychometric properties of the test show the following reliability estimates for each subtest. Obtained reliability coefficients for each subtest are .70 for abstract reasoning, .77 for numerical reasoning, and .78 for verbal reasoning. Method Procedure The test was administered to incoming 1st year high school students in a Science High School in Manila. The AHP was given as one of the assessment tools in their selection of potential applicants who will be accepted in the Science High School. A trained examiner administered the test for one hour. Data Analysis Data gathered were analyzed in terms of its reliability coefficients, item difficulty and discrimination using both CTT and IRT.
In terms of item difficulty and item discrimination using
the Rasch model, two samples were tested and compared. The following computer software was used: SPSS version 16, and Microsoft Excel version 2007, and Winsteps for the IRT. Results Reliability Indices Using the Classical Test Theory, reliability coefficients for abstract reasoning, numerical reasoning and verbal reasoning were as follows: .70, .77, and.78. Table 1 Summary of Person and Item Measure for Abstract Reasoning Person Input Measured Infit Outfit Cou OMN Score nt Measure Error IMNSQ ZSTD SQ ZSTD Mea n 21.8 30 1.33 0.5 1 0.1 0.94 0.1
SD 4 0 0.88 0.11 0.15 0.6 0.31 0.6
Real Person RMSE 0.51 True SD 0.72 Separation 1.39 Reliability 0.66
Person Input Measured Infit Outfit
Mea n 45.9 63 0 0.36 1 0.1 0.94 0
SD 10.2 0 1.09 0.14 0.11 0.8 0.22 0.9
Real RMSE 0.39 True SD 1.02 Separation 2.65 Item reliability 0.88 Table 2 Summary of Person and Item Measure for Numerical Reasoning
Results also reveal that in terms of item and person
separation, the sample can still be separated into groups and the test can still be divided into groups. Discussion In terms of item discrimination, the same items were found to have poor discrimination index for numerical reasoning and verbal reasoning using CTT and IRT. Therefore, these items should be subjected to revision.
However, for abstract reasoning 2 out of 5 items
considered poor using CTT was also considered poor using IRT. In terms of item difficulty, similar items considered difficult were seen using both models. Discussion However, there is discrepancy in the number of items considered difficult for both CTT and IRT. These findings suggest that there is a relative degree of stability across CTT and IRT in terms of item discrimination. Overall results showed that there appears to have consistency in the results using both CTT and IRT. Discussion However, in this study, one of the advantages of using the IRT over CTT was evidently seen. IRT is sample- free nature of its results. This means that item parameters are invariant when computed using different groups of different abilities. Thank you!