Sie sind auf Seite 1von 4

Standardized Testing and Personality Assessments

Chapter 12 introduces the reader to the true definition of statistics, without scaring them half to death. The book breaks statistics down in two parts: descriptive and inferential. The type that is dealt with in this chapter is descriptive statistics. The simple definition of descriptive statistics are that they are just numbers in different forms, for example, percentages, numerals, fractions, and decimals. The book gives an example of a grade point average being a descriptive statistic. It is becoming increasingly important for classroom teachers to be able to understand and interpret statistics because of increasing calls for acountablity. Being able understand various types of statistics, there uses and limitations, will put the educator who does at an advantage. Instead of just averaging grades, there are important questions that every educator should want to know the answers to regarding their classroom. Questions like, how many people are above average and how many scored above the cutoff passing score, are questions that can't be answered without some working knowledge of statistics. The bar graph explanation was pretty clear in the chapter, but that might be just because it's the most frequently used graph to convey statistical data. Everyone is familiar with the bar graph, but when it comes down to frequency polygons things get a little fuzzy. Chapter 13 explains how distributions can have the same values for the mode, median, and mean but are different in the way the scores are spread out. The variability estimate helps determine how compressed or expanded the distributions are. The range is the easiest way to estimate variability and its determined from subtracting the lowest score from the highest score. In the case of the range, things can get thrown off if an extreme score is present. One way of preventing this from happening is to use the semi-interquartile range. This score is determined by taking the middle 50% of the scores in a distribution. The upper 25% and the lower 25% are not entered into its final computation. Standard deviation is an estimate of variability that accompanies the mean in describing a distribution. You are taking a look at each distribution to see how far away each score deviates from the mean. The normal distribution is a type of symmetrical distribution that is mathematically determined and has fixed properties. It is basically used as a model to base statistical decisions. These are methods that can most definitely be used in the classroom when analyzing data. After going over normal distribution the book gets into z scores a nd t scores. I'm not sure about how much these concepts will actually get put to use in a classroom. Chapter 14 covers the last statistsical topic in the book, correlation. Correlation is the concept of seeing if two things none related are in fact related, not to confuse that with causation. In this case we are looking at two different distributions of scores and if they correlate. Do left handed persons make better grades than right handed persons? Do students who make high marks in elementary school do better in high school? These are examples of questions that can be answered by using the correlation coefficient. Distributions can correlate positively or negatively, and even not correlate at all. When high scores in distribution A are associated with high scores in distribution B or the same with low scores, there is a positive correlation. When high scores in distribution A are associated with low scores in distribution B or

low scores in distribution A are associated with high scores in distribution B, there is a negative correlation. Just because two things correlate does not mean that one of those variables actually caused the other. A correlation only shows that some kind of relationship exists between the two distributions. Usually the two variables correlated because of a third variable which is unidentified. Chapter 15 deals with validity as it applies to testing. Does the test accurately measure what it says it measures? There are different ways to determine whether a test has sufficient validity evidence. The first way the book discusses is content validity which is also the simplest. Content validity is done by examining the test to see if they go along with what the user has decided should be covered on the test. As long as the test matches the instructional objectives it appears to "look" valid, but it doesn't tell whether the reading level is too high on the test or whether the questions are written poorly. The second type of validity evidence is criterion related, it's established when scores from a test are correlated with an external criterion. The two types are concurrent and predictive. Predictive validity evidence refers to how well the test predicts some future behavior of the examinee. A test should measure what its supposed to measure. I have realized after this chapter that even though the tests I was giving to my students were made by the authors of the textbooks, they were not the most valid because some of the things I was emphasizing in the text were a lot of times not reflected on the tests given. I now know that I can't expect the results of the tests to be what they should if the tests are not valid. Chapter 16 is about how reliability, consistency of rankings over time, affects testing. The book covers three ways to estimate the reliability of a test. The three basic methods used the most are called testretest, alternate form, and internal consistency. Test-retest is simply giving the same test twice and finding out the correlation between the first set of scores and the second set of scores. Alternate forms or equivalence method is used when there are two equivalent forms of a test. They are both administered to a group of students, and correlations between the two sets of scores are determined. If there are large differences between the two sets of tests that supposedly measure the same thing, the test is unreliable. Internal consistency is used when only one basic concept is being measured. The test is then split into halves and then determining the correlation between the two. I understand now that if tests are not reliable, you end up getting results that are not accurate. Making decisions and judgments on inaccurate data can become disastrous and in many cases this is what happens. Chapter 17 deals with the accuracy of tests scores and errors dealing with testing. It is a known fact that no test is 100 percent reliable or valid, all tests are subject to error and test scores are almost always inaccurate. The author purpose for discussing error is to make us aware of the fallibility of test scores, and to sensitize the reader to the factors that can affect scores. The sources of error can range from the test takers, the test itself, test administration, and test scoring. All

of these categories were discussed thoroughly in the chapter. Examples of error from the test takers can be things like fatigue and illness. Errors within the test are common, and there are many ways to do so. Trick questions, reading level that's too high, ambiguous questions, and items that are too difficult are all things that contribute to sources of error which affect test reliability and accuracy. Most people have been in the situation of taking a test in a room that was too cold, but never actually thought about how that could play apart in the outcome of the test. Physical comfort, differences in instruction and explanations, and the attitude of the administrator all play apart in errors in test administration. Error in scoring has decreased because of the use of computerized test scoring, but even in this case, error can occur. Standardized tests are the topic of discussion for Chapter 18. Standardized tests are tests that are administered and scored according to specific and uniform procedures. They are constructed by test construction specialists, with the help of curriculum experts, teachers, and school administrators. The purpose of these tests is to determine the students' level of performance compared to the performance of other students of similar age and grade, or compared to state standards or criterion. These tests usually take years to construct, unlike teacher made tests which can be made in a few days. Standardized tests can be norm referenced or criterion referenced. Norm referenced test scores are determined by comparing their obtained scores to a nationally representative sample that also took the test under standardized conditions. All states give some type of high stakes tests, but it varies from state to state whether they are norm or criterion based. These tests are currently being used to compare districts, schools, and students over time. These tests are not as useful to the teacher as teacher made tests are, however an increase in accountability has made it necessary for teachers to administer and interpret them. Differences in student related factors have a large part to play in the results of standardized tests. When interpreting the results, the teacher must be able to take that into account. Since we have been introduced to what a standardized test is Chapter 19 goes into depth on the different types of standardized tests. The achievement test batteries are the most widely used standardized achievement tests. Standardized academic aptitude tests measure students potential for learning. These tests can determine whether students are achieving up to their potential, and are used to predict future learning. These tests became known as IQ tests and instead of measuring potential for learning, people started to see them as measuring intelligence. After the age of six, IQ scores remain rather stable for the rest of a persons life. Prior to age six they tend to be unstable. Personality assessments are the least used in the classroom setting. Personality is defined as the typical and characteristic ways people behave. Objective personality assessments usually are done by self reported questionnaires and the items usually based on questions used in psychiatric interviews. In most cases its not the individual responses useful for the interpretation, but the patterns of responses is used instead. Projective personality assessments involves requiring examinees to respond to unstructured or ambiguous stimuli.

How to Cite this Page


MLA Citation: "Standardized Testing and Personality Assessments." 123HelpMe.com. 04 Jan 2014 <http://www.123HelpMe.com/view.asp?id=167003>.

Das könnte Ihnen auch gefallen