| Sign In to gain access to subscriptions and/or personal tools. |
Contributions of Qualitative Research to Research on Teacher QualificationsMichigan State University
The influence of teachers qualifications on their teaching practice has been subject to debate. Literature reviews do not settle these debates, partly because the literature is uneven and partly because reviews capture only narrow slices of literature. In particular, many reviews eliminate qualitative studies. Yet without examining qualitative evidence, variations in quantitative findings are difficult to interpret, disappointing findings are difficult to understand, and plausible explanations of patterns are in short supply. The present review focuses on qualitative studies and compares their findings with those from quantitative literature. The author finds that the qualitative literature agrees with quantitative literature in its inability to distinguish between teachers with different types of certificates or different teacher education backgrounds. On the other side, the author finds more evidence of benefit from content knowledge than quantitative studies have typically found. The qualitative studies also reveal competing influences and offer hypotheses about why the outcomes look the way they do.
Key Words: teacher qualifications qualitative methods teacher education THE volume of research on teachers qualifications has grown to several hundred studies, but it has not settled arguments about the merits of teacher education programs. One reason for this stalemate is that the question is difficult to address, and virtually all studies are susceptible to confounding variables. For instance, teachers self-select their educational programs; consequently, their own values and predispositions are confounded with their credentials. And once certified, they engage in nonrandom job-seeking practices, while districts engage in nonrandom hiring practices, so the resulting pattern of job placements confounds teachers educational backgrounds, credentials, attitudes, and predispositions with the communities they serve (Boyd, Lankford, Loeb, & Wykoff, 2003; Lankford, Loeb, & Wykoff, 2002; Strauss, 1999). Kennedy, Ahn, and Choi (2008) call the resulting allocation of teachers to schools a system of affinity assignments, meaning that teachers social backgrounds and qualifications often match the social backgrounds and "qualifications" of their students. The problem is compounded when literature reviews use different inclusion rules. Some reviewers (e.g., R. Greenwald, Hedges, & Laine, 1996; Wayne & Youngs, 2003; Wilson, Floden, and Ferrini-Mundy, 2001) do not reach beyond journal articles, whereas others cast a broader net. Some consider only studies that use student achievement as their outcome measure (e.g., R. Greenwald et al., 1996; Wayne & Youngs, 2003), whereas others include an array of indicators of quality teaching (Rice, 2003; Wilson et al., 2001). Some (e.g., Kennedy et al., 2008) require that outcomes be measured after teachers have finished their education and are employed as teachers of record, whereas others (e.g., Wilson et al., 2001) include studies whose outcomes were measured while teachers were still participating in student teaching. Some are unclear about their inclusion rules (e.g., Walsh, 2001). Finally, each review also imposes its own criteria for study quality. The number of studies included in reviews of literature on teacher qualifications ranges from 21 in Wayne and Youngss (2003) review to 200 in Walshs (2001) review. In part because different readers of this literature focus on different subsets within it, we also can see very different conclusions arising as different authors perceive the available evidence in very different ways. Here are some illustrative quotes: More than 200 studies have found that teachers who have more background in their content areas and have greater knowledge of teaching and learning are more highly rated and more successful with students in fields ranging from early childhood and elementary education to mathematics, science and vocational education. (Darling-Hammond & McLaughlin, 1999, pp. 377–378) These quotes reveal that the relationship between teacher qualifications and teaching quality—a relationship that should be self-evident—is not self-evident at all. Moreover, the field lacks coherent theoretical arguments about how qualifications are expected to influence teaching practice. This is an area, therefore, where qualitative research could make a strong contribution, for qualitative research is more able to delve into the mechanisms and processes by which these qualifications actually influence teaching practice. My contribution to this debate is to add qualitative studies to the corpus of literature under review. Because these studies have rarely been considered in reviews, I also offer a framework for evaluating the validity of their causal claims. And to gauge the import of their contribution, I contrast their conclusions with those from other literature reviews, and I ask whether they help fill the theoretical gap regarding how knowledge and educational backgrounds influence teaching practice. Most reviews exclude qualitative research, either because its outcomes are not quantifiable or because of a perception that qualitative studies are useful for learning about social meanings but not about causal influences. Yet many qualitative studies do examine the influence of policies or programs on teaching practice, and they can help us understand these influences in a way that quantitative studies cannot. Maxwell (2004a, 2004b) has argued that qualitative studies address causation not by establishing patterns of regularities, as quantitative studies do, but by revealing the mechanisms and processes of influence. In the case of teaching, qualitative studies can help us see how a teachers knowledge either misdirects practice or enhances it and see where and how educational backgrounds have their influences, if they do. Qualitative research is also the preferred method of teacher educators, as evidenced by the large fraction of qualitative studies in journals such as the Journal of Teacher Education, so when reviewers exclude this literature, they are excluding the bulk of work that has been done by teacher educators themselves. Qualitative research has been a staple of social sciences for some time but ascended in educational research in the 1980s. Since then, researchers have struggled to articulate the differences between quantitative and qualitative research and to articulate standards of validity for qualitative research. It has often been referred to not merely as an alternative method but as an alternative paradigm, with different assumptions, methods, rationale, and purposes. Some reviewers (e.g., Wilson et al., 2001) label this work "interpretive," suggesting that the nature of ones data automatically implies an alternative epistemology. Others (e.g., Becker, 1996) insist that although qualitative and quantitative approaches are different, their epistemologies are not. Maxwell (2004a, 2004b) argued that the two approaches are subject to many of the same concerns regarding validity but that the former relies on variance and regularities to establish causality, whereas the latter relies on revealing the causal mechanisms themselves. Donmoyer (2001) argues that there is no single way to characterize qualitative studies, for they vary substantially in their aims. He suggests that they be defined by their purpose rather than by their epistemologies or paradigms. Included in his list of purposes is one called "truth-seeking." Truth-seeking qualitative studies differ from those whose purpose is, say, to create a personal interpretation and those whose purpose is social change. The fact that meta-analysts are now seeking approaches to incorporating qualitative studies into their work (see, e.g., Hundersmarck, 2004; Au, 2007) suggests a growing interest in the truth-seeking potential of qualitative studies. The central question guiding the present review is whether and how truth-seeking qualitative studies can amend, clarify, or otherwise add to the knowledge gained from the larger quantitative body of literature regarding the influence of teachers qualifications on the quality of their teaching practices. All of the studies reviewed here examine the influence of teachersknowledge or educational backgrounds on the quality of their teaching practice, and all seemed to expect their findings to be generally relevant. In this sense, they fit Donmoyers (2001) category of truth-seeking studies. The article has three main sections. In the first, I describe the literature search process. In the second, I examine methodological issues associated with truth-seeking qualitative studies and offer some criteria for evaluating their validity. In the third, I review the findings from these studies. I organize the findings around different kinds of qualifications.
This review is based on literature in the Teacher Qualifications and the Quality of Teaching (TQQT) database. This database contains a collection of research articles, reports, dissertations, conference presentations, and books that examine the relationship between at least one teacher qualification and at least one indicator of the quality of teaching, with qualifications defined as aspects of education or certification that have been the target of education policy. Literature in ERIC, PsycInfo, and EconLit was searched using more than 20 terms to characterize teachers educational backgrounds: teacher knowledge, test scores, preparation, education, courses taken, and so forth. The search also included more than 20 terms to characterize evidence of teaching quality: authentic practice, quality of practice, student achievement, classroom practice, classroom activities, and so forth. Studies had to be published after 1960 and had to investigate K–12 teachers of record in the United States. Studies of student teachers were eliminated, as were those that placed teachers in artificial settings or in simulations. This last requirement eliminated a number of small-scale experimental studies. Each study also had to establish a link between the qualification and the quality of practice, rather than simply describing both and asserting that they were related. Here are two examples of qualitative studies that were rejected for failure to establish a link. In one case (Pool, Ellett, Schiavone, & Carey-Lewis 2001), the authors purported to test the link between National Board Certification and teaching quality. They drew a sample of six board-certified teachers, interviewed them, and observed their practices. They found that the teachers practices were quite variable and argued that this variability constituted evidence that board certification was not associated with teaching quality. We rejected the argument that variability within a group was, by itself, evidence that group was not distinguished relative to the population as a whole. In another study, Wozniak (1990) set out to learn the backgrounds of a sample of K–12 art teachers who had been identified as outstanding. The author then reported common themes and argued that the characteristics she found must have been responsible for the quality of teachers practice. Nothing in the study showed readers how these teacher characteristics actually fostered teaching quality, nor did the author examine a comparison group of less outstanding teachers to see if they lacked these qualifications or had them. Notice that in both of these cases, studies were rejected because of the logic of their claim, not the quality of the evidence. The results of this search-and-screen process was a database of more than 450 study reports, including 23 studies describing findings that were primarily qualitative but that spoke to the causal question of whether or how teachers knowledge or educational backgrounds influenced the quality of their teaching practice. Below, I discuss methodological issues uniquely associated with these 23 qualitative studies.
A great deal of ink has been spilled in the cause of defining validity for qualitative research, and my aim is not to review all of that here. Instead, I concern myself with validity issues associated with a narrow slice of qualitative research that is truth seeking in its focus and that specifically addresses the influence of teachers knowledge and educational backgrounds on the quality of their teaching practices. For this group of studies, criteria for validity are very similar to those faced by quantitative researchers: Studies must show evidence that an influence actually occurred or failed to occur, they must persuade readers that the observed teachers are not idiosyncratic in some way, and they must acknowledge, and try to eliminate, any alternative explanations of their findings. I address each of these issues below, beginning with the alternative explanations that must be overcome and then reviewing study designs and analytic strategies that these researchers employed. The issues I enumerate below are not intended to apply to all truth-seeking qualitative research but to the specific studies in this database.
Alternative Explanations for Qualitative Findings
Accommodation bias
Confirmation bias
Hindsight
Reactivity
Misspecified influences
Confounded contexts
Establishing Influence in the Face of Alternative Explanations
Between-teacher comparisons
Within-teacher comparisons
Longitudinal and content-matching studies
Teacher attribution of learning
Detailed descriptions of the influence itself
Establishing Representation With a Small Sample Critics of qualitative research often argue that these small samples lack value because they automatically deny any possibility of representativeness. And some qualitative researchers apparently agree, for instead of making a case for the representativeness of their samples, they rely instead on Yins (1989) argument that qualitative studies do not generalize to populations but only to theory. This argument does not absolve researchers, however, for readers will necessarily ask, Who are these teachers and where did they come from? Readers need to know what the researchers saw as important in a prospective research subject, and they need to know the kind of teaching assignments, school environments, and students that sampled teachers contended with. This kind of information can help readers make their own judgments about the logic of any causal conclusions, about potential sources of bias in the study, and about the theoretical relevance of the findings as well. In fact, the most widespread weakness in the studies reviewed here was a general failure to articulate how teachers were selected. Four studies gave no indication at all of how cases were selected, one based selection on geographic proximity, and another stated explicitly that the study capitalized on a serendipitous event. Many of the researchers already knew their sampled teachers because they had taught them, and yet they failed to say why they selected these graduates rather than some others.
Findings are presented according to the particular type of qualification examined. I found studies examining the following: (a) influence of content knowledge that is stipulated on the basis of a credential, (b) influence of content knowledge as assessed by the researcher, (c) influence of particular teacher education courses, and (d) influence of broader patterns of courses or entire teacher education programs.
Influence of Stipulated Content Knowledge In this section I review three studies that examine the role of content knowledge when the knowledge itself is not directly assessed but instead is stipulated by the presence of a degree or credential. The first study involves an author contrasting his own teaching in the field for which he felt prepared and in another field where he felt less qualified. The second study compared a teacher with professional work experience as a scientist with one who had no applied experience and who had majored in a different branch of science. The third compared two secondary English teachers, one of whom was certified to teach English, the other of whom was certified to teach a different subject. These studies are summarized in Table 1.
In the first study, Carlsen (1997) compared his own teaching in a subject he knew well (biology) with his teaching in a subject he knew less well (chemistry). I consider this to be stipulated knowledge because Carlsen did not provide any evidence of his knowledge other than to say that he had it. To see how his knowledge influenced his practice, he tape-recorded his lessons so that he could systematically examine the transcripts. He was particularly interested in the role that his questions played in controlling conversation and in reinforcing the teachers authority. He felt that an overreliance on questions could create an inquisition atmosphere in the classroom and that an overreliance on recall questions, without attention to rationale, reduced the authority of the content. In his comparison, Carlsen saw that he used remarkably more questions when teaching chemistry than when teaching biology and, in particular, that he used more low-level, recitational questions. He also found that his chemistry lessons were much thinner in terms of revealing warrants for knowledge claims, so his chemistry students learned less about how facts are generated and evaluated than his biology students learned. He theorized that the difference between his questioning tendencies in chemistry versus biology occurred because he himself was more uncertain of the content and therefore relied on questions that tacitly discouraged, rather than encouraged, student discussion. The second study (Powell, 1997) contrasted two novice teachers who graduated from the authors own teacher education program. Both had completed their teaching degrees and had taken positions teaching earth sciences to lower-track high school students. They also shared many teaching values: They both espoused progressive pedagogies, both viewed science as a way of thinking and reasoning that students should experience firsthand, and both wanted to create a science curriculum that was embedded in activities rather than in the textbook. However, Jills content expertise was in biological sciences, not earth sciences, whereas Dan had a masters degree in geology and had been a field hydrogeologist for 6 years before entering the teacher education program. Powell found that both teachers had difficulty implementing their vision. Jill had difficulty because of her lack of knowledge of earth science. Dan, who had greater content knowledge and practical experience as well, did not know how to transform his deeper content knowledge into meaningful learning activities. So both teachers, despite their differences in content knowledge, ultimately compromised their plans to make science more active and meaningful to students and instead relied heavily on the textbook and on recitational teaching. In the third study, Ringstaff and Sandholtz (2002) compared two secondary English teachers, one teaching out of field (Brian, certified in science) and one certified in English (David). Both graduated from the same university, participated in the same teacher preparation program, were 1st-year teachers, and received little support from their colleagues. The authors watched them teach Cannery Row and found that the presence of a teaching certificate in English did not promote better teaching. Brian, the science major, taught a low-track class but set high goals for his students, whereas David, the English major, taught a college preparation class but deemphasized literary analysis and focused instead on low-level knowledge. The authors conclude that the official degree is not a perfect indicator of content knowledge, nor of intellectual interest in the content, for Brian was an avid reader. Second, they note that participation in teacher education is not a perfect indicator. Both teachers attended the same teacher preparation program, but Brian was generally more able to manage his students and garner their interest in the content. Third, time available for planning could have been a complicating factor, because David had four preparations per day whereas Brian had only two.
Influence of Assessed Content Knowledge
Elementary mathematics Three studies examined the influence of teachers content knowledge on teaching practices in elementary mathematics. The first study examined 20 experienced teachers who had been in the same school for at least 3 years. Blasquez (1998) used both quantitative and qualitative methods to see the extent to which teachers understanding of mathematical concepts influenced their ability to use more constructive pedagogies. She used a set of structured tasks to assess their understanding of mathematical concepts and used a short questionnaire to learn their beliefs about procedural versus constructivist approaches to teaching. In her quantitative analysis, Blasquez found that teachers practices were more strongly associated with their pedagogical beliefs than with their mathematical knowledge. For the qualitative portion of the study, Blasquez selected three pairs of teachers matched in their beliefs about constructivist versus procedural instruction, but differing in their understanding of mathematical concepts, and then observed their lessons. Within each pair, Blasquez was able to see that the teacher with higher conceptual understanding was more likely to ask students to justify their ideas, more likely to extend or elaborate on student ideas, and more likely to monitor for understanding. So her qualitative analysis allowed her to see an influence that her quantitative work could not reveal. At the same time, however, the author was no longer "blind" for this portion of the study and knew which teacher belonged in each knowledge and belief category, so the things she noticed and wrote about could have been influenced by her own confirmation bias. The second study (Buckreis, 1999) was a within-teacher comparison of a fourth-grade teacher, Meg, selected because she had experience with the fourth-grade curriculum but also because her understanding of one mathematical concept, multiplication, was much stronger than her knowledge of another concept, division. Buckreis evaluated Megs knowledge by asking Meg to list subtopics that fell within each of these two general curricular domains and to provide sketches or diagrams as needed to show how the topics related to one another. Megs knowledge of division was both faulty and incomplete on several topics, including the different meanings of division, the conceptual underpinnings of division procedures, and the idea of divisibility itself. The difference between her understanding of multiplication and of division made Meg an ideal candidate to study the influence of subject matter knowledge on teaching. In his observations, Buckreis saw that Meg did not provide students with a complete development of the full range of division situations. Moreover, at the conclusion of the observations, in a posttest, Buckreis found that students had significantly more success with multiplication problems than with division problems. In the third study, Stein et al. (1990) studied a fifth-grade teacher, Mr. Gene, as he taught a 25-lesson unit on functions and graphing. To ascertain the teachers knowledge of this content, these authors relied on an interview and a cardsort task, which was also administered to a mathematics educator, so that the teachers knowledge could be defined relative to a presumed experts knowledge. The contrast allows them to see that Mr. Genes conception of functions was missing important ideas about functional relationships and that much of his thinking was relatively superficial. For instance, in the card sort, Mr. Gene sorted cards by their format (equations vs. graphs, for instance), whereas the mathematics educator sorted them according to the nature of the relationship that they portrayed, regardless of the format used to portray the relationship. However, the authors also suspected that the teachers understanding of this content had become somewhat distorted by a metaphor he adopted from his textbook. Mr. Genes textbook frequently presented problems based on a metaphor of a function machine that produced values of Y for each value of X it received. Students would be asked to predict outputs for different inputs. The problems also focused on point estimation more than on functional relationships as a whole. When plotting ordered pairs, Mr. Gene emphasized the benefit of graphs to check for errors in predicting individual values and gave less attention to the nature of the function itself. Although the authors thought that Mr. Genes representation of functions was influenced by the textbook metaphor, they also argued that his less sophisticated understanding of functions led to several missed teaching opportunities and led him to view the value of this unit in terms of strengthening arithmetic knowledge, rather than providing a groundwork for future content learning. In a sense, these three studies all test a relatively commonsense hypothesis that "you cant teach what you dont know." However, they also provide a more nuanced notion of what content knowledge actually looks like. In each case, the missing knowledge was not discrete facts but, rather, seemed to be the structural relations among concepts. At the same time, two studies were susceptible to confirmation bias. Moreover, their references to what the teacher failed to say to their students are likely influenced by hindsight. The Buckreis study overcame both of these potential weaknesses by augmenting observation data with student test data. All three studies could have been strengthened if they had more explicitly sought evidence regarding alternative explanations for what teachers failed to say. What if, for instance, Mr. Gene had been interviewed after a lesson in which he failed to say something, and he volunteered in the interview that in retrospect he realized he should have mentioned this issue? In this case, we would have to attribute the failure to speak as caused by something other than missing content knowledge.
Elementary and middle school science The second study (Magnusson, Borko, Krajcik, & Layman, 1992) examined the relationship between teacher knowledge and gains in student content knowledge in the context of a computer-based laboratory unit on heat energy and temperature. The study used a logic that is more akin to quantitative studies than to qualitative: Researchers did not observe classroom practice at all but focused entirely on qualitative assessments of both teacher knowledge and student knowledge before and after the unit. The knowledge of six eighth-grade teachers and a sample of their students was assessed via structured tasks and interviews whose transcripts were coded for correct or incorrect statements and for misconceptions. The authors then present complex tables showing patterns of what teachers knew and what their students learned. For example, they show that when teachers offered incorrect knowledge or misconceptions on the structured tasks, their students learned less during the year. And when teachers lacked knowledge of the misconceptions their students might have, their students learned less. However, the single greatest predictor of student learning was not what teachers knew, according to this assessment, but rather the number and type of learning activities they said they provided for their students. Although it may be the case that these learning activities derived from teachers knowledge, they apparently did not derive from the knowledge that the researchers assessed. No clear path between knowledge and practice was established. These two science studies offer an interesting contrast because they provide such different approaches to evidence and inference. Smith inundated us with examples of specific classroom events, providing a two-volume study of a single teacher, whereas Magnuson and colleagues provided only pre- and postinterviews with no observations. Smiths study shows us where and how Megs knowledge influences her teaching practice, whereas Magnuson and colleagues show us a pattern of regularities that suggest student learning is more influenced by learning activities than by the teachers knowledge. Any number of interpretations can be offered for the Magnusson data—that the teachers knowledge really is not relevant, that teachers with greater or better knowledge were more likely to spend time teaching these topics, that teachers with greater insight into their students comprehension were more likely to teach the additional units, and so on. A more in-depth approach to qualitative work might have provided insights into these issues.
Secondary science In the second study, Gess-Newsome (1992) examined "subject matter structure" (SMS), referring both to the central concepts and themes in biology and to how different specific topics relate to one another and to these central concepts. Focusing on five secondary biology teachers, Gess-Newsome deciphered the SMSes that were implied in their textbooks, the SMSes that were conveyed through teachers classroom presentations and discussions, and the SMSes that teachers revealed in a postinterview about the relationship among curricular units and the teachers rationale for a variety of other teaching decisions. Analyses of classroom lessons focused on explicated relationships among curricular topics and on explicit themes. Ultimately, the author found that these observed SMSes were not strongly related to teachers own maps of the content. Only one teacher demonstrated a direct translation of his own understanding of the material into his instruction. The remaining four differed in their desire to provide such an organizing framework to their students. However, even when this was not their explicit goal, teachers with greater knowledge of relations among topics did spontaneously make integrative connections and provided real-life examples during their classroom lessons. The third study of secondary science teacher knowledge focused on teachers understanding of the nature of scientific knowledge (NSK; Lederman & Zeidler, 1987). These authors recruited 18 experienced biology teachers and assessed their knowledge with a set of Likerttype scales asking teachers to agree or disagree with statements describing scientific knowledge. Items were then aggregated into subscales measuring teachers perceptions of the extent to which scientific knowledge is amoral, parsimonious, testable, tentative, creative, and parsimonious. Classroom observers had no knowledge of teachers scores on the NSK scales, thus protecting themselves from possible confirmation bias. Ultimately, these authors used quantitative procedures to test for relationships between teachers knowledge and their practices. They ranked teachers according to their scores regarding the nature of scientific knowledge and then looked at the patterns in the messages that different teachers conveyed to their students about these issues. From the classroom data, they generated 44 categories of messages but found only 1 of them to be statistically significantly related to teachers NSK scores. Because statistical significance typically means there is 1 chance in 20 that the finding is because of chance alone, and because 44 tests were made by these authors, chance alone would lead us to expect at least two "significant" links in this set of data, thus suggesting that their measure of teacher knowledge was not related to any of their measures of classroom practice. These three studies offer an interesting comparison in part because they focus on such different aspects of scientific knowledge and in part because they use such different approaches to ascertaining influence. Cunningham (1995) looks at the sociology of science, Gess-Newsome (1992) at how knowledge is organized, and Lederman and Zeidler (1987) at characteristics of scientific knowledge. The presence of such diverse approaches to defining and assessing content knowledge illustrates an advantage of studying teachers knowledge qualitatively, for qualitative approaches allow more nuanced notions of knowledge. Yet as a set, the findings do not provide a strong case. Cunningham found an effect of sociological understanding, but her finding is susceptible to confirmation bias, and the other two authors found little or no influence.
Summary of studies focusing on content knowledge With respect to the first question, the findings are uneven across the set of studies. Three of 11 studies showed a clear and unambiguous influence whose validity was not threatened in an obvious way. Two others found an unambiguous lack of influence, and the rest either found weak influence or were compromised by threats to their validity. That these findings are as uneven as those of their quantitative brethren suggests that the problem of establishing this relationship may not be a methodological problem. However, one difference in study methodology is worth noting. The two studies that saw no visible influence were actually hybrid qualitative–quantitative studies. Both started with qualitative data but then coded and quantified it to a point where they could present tabular findings. It is possible that these efforts to standardize their data resulted in masking the very things the authors sought. So how do these studies add to our understanding of the role of teachers content knowledge? They do offer a variety of insights and hypotheses for further investigation. For example, Stein et al. (1990) thought their teachers knowledge of functions and graphs had been influenced by the function machine analogy his textbook used as a pedagogical tool, thus hinting that practice may influence knowledge even as we seek evidence that knowledge is influencing practice. Other authors, such as Blasquez (1998) and Gess-Newsome (1992), found that teachers beliefs and goals for teaching content had greater influences on practice than did knowledge, a possibility that also raises questions about how knowledge, beliefs, and values may influence one another. Both of these insights also remind us that knowledge is not a fixed resource that teachers repeatedly draw from over time, as chalk is, but rather is constantly changing in response to context, experience, beliefs, and values. Perhaps the very question of how knowledge influences practice is ill conceived. It assumes knowledge is a relatively fixed entity that can be called on at will and overlooks the role of goals, beliefs, and spontaneous interpretations of events in determining what teachers do at any given moment.
Influence of Specific Teacher Education Courses Three studies examined the influence of specific teacher education courses. One (Boedecker, 1997) examined the influence of science methods courses; the other two (Artiles et al., 1998; Causey, Thomas, & Armeto, 2000) examined the influence of courses that were designed to foster cultural sensitivity in their students. All three studies followed students from their teacher education programs into their first teaching jobs and looked for evidence of course teachings in either the teachers practices or their reasoning about their practices. The studies are summarized in Table 3.
Boedecker (1997) was interested in a secondary science methods course that emphasized the use of hands-on and inquiry-based pedagogies. Her study focused on three graduates of her own teacher education program, interviewing teachers directly about program influences and also observing them each four times. Boedecker saw some examples of constructivist approaches but more often saw lectures. In their interviews, teachers attributed their pedagogy to the science courses they had taken rather than their science methods courses. Boedecker believed that the science methods courses were more consistent with National Science Education Standards and speculated that teachers would be more able to implement these standards if their science teachers had used them. The next two studies were interested in the effect of courses designed to alter cultural attitudes. In one study, Artiles and others (1998) followed two novice bilingual teachers who had been students in a course that the authors themselves taught. The students were followed through 2 years of full-time practice. These authors conducted in-depth interviews and stimulated-recall interviews and also asked teachers to create concept maps on several occasions. The article presents great detail about the concept maps and how they changed over time. From these multiple sources of evidence, the authors concluded that some important transformations had occurred in their thinking over time but not in all domains. For example, teachers added knowledge details to their maps but often did not change superordinate beliefs, and their rationale for teaching decisions often did not match self-reported pedagogical beliefs. The authors concluded that the teacher education program gave them general ideas but not the tools they needed to cope with the demands of a culturally diverse classroom. Finally, Causey et al. (2000) were interested in a middle school social studies methods course designed to foster a more culturally sensitive attitude. They identified a set of beliefs that entering students tended to hold and that they hoped to alter. In particular, they felt students did not appreciate their own privileged positions. In their study, they examined student reflections throughout the course and found that their beliefs did not change much. However, two students did appear to have genuinely changed, and so they followed these two students after they graduated. The follow-up consisted of one observation and a stimulated-recall interview each year. The authors found that one of the two students had reverted to her former, less culturally sensitive beliefs, but the other appeared to have sustained the changes she had made during the class. However, this teacher worked in a predominantly White, middle-class suburb—where her new beliefs may not have been challenged—whereas the teacher who reverted taught in a more diverse, lower-middle-class school. Ultimately, then, the course had no discernable effect on the practices of urban teachers.
Influence of Whole Teacher Education Programs Just as it would be a mistake to assume that teachers education consisted solely of their specific teacher education program, so would it be a mistake to assume that any of these studies examines the impact of an entire educational program. Most study one aspect of the program, such as the science preparation program or the literacy preparation program. Nine studies are described here and are summarized in Table 4. These studies were quite various, focusing on elementary literacy, mathematics, and urban education and on secondary science, English, and physical education.
Elementary reading and literacy Two studies followed teachers who had studied in elementary literacy programs. One study, the Beginning Teacher Study (Flint et al., 2001; Hoffman et al., 2005; Maloch et al., 2003) was largely quantitative (described in Hoffman et al., 2005) but included some qualitative reports as well (described in Flint et al., 2001, and Maloch et al., 2003). For the qualitative component, researchers conducted structured telephone interviews with 1st-year teachers who had graduated from several programs identified as exemplary approaches to literacy teaching. They also interviewed teachers who had earned generic elementary credentials at the same institutions. The researchers wanted to learn if teachers self-reported practices differed as a function of the particular institution they attended or the type of program within the institution. The study included more teachers than most qualitative studies (42 in Flint et al., 2001; 101 in Maloch et al., 2003) but much less intense data collection. No face-to-face interviews were conducted nor any direct observation of practice. However the interviews were repeated three times during the course of the teachers 1st year of full-time teaching. Analyses of themes suggested that graduates from exemplary programs felt more confident in their ability to teach reading, were more likely to mention specific features of their programs that they valued, and were more likely to base instructional decisions on student needs rather than curricular mandates. Because these findings derive entirely from self-reported practices, they are highly susceptible to accommodation bias. All interviewees understood that they were being asked by representatives of their alma maters to evaluate the quality of their preparation programs. The second study followed two teachers longitudinally and focused on the interaction between program influences, teachers beliefs, and teachers practices. Deal and White (2006) describe the evolution of teacher thought from preservice student teaching through the 1st year of professional practice, and Deal and White (2005) describe the 2nd year of professional practice. These researchers begin with the assumption that teachers a priori beliefs have a great deal of influence on practice and so focus their study on whether or how the teacher education program influenced teaching decisions relative to school context and personal beliefs. They interviewed teachers about influences on their decisions and then sorted these into main categories, for example, students, institutional rules or norms, program ideas, personal beliefs, and so on. During student teaching and 1st year of teaching, teachers made numerous references to their teacher education program as an influence, but these references virtually disappeared by the 2nd year of professional practice, and decisions seemed to be dominated by contextual influences. However, the program emphasized the importance of attending closely to student thinking, and in the authors analysis, students were categorized as part of the school context. Thus if teachers said their decisions were influenced by their students, the researchers categorized the influence as related to context rather than to the program. It is unfortunate that they did not present more specific examples of how teachers perceptions of their children influenced their decision making so that potential program influences might be more apparent. Ironically, this apparent lack of program influence may be a case of confirmation bias in the sense that the researchers expected to find stronger influences from context and prior beliefs than for their program.
Elementary mathematics
Elementary urban education
Secondary science
Secondary English The second study of elementary and secondary English teachers (Grossman et al., 2000) involves a longitudinal study of 3 novice teachers followed from their student-teaching year through their first 2 years as professional teachers. The authors set out specifically to address the problem of why teacher education programs lack a stronger influence on practice. They studied 3 teachers selected from a larger sample of 10 but do not say how either the 10 or the 3 teachers were selected. Data collection consisted of five observations and interviews per year. The authors found that teachers adopted instructional concepts, such as scaffolding, a process orientation to writing, and the value of ownership in writing, from their teacher education programs but that they had to go elsewhere to find the practical tools they needed to actually teach. They depended heavily on district curriculum materials even when these were antithetical to the programs concepts yet did not abandon program concepts. From these observations, the authors conclude that teacher education can play an important role in helping teachers, for instance, by providing teachers with a vision of writing instruction, but that teachers would benefit more if programs also provided specific instantiations of how these visions could be realized.
Physical education
Secondary math
Summary of studies of teacher education But the qualitative studies do allow us to see more clearly why program influences yield the patterns that they do. Several themes appear across these studies. One is that teachers are very strongly influenced by the specific circumstances in which they find themselves—the students they serve, the curriculum and other materials at their disposal, organizational constraints and norms in their buildings. Another is that program influences are mediated by teachers prior beliefs and sometimes are entirely revised by these beliefs. Indeed, these two themes are so pervasive that the more recent publications reviewed here begin with the assumption that program influences will be overwhelmed by these other influences. The third important theme is that program content tends to lack concrete examples. Several authors found that teachers claimed to embrace program concepts but that they were unable to translate those ideas into practice. These themes are not new to most teacher educators. Studies of student teaching conducted in the 1970s pointed to the strong influence of local contexts as counteracting the influence of teacher education programs (see, e.g., Hoy & Rees, 1977; Ryan et al., 1979; Zeichner, 1980). I eliminated studies of student teaching from this review, but the similarities between the literature reviewed here and the literature on student teaching suggest that the teacher education community has been aware for several decades that its programs were not powerful enough to alter school norms.
The current spate of arguments about the knowledge needed for teaching, and about the value of teacher education for teaching, began in earnest after the National Commission on Teaching and Americas Future (1996) released its report, "Teaching and Americas Future." That report spawned a vigorous debate about the importance of teacher education and certification (Ballou & Podgursky, 1998 Ballou & Podgursky, 2000a, 2000b; Darling-Hammond, 2000) that continues today. The studies reviewed here are accompanied by hundreds of quantitative studies that have also sought associations between teachers knowledge or educational backgrounds and the quality of their practice. Each set of studies is susceptible to its own alternative explanations of findings, so each literature is difficult to review and summarize. Qualitative studies offer several advantages for looking at these relationships. Qualitative researchers tend to have more detailed and nuanced assessments of teachers knowledge and to know more about what teachers actually learned in their educational programs and when they completed those programs. Quantitative researchers often have only rough indicators of knowledge: tallies of courses taken or test scores but no knowledge of the content of those courses or tests. Moreover, quantitative samples often include teachers who completed their educational programs at wildly different times in the past. That the two sets of studies yield messy and ambiguous conclusions attests to the difficulty of sorting out causal influences in a dynamic world. It should be noted, too, that the value of qualitative studies for questions of causal influence has received relatively less attention, and as a result, less attention has been given to explicit design decisions. I have tried to facilitate progress along these lines by outlining some of the prominent alternative explanations that researchers need to anticipate and control. Probably the most disappointing aspect of these research reports is their lack of detail about sampling rationale. Yet when samples were justified, their rationales contributed to our grasp of the findings. For instance, when Causey et al. (2000) say that the two teachers they followed were the only two who demonstrated change during their participation in class, we understand that their transition into professional practice could represent the best possible program outcome, not a random program outcome. When Cady et al. (2006b) tell us they followed teachers whose initial beliefs were differentially aligned with program goals, we understand that we will be learning how the programs messages become differentially translated into practice by these two teachers. But when Grossman and others (2000) say they studied 3 teachers from a sample of 10, without saying how they selected either the 10 or the 3, we do not know if the sample represents the best case of learning from teacher education or if it represents some particular kind of learning from teacher education. One also wonders how much more might have been learned from these studies if qualitative researchers were more self-conscious about controlling for confounding variables, documenting contexts and mitigating circumstances, and taking a more skeptical stance toward their own interpretations and evidence. Fifteen of the 23 studies reviewed here were compromised by design flaws that raised questions about their findings, thus limiting their potential to help us understand how teachers knowledge and credentials help them teach. Notice, too, that the limitations I have outlined are not limitations inherent in qualitative research. They do not suggest that qualitative research is inherently weaker than quantitative research. But they do demonstrate that when qualitative researchers establish truth-seeking goals, they need to pay more attention to their own methods. More important than their limitations is the messages about the role of knowledge, credentials, and teacher education that these studies offer. Because they can look at practice as it unfolds, they offer observations that quantitative studies can rarely provide. One important message that these studies provide has to do with the tremendous power of classroom and school contexts over practice. Schools structure teaching practice through schedules, textbooks, materials, and rules. Students also influence teaching practices. They are the primary audience from whom teachers can get feedback about their work. The problem of situational press is not unique to teaching, of course. We all forget about admonitions to eat less sugar and fat when we are hungry and surrounded by vending machines full of potato chips and candy bars. Another important message here is that studies seeking evidence of a role for content knowledge were more successful than those seeking evidence of a role for teacher education programs. This is somewhat of a surprise, for quantitative studies have been more equivocal about the role of content knowledge. That studies of teacher education programs were so rarely able to reveal any clear and visible influences is particularly surprising, because most of this literature was generated by people who were themselves teacher educators who were deeply knowledgeable of their courses and programs and thus, presumably, would be more able to see influences if they were there. The third message offered by these studies is that teacher education courses provided only general ideas or concepts, without the kind of detailed guidance teachers need to translate these concepts into specific strategies. The studies introduce us to a teacher with extensive field experience in earth science who is unable to translate that experience into engaging lessons, despite having completed a teacher education program, and to teachers who learned to listen to their students but did not know how to incorporate what they heard into their lessons. Teaching entails translating ideas into events, and many of the teachers described here were unable to do that. We could speculate, as some of these authors did, that teacher education could be more influential if it provided examples of how to translate its ideas into specific events. This hypothesis reinforces arguments for the use of such tools as cases and hypermedia to examine specific lessons, student work, or school contexts (for a review of this work, see Grossman, 2005). Finally, several studies suggest that there is a two-way interaction between knowledge and practice. Knowledge did not remain fixed in teachers minds after they left college but instead continued to evolve as it interacted with both beliefs and teaching experience itself. Most quantitative work is based on an assumption that the knowledge measured by licensure tests, or instilled by curriculum requirements, remains unchanged over time and that it is just as relevant to a 30-year veteran as it is to a new teacher. But if knowledge continues to evolve over time, we would not expect to see a relationship between credentials or college course work in a random sample of teachers who graduated at different times in the past. So knowledge and program messages, both intended to be used to control events in the classroom, are themselves influenced by these same events. These studies might also provide an important stimulus for teacher educators. They demonstrate many reasons why program ideas may never find their way into classroom practices and in so doing may stimulate a new generation of hypotheses about how programs can have a greater influence, and a new generation of programs that are more grounded in school life.
The author acknowledges financial support from the U.S. Department of Educations Office of Educational Research and Improvement, now Institute for Educational Sciences; by the National Science Foundations Division of Research, Evaluation, and Communication; and by the National Science Foundations Math and Science Partnerships. However, the opinions expressed here are the authors, and no institutional endorsement should be inferred.
MARY M. KENNEDY is a professor of education at Michigan State University, College of Education, 116 Erickson Hall, East Lansing, MI 48824; e-mail:mkennedy{at}msu.edu. Her scholarship focuses on defining teacher quality and on identifying the things that most influence teacher quality. She has examined the influences of teacher education, research knowledge, attitudes and beliefs, credentials, and school context. Received for publication November 21, 2007. Revision received August 18, 2008. Accepted for publication September 3, 2008.
Educational Evaluation and Policy Analysis, Vol. 30, No. 4,
344-367 (2008)
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||





