Listing 1 - 9 of 9
This first volume of PISA 2009 survey results provides comparable data on 15-year-olds' performance on reading, mathematics and science across 65 countries. The volume opens with an introduction explaining what PISA is and how PISA 2009 is different from previous PISA surveys. The introduction also explains what PISA 2009 measures and how. A reader's guide provides information needed to interpret the data. Chapter 2 provides a summary of the findings related to performance in reading, the focus of the 2009 survey. Chapter 3 provides a summary of the findings related to performance in mathematics and science. A final chapter explores policy implications in five areas: low performance, pursuing excellence, strengths and weaknesses in different kinds of reading, student performance in math and science, and the potential to improve performance across the world. Annexes provide detailed statistical data and technical information.
Education --- Social Sciences --- Theory & Practice of Education --- Examinations --- Interpretation. --- Interpretation of examinations --- Test interpretation --- Test results --- Validity
Integrating Timing Considerations to Improve Testing Practices synthesizes a wealth of theory and research on time issues in assessment into actionable advice for test development, administration, and scoring. One of the major advantages of computer-based testing is the capability to passively record test-taking metadata, including how examinees use time and how time affects testing outcomes. This has raised many questions for testing administrators. Is there a trade-off between speed and accuracy in test taking? What considerations should influence equitable decisions about extended-time accommodations? How can test administrators use timing data to balance the costs and resulting validity of tests administered at commercial testing centers? In this comprehensive volume, experts in the field discuss the impact of timing considerations, constraints, and policies on valid score interpretations; administrative accommodations, test construction, and examinees' experiences and behaviors; and how to put the findings into practice. These 12 chapters provide invaluable resources for testing professionals to better understand the inextricable links between effective time allocation and the purposes of high-stakes testing. The Open Access version of this book, available at https://www.taylorfrancis.com, has been made available under a Creative Commons Attribution-Non Commercial-No Derivatives 4.0 license.
Examinations --- Educational tests and measurements --- Validity. --- Educational assessment --- Educational measurements --- Mental tests --- Tests and measurements in education --- Psychological tests for children --- Psychometrics --- Students --- Psychological tests --- Test results --- Test validity --- Validity of examinations --- Rating of --- Interpretation
This open access book examines the challenges and issues caused by a move to a marketized education system in Sweden. Observing the introduction of the school voucher system and a postmodern social constructivist view of knowledge, the book identifies the move away from objective knowledge as the core reason for Sweden’s current education crisis. The impact of declining education standards on the labor market is also discussed. This book highlights the issues seen in Sweden and suggests policies that can improve education in the rest of the Western world as well. It will be relevant to students and researchers interested in education and labor economics.
Economics --- Labour economics --- Political economy --- Swedish Educational System --- Educational Performance in Swedish Schools --- Marketized Education --- Test Results and National Economic Performance --- Dysfunctional work environment --- Fragmentation of the Swedish School System --- Open Access --- Education --- Labor economics. --- Economics. --- Education Economics. --- Labor Economics. --- Political Economy and Economic Systems. --- Economic aspects. --- Economic theory --- Social sciences --- Economic man --- Education and state --- Educational change --- Privatization in education --- Standards
Kernel Equating (KE) is a powerful, modern and unified approach to test equating. It is based on a flexible family of equipercentile-like equating functions and contains the linear equating function as a special case. Any equipercentile equating method has five steps or parts. They are: 1) pre-smoothing; 2) estimation of the score-probabilities on the target population; 3) continuization; 4) computing and diagnosing the equating function; 5) computing the standard error of equating and related accuracy measures. KE brings these steps together in an organized whole rather than treating them as disparate problems. KE exploits pre-smoothing by fitting log-linear models to score data, and incorporates it into step 5) above. KE provides new tools for diagnosing a given equating function, and for comparing two or more equating functions in order to choose between them. In this book, KE is applied to the four major equating designs and to both Chain Equating and Post-Stratification Equating for the Non-Equivalent groups with Anchor Test Design. This book will be an important reference for several groups: (a) Statisticians and others interested in the theory behind equating methods and the use of model-based statistical methods for data smoothing in applied work; (b) Practitioners who need to equate tests, including those with these responsibilities in testing companies, state testing agencies and school districts; and (c) Instructors in psychometric and measurement programs. The authors assume some familiarity with linear and equipercentile test equating, and with matrix algebra. Alina von Davier is an Associate Research Scientist in the Center for Statistical Theory and Practice, at Educational Testing Service. She has been a research collaborator at the Universities of Trier, Magdeburg, and Kiel, an assistant professor at the Politechnical University of Bucharest and a research scientist at the Institute for Psychology in Bucharest. Paul Holland holds the Frederic M. Lord Chair in Measurement and Statistics at Educational Testing Service. He held faculty positions in the Graduate School of Education, University of California, Berkeley and the Harvard Department of Statistics. He is a Fellow of the American Statistical Association, the Institute of Mathematical Statistics, and the American Association for the Advancement of Science. He is an elected Member of the International Statistical Institute and a past president of the Psychometric Society. He was awarded the AERA/ACT E. F. Lindquist Award in 2000 and was designated a National Associate of the National Academies of Science in 2002. Dorothy Thayer currently is a consultant in the Center of Statistical Theory and Practice, at Educational Testing Service. Her research interests include computational and statistical methodology, empirical Bayes techniques, missing data procedures and exploratory data analysis techniques.
Examinations --- Educational tests and measurements --- Scoring. --- Interpretation. --- Design and construction. --- Standards. --- Econometrics. --- Statistics. --- Educational tests and measuremen. --- Psychometrics. --- Statistics for Social Sciences, Humanities, Law. --- Assessment, Testing and Evaluation. --- Statistics . --- Assessment. --- Measurement, Mental --- Measurement, Psychological --- Psychological measurement --- Psychological scaling --- Psychological statistics --- Psychology --- Psychometry (Psychophysics) --- Scaling, Psychological --- Psychological tests --- Scaling (Social sciences) --- Statistical analysis --- Statistical data --- Statistical methods --- Statistical science --- Mathematics --- Econometrics --- Economics, Mathematical --- Statistics --- Measurement --- Scaling --- Methodology --- Educational assessment --- Educational measurements --- Mental tests --- Tests and measurements in education --- Psychological tests for children --- Psychometrics --- Students --- Test construction --- Test design --- Interpretation of examinations --- Test interpretation --- Test results --- Remote scoring of examinations --- Scoring of examinations --- Self-scoring of examinations --- Test scoring --- Rating of --- Validity
Score reporting research is no longer limited to the psychometric properties of scores and subscores. Today, it encompasses design and evaluation for particular audiences, appropriate use of assessment outcomes, the utility and cognitive affordances of graphical representations, interactive report systems, and more. By studying how audiences understand the intended messages conveyed by score reports, researchers and industry professionals can develop more effective mechanisms for interpreting and using assessment data. Score Reporting Research and Applications brings together experts who design and evaluate score reports in both K-12 and higher education contexts and who conduct foundational research in related areas. The first section covers foundational validity issues in the use and interpretation of test scores; design principles drawn from related areas including cognitive science, human-computer interaction, and data visualization; and research on presenting specific types of assessment information to various audiences. The second section presents real-world applications of score report design and evaluation and of the presentation of assessment information. Across ten chapters, this volume offers a comprehensive overview of new techniques and possibilities in score reporting.
Educational tests and measurements --- Examinations --- Grading and marking (Students) --- Evaluation. --- Validity. --- Graded schools --- Marking (Students) --- Students --- School reports --- Test results --- Test validity --- Validity of examinations --- Grading and marking --- Interpretation --- Rating of --- Educational tests and measurements - Evaluation --- Examinations - Validity --- Andrew Krumm --- April L. Zenisky --- Francis O'Donnell --- Gautam Puhan --- Gavin T. L. Brown --- John A. C. Hattie --- Linda Corrin --- Lisa A. Keller --- Marc Silver --- Mary Hegarty --- Mingyu Feng --- Priya Kannan --- Rebecca Zwick --- Richard J. Tannenbaum --- Ronald K. Hambleton --- Samuel A. Livingston --- Sandip Sinharay --- Sharon Slater --- Shelby J. Haberman --- Shuchi Grover --- Stephen G. Sireci --- Timothy M. O'Leary --- Yooyoung Park
Ability --- Educational tests and measurements --- Examinations --- Aptitude --- Tests et mesures en éducation --- Examens --- Testing --- Design and construction --- Interpretation --- Tests --- Elaboration --- Interprétation des résultats --- 159.9 --- -Educational tests and measurements --- -Examinations --- -Competitive examinations --- Questions and answers --- Educational assessment --- Educational measurements --- Mental tests --- Tests and measurements in education --- Psychological tests for children --- Psychometrics --- Students --- Psychological tests --- Abilities --- Proficiency --- Skill --- Skills --- Talent --- Talents --- Expertise --- Psychologie: zie ook: Psychiatrie: n-{616.89-008} en n-{615.851} --- Rating of --- -Psychologie: zie ook: Psychiatrie: n-{616.89-008} en n-{615.851} --- 159.9 Psychologie --- 159.9 Psychologie: zie ook: Psychiatrie: n-{616.89-008} en n-{615.851} --- Psychologie --- -Educational assessment --- Competitive examinations --- Tests et mesures en éducation --- Interprétation des résultats --- Interpretation of examinations --- Test interpretation --- Test results --- Test construction --- Test design --- Ability testing --- Aptitude tests --- Testing, Ability --- Validity --- 159.9 Psychology --- Psychology
This book provides an introduction to test equating, scaling, and linking, including those concepts and practical issues that are critical for developers and all other testing professionals. In addition to statistical procedures, successful equating, scaling, and linking involves many aspects of testing, including procedures to develop tests, to administer and score tests, and to interpret scores earned on tests. Test equating methods are used with many standardized tests in education and psychology to ensure that scores from multiple test forms can be used interchangeably. Test scaling is the process of developing score scales that are used when scores on standardized tests are reported. In test linking, scores from two or more tests are related to one another. Linking has received much recent attention, due largely to investigations of linking similarly named tests from different test publishers or tests constructed for different purposes. In recent years, researchers from the education, psychology, and statistics communities have contributed to the rapidly growing statistical and psychometric methodologies used in test equating, scaling, and linking. In addition to the literature covered in previous editions, this new edition presents coverage of significant recent research. In order to assist researchers, advanced graduate students and testing professionals, examples are used frequently, and conceptual issues are stressed. New material includes model determination in log-linear smoothing, in-depth presentation of chained linear and equipercentile equating, equating criteria, test scoring, and a new section on scores for mixed-format tests. 
In the third edition, each chapter contains its own reference list, rather than the volume having a single reference list at the end. The themes of the third edition include: the purposes of equating, scaling and linking and their practical context; data collection designs; statistical methodology; designing reasonable and useful equating, scaling, and linking studies; the importance of test development and quality control processes to equating; and equating error and the underlying statistical assumptions for equating.
Examinations -- Interpretation. --- Examinations -- Scoring. --- Examinations --Design and construction. --- Mathematical statistics. --- Psychological tests --Standards. --- Examinations --- Psychological tests --- Educational tests and measurements --- Education --- Social Sciences --- Theory & Practice of Education --- Scoring --- Interpretation --- Design and construction --- Standards --- Scoring. --- Interpretation. --- Design and construction. --- Standards. --- Educational assessment --- Educational measurements --- Mental tests --- Tests and measurements in education --- Psychological assessment --- Tests, Psychological --- Test construction --- Test design --- Interpretation of examinations --- Test interpretation --- Test results --- Remote scoring of examinations --- Scoring of examinations --- Self-scoring of examinations --- Test scoring --- Statistics. --- Assessment. --- Psychometrics. --- Statistics for Social Science, Behavorial Science, Education, Public Policy, and Law. --- Assessment, Testing and Evaluation. --- Statistics for Social Science, Behavioral Science, Education, Public Policy, and Law. --- Measurement, Mental --- Measurement, Psychological --- Psychological measurement --- Psychological scaling --- Psychological statistics --- Psychology --- Psychometry (Psychophysics) --- Scaling, Psychological --- Scaling (Social sciences) --- Statistical analysis --- Statistical data --- Statistical methods --- Statistical science --- Mathematics --- Econometrics --- Measurement --- Scaling --- Methodology --- Psychological tests for children --- Psychometrics --- Students --- Testing --- Clinical psychology --- Rating of --- Validity --- Educational tests and measuremen. --- Statistics for Social Sciences, Humanities, Law. --- Statistics .
This book describes how to use test equating methods in practice. The non-commercial software R is used throughout the book to illustrate how to perform different equating methods when score data are collected under different data collection designs, such as the equivalent groups design, single group design, counterbalanced design and non-equivalent groups with anchor test design. The R packages equate, kequate and SNSequate, among others, are used to illustrate the different methods in practice, while simulated and real data sets show how the methods are carried out in R. The book covers traditional equating methods, including mean and linear equating, frequency estimation equating and chain equating, as well as modern equating methods such as kernel equating, local equating and combinations of these. It also offers chapters on observed and true score item response theory equating and discusses recent developments within the equating field. More specifically, it covers the issue of including covariates within the equating process, the use of different kernels and ways of selecting bandwidths in kernel equating, and the Bayesian nonparametric estimation of equating functions. It also illustrates how to evaluate equating in practice using simulation and different equating-specific measures such as the standard error of equating, percent relative error, difference that matters and others.
Examinations --- Educational tests and measurements --- Scoring. --- Interpretation. --- Standards. --- Educational assessment --- Educational measurements --- Mental tests --- Tests and measurements in education --- Interpretation of examinations --- Test interpretation --- Test results --- Remote scoring of examinations --- Scoring of examinations --- Self-scoring of examinations --- Test scoring --- Education. --- Assessment. --- Statistics. --- Psychometrics. --- Assessment, Testing and Evaluation. --- Statistics for Social Science, Behavorial Science, Education, Public Policy, and Law. --- Psychological tests for children --- Psychometrics --- Students --- Psychological tests --- Rating of --- Validity --- Educational tests and measuremen. --- Statistics for Social Sciences, Humanities, Law. --- Statistical analysis --- Statistical data --- Statistical methods --- Statistical science --- Mathematics --- Econometrics --- Measurement, Mental --- Measurement, Psychological --- Psychological measurement --- Psychological scaling --- Psychological statistics --- Psychology --- Psychometry (Psychophysics) --- Scaling, Psychological --- Scaling (Social sciences) --- Measurement --- Scaling --- Methodology --- Statistics . --- R (Computer program language). --- GNU-S (Computer program language) --- Domain-specific programming languages --- Educational tests and measurements. --- Social sciences --- Assessment and Testing. --- Statistics in Social Sciences, Humanities, Law, Education, Behavorial Sciences, Public Policy. --- Statistical methods.
The goal of this book is to emphasize the formal statistical features of the practice of equating, linking, and scaling. The book encourages viewing and discussing the quality of equating results from a statistical perspective (new models, robustness, fit, hypothesis testing, statistical monitoring) rather than focusing on policy and its implications, which, although very important, represent a different side of equating practice. The book contributes to establishing “equating” as a theoretical field, a view that has not been offered often before. The tradition in the practice of equating has been to present the knowledge and skills needed as a craft, which implies that only with years of experience under the guidance of a knowledgeable practitioner could one acquire the required skills. This book challenges this view by indicating how a good equating framework, a sound understanding of the assumptions that underlie the psychometric models, and the use of statistical tests and statistical process control tools can help the practitioner navigate the difficult decisions in choosing the final equating function. This book provides a valuable reference for several groups: (a) statisticians and psychometricians interested in the theory behind equating methods, in the use of model-based statistical methods for data smoothing, and in the evaluation of the equating results in applied work; (b) practitioners who need to equate tests, including those with these responsibilities in testing companies, state testing agencies, and school districts; and (c) instructors in psychometric, measurement, and psychology programs. Dr. Alina A. von Davier is a Strategic Advisor and a Director of Special Projects in Research and Development at Educational Testing Service (ETS). During her tenure at ETS, she has led an ETS Research Initiative called “Equating and Applied Psychometrics” and has directed the Global Psychometric Services Center.
The center supports the psychometric work for all ETS international programs, including TOEFL iBT and TOEIC. She is a co-author of a book on the kernel method of test equating, an author of a book on hypotheses testing in regression models, and a guest co-editor for a special issue on population invariance of linking functions for the journal Applied Psychological Measurement.
Educational tests and measurements -- Evaluation. --- Examinations -- Design and construction. --- Examinations -- Interpretation. --- Examinations -- Scoring. --- Psychological tests -- Standards. --- Scaling (Social sciences). --- Social sciences -- Statistical methods. --- Examinations --- Educational tests and measurements --- Psychological tests --- Education --- Social Sciences --- Education, Special Topics --- Theory & Practice of Education --- Scoring --- Statistical methods --- Mathematical statistics. --- Scoring. --- Interpretation. --- Mathematics --- Statistical inference --- Statistics, Mathematical --- Interpretation of examinations --- Test interpretation --- Test results --- Remote scoring of examinations --- Scoring of examinations --- Self-scoring of examinations --- Test scoring --- Education. --- Assessment. --- Statistics. --- Psychometrics. --- Assessment, Testing and Evaluation. --- Statistics for Social Science, Behavorial Science, Education, Public Policy, and Law. --- Statistics for Social Science, Behavioral Science, Education, Public Policy, and Law. --- Measurement, Mental --- Measurement, Psychological --- Psychological measurement --- Psychological scaling --- Psychological statistics --- Psychology --- Psychometry (Psychophysics) --- Scaling, Psychological --- Scaling (Social sciences) --- Statistical analysis --- Statistical data --- Statistical science --- Econometrics --- Children --- Education, Primitive --- Education of children --- Human resource development --- Instruction --- Pedagogy --- Schooling --- Students --- Youth --- Civilization --- Learning and scholarship --- Mental discipline --- Schools --- Teaching --- Training --- Measurement --- Scaling --- Methodology --- Statistics --- Probabilities --- Sampling (Statistics) --- Validity --- Educational tests and measuremen. --- Statistics for Social Sciences, Humanities, Law. --- Statistical methods. --- Statistics .