Efficient Estimation of Mean Ability Growth Using Vertical Scaling
Stockholm University, Faculty of Social Sciences, Department of Statistics. ORCID iD: 0000-0001-7552-8983
Stockholm University, Faculty of Social Sciences, Department of Statistics. ORCID iD: 0000-0003-4161-7851
Number of authors: 2
2021 (English). In: Applied Measurement in Education, ISSN 0895-7347, E-ISSN 1532-4818, Vol. 34, no. 3, p. 163-178
Article in journal (Refereed). Published
Abstract [en]

In recent years, the interest in measuring growth in student ability in various subjects between different grades in school has increased. Therefore, good precision in the estimated growth is of importance. This paper aims to compare estimation methods and test designs when it comes to precision and bias of the estimated growth of mean ability between two groups of students that differ substantially. This is performed by a simulation study. One- and two-parameter item response models are assumed and the estimated abilities are vertically scaled using the non-equivalent anchor test design by estimating the abilities in one single run, so-called concurrent calibration. The connection between the test design and the Fisher information is also discussed. The results indicate that the expected a posteriori estimation method is preferred when estimating differences in mean ability between groups. Results also indicate that a test design with common items of medium difficulty leads to better precision, which coincides with previous results from horizontal equating.
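The expected a posteriori (EAP) estimate named in the abstract can be illustrated with a minimal sketch. It assumes a two-parameter logistic model, a normal prior on ability, and an equally spaced quadrature grid; the function names, the grid settings, and the item parameters are hypothetical choices for illustration and are not taken from the paper.

```python
import math

def p_correct(theta, a, b):
    """2PL probability of a correct response; a = 1 gives the one-parameter form."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def eap_estimate(responses, items, prior_mean=0.0, prior_sd=1.0, n_points=81):
    """Posterior mean of ability, approximated on an equally spaced grid.

    responses: list of 0/1 scores; items: list of (a, b) pairs (hypothetical values).
    """
    lo = prior_mean - 4 * prior_sd
    hi = prior_mean + 4 * prior_sd
    step = (hi - lo) / (n_points - 1)
    num = den = 0.0
    for k in range(n_points):
        theta = lo + k * step
        # Unnormalised normal prior density at this grid point
        weight = math.exp(-0.5 * ((theta - prior_mean) / prior_sd) ** 2)
        # Multiply by the likelihood of the observed response pattern
        for x, (a, b) in zip(responses, items):
            p = p_correct(theta, a, b)
            weight *= p if x == 1 else 1.0 - p
        num += theta * weight
        den += weight
    return num / den

# Hypothetical three-item test with difficulties spread around zero
items = [(1.0, -1.0), (1.0, 0.0), (1.0, 1.0)]
high = eap_estimate([1, 1, 1], items)
low = eap_estimate([0, 0, 0], items)
```

Because the posterior mean is pulled toward the prior mean, an examinee with an all-correct or all-incorrect pattern still receives a finite estimate, which a maximum likelihood estimate would not provide.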

Place, publisher, year, edition, pages
2021. Vol. 34, no. 3, p. 163-178
National subject category
Educational Sciences; Mathematics
Identifiers
URN: urn:nbn:se:su:diva-195839
DOI: 10.1080/08957347.2021.1933981
ISI: 000661773400001
OAI: oai:DiVA.org:su-195839
DiVA, id: diva2:1588174
Available from: 2021-08-26. Created: 2021-08-26. Last updated: 2022-02-25. Bibliographically approved
Included in thesis
1. Test Design for Mean Ability Growth and Optimal Item Calibration for Achievement Tests
2021 (English). Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

In this thesis, we examine two topics in the area of educational measurement. The first topic studies how to best design two achievement tests with common items such that a population mean-ability growth is measured as precisely as possible. The second examines how to calibrate newly developed test items optimally. These topics are two optimal design problems in achievement testing. Paper I consists of a simulation study where different item difficulty allocations are compared regarding the precision of mean ability growth when controlling for estimation method and item difficulty span. We take a more theoretical approach to allocating the item difficulties in Paper II. We use particle swarm optimization on a multi-objective weighted sum to determine an exact design of the two tests with common items. The outcome relies on asymptotic results of the test information function. The general conclusion of both papers is that we should allocate the common items in the middle of the difficulty span, with the two separate test items on different sides. When we decrease the difference in mean ability between the groups, the ranges of the common and test items coincide more.

In the second part, we examine how to apply an existing optimal calibration method and algorithm using data from the Swedish Scholastic Aptitude Test (SweSAT). We further develop it to consider uncertainty in the examinees' ability estimates. Paper III compares the optimal calibration method with random allocation of items to examinees in a simulation study using different measures. In most cases, the optimal design method estimates the calibration items more efficiently. We can also identify the kinds of items for which the method performs worse.

The method applied in Paper III assumes that the estimated abilities are the true ones. In Paper IV, we further develop the method to handle uncertainty in the ability estimates which are based on an operational test. We examine the asymptotic result and compare it to the case of known abilities. The optimal design using estimates approaches the optimal design assuming true abilities for increasing information from the operational test.

Place, publisher, year, edition, pages
Stockholm: Department of Statistics, Stockholm University, 2021. p. 42
Keywords
test design, item response theory, optimal experimental design, SweSAT, item calibration, vertical scaling, ability growth, computerized adaptive tests
National subject category
Probability Theory and Statistics; Educational Sciences
Research subject
Statistics
Identifiers
URN: urn:nbn:se:su:diva-197928
ISBN: 978-91-7911-674-3
ISBN: 978-91-7911-675-0
Public defence
2021-12-10, lecture hall 4, building 2, Albanovägen 12, Stockholm, 10:00 (English)
Available from: 2021-11-17. Created: 2021-10-26. Last updated: 2022-02-25. Bibliographically approved

Open Access in DiVA

No full text in DiVA

Authors

Bjermo, Jonas; Miller, Frank