Optimizing calibration designs with uncertainty in abilities
Stockholms universitet, Samhällsvetenskapliga fakulteten, Statistiska institutionen. Linköping University, Sweden. ORCID iD: 0000-0001-7552-8983
Stockholms universitet, Samhällsvetenskapliga fakulteten, Statistiska institutionen. ORCID iD: 0000-0003-0528-0083
Stockholms universitet, Samhällsvetenskapliga fakulteten, Statistiska institutionen. Linköping University, Sweden. ORCID iD: 0000-0003-4161-7851
2025 (English). In: British Journal of Mathematical & Statistical Psychology, ISSN 0007-1102, E-ISSN 2044-8317, Vol. 78, no. 3, pp. 889-910. Article in journal (Refereed). Published
Abstract [en]

In computerized adaptive tests, some newly developed items are often added for pretesting purposes. In this pretesting, item characteristics are estimated, a process called calibration. Allocating calibration items to examinees based on their abilities is promising, and methods from optimal experimental design have been used for this purpose. However, this allocation has usually assumed that the abilities of the examinees are known. In practice, the abilities are estimates based on a limited number of operational items. We develop the theory for handling the uncertainty in abilities properly and show how the optimal calibration design can be derived in this situation. The method has been implemented in an R package. We see that the derived optimal calibration designs are more robust if this uncertainty in abilities is acknowledged.
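To make the idea concrete, the following is a minimal Python sketch of how ability uncertainty could enter a design criterion. It is not the authors' R implementation. It assumes a two-parameter logistic (2PL) item with discrimination `a` and difficulty `b`, a normal distribution around each ability estimate with standard error `se`, and a D-criterion (log-determinant of the summed information). All function names and numerical values are hypothetical.

```python
import numpy as np
from numpy.polynomial.hermite_e import hermegauss


def item_information(theta, a, b):
    """Fisher information matrix for (a, b) of a 2PL item at ability theta."""
    p = 1.0 / (1.0 + np.exp(-a * (theta - b)))
    # Gradient of the linear predictor a * (theta - b) with respect to (a, b).
    grad = np.array([theta - b, -a])
    return p * (1.0 - p) * np.outer(grad, grad)


def expected_information(theta_hat, se, a, b, n_nodes=21):
    """Average the information over theta ~ N(theta_hat, se^2) (an assumption)."""
    if se == 0.0:
        return item_information(theta_hat, a, b)
    # Gauss-Hermite nodes for the weight exp(-x^2 / 2); normalise the weights
    # so that they represent a standard normal distribution.
    nodes, weights = hermegauss(n_nodes)
    weights = weights / weights.sum()
    return sum(w * item_information(theta_hat + se * x, a, b)
               for x, w in zip(nodes, weights))


def d_criterion(theta_hats, se, a, b):
    """Log-determinant of the information summed over the allocated examinees."""
    total = sum(expected_information(t, se, a, b) for t in theta_hats)
    _, logdet = np.linalg.slogdet(total)
    return logdet


# Hypothetical examinee ability estimates allocated to one calibration item.
abilities = [-1.0, 0.0, 1.5]
print(d_criterion(abilities, se=0.0, a=1.2, b=0.5))  # abilities treated as known
print(d_criterion(abilities, se=0.6, a=1.2, b=0.5))  # uncertainty acknowledged
```

Comparing the two printed values for different candidate allocations shows how a design that looks best under known abilities may rank differently once the estimation error is averaged over.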

Place, publisher, year, edition, pages
2025. Vol. 78, no. 3, pp. 889-910
Keywords [en]
Ability, Computerized Adaptive Tests, Item Calibration, Optimal Experimental Design
National subject category
Research subject
Statistics
Identifiers
URN: urn:nbn:se:su:diva-198065
DOI: 10.1111/bmsp.12387
ISI: 001520329900001
PubMedID: 40065545
Scopus ID: 2-s2.0-105000444923
OAI: oai:DiVA.org:su-198065
DiVA, id: diva2:1605908
Funder
Swedish Research Council, 2019-02706
Available from: 2021-10-26 Created: 2021-10-26 Last updated: 2025-11-20 Bibliographically approved
In thesis
1. Test Design for Mean Ability Growth and Optimal Item Calibration for Achievement Tests
2021 (English). Doctoral thesis, comprising papers (Other academic)
Abstract [en]

In this thesis, we examine two topics in the area of educational measurement. The first topic studies how best to design two achievement tests with common items such that population mean-ability growth is measured as precisely as possible. The second examines how to calibrate newly developed test items optimally. Both topics are optimal design problems in achievement testing. Paper I consists of a simulation study in which different item difficulty allocations are compared with regard to the precision of mean ability growth, controlling for estimation method and item difficulty span. In Paper II, we take a more theoretical approach to allocating the item difficulties. We use particle swarm optimization on a multi-objective weighted sum to determine an exact design of the two tests with common items. The outcome relies on asymptotic results for the test information function. The general conclusion of both papers is that the common items should be allocated in the middle of the difficulty span, with the items unique to each of the two tests on different sides. As the difference in mean ability between the groups decreases, the ranges of the common and test-specific items coincide more.

In the second part, we examine how to apply an existing optimal calibration method and algorithm using data from the Swedish Scholastic Aptitude Test (SweSAT). We further develop it to account for uncertainty in the examinees' ability estimates. Paper III compares the optimal calibration method with random allocation of items to examinees in a simulation study using different measures. In most cases, the optimal design method estimates the calibration items more efficiently. We can also identify the kinds of items for which the method performs worse.

The method applied in Paper III assumes that the estimated abilities are the true ones. In Paper IV, we further develop the method to handle uncertainty in the ability estimates, which are based on an operational test. We examine the asymptotic result and compare it to the case of known abilities. As the information from the operational test increases, the optimal design based on ability estimates approaches the optimal design that assumes true abilities.
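The limiting behaviour described above can be pictured with a small numerical sketch. It is an illustration under stated assumptions, not the derivation in Paper IV: the standard error of the ability estimate is approximated as one over the square root of the operational test information, the estimation error is taken to be normal, and the calibration item is a hypothetical Rasch item with difficulty `b`. The function name `information_gap` is likewise hypothetical.

```python
import numpy as np
from numpy.polynomial.hermite_e import hermegauss


def information_gap(test_information, theta_hat=0.0, b=0.0, n_nodes=41):
    """Gap between item information at the estimate and its average
    over theta ~ N(theta_hat, 1 / test_information) (assumed error model)."""
    se = 1.0 / np.sqrt(test_information)
    nodes, weights = hermegauss(n_nodes)
    weights = weights / weights.sum()  # normalise to standard normal weights

    def info(theta):
        # Fisher information of a Rasch item about its difficulty b.
        p = 1.0 / (1.0 + np.exp(-(theta - b)))
        return p * (1.0 - p)

    averaged = np.sum(weights * info(theta_hat + se * nodes))
    return abs(info(theta_hat) - averaged)


# Hypothetical amounts of information from the operational test.
gaps = [information_gap(i) for i in (1.0, 4.0, 16.0, 64.0)]
for i, g in zip((1.0, 4.0, 16.0, 64.0), gaps):
    print(f"test information {i:5.1f}: gap {g:.5f}")
```

In this sketch the gap shrinks as the operational test becomes more informative, so the criterion computed from estimates moves towards the one computed from known abilities.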

Place, publisher, year, edition, pages
Stockholm: Department of Statistics, Stockholm University, 2021. 42 p.
Keywords
test design, item response theory, optimal experimental design, SweSAT, item calibration, vertical scaling, ability growth, computerized adaptive tests
National subject category
Research subject
Statistics
Identifiers
urn:nbn:se:su:diva-197928 (URN)
978-91-7911-674-3 (ISBN)
978-91-7911-675-0 (ISBN)
Public defence
2021-12-10, lecture hall 4, building 2, Albanovägen 12, Stockholm, 10:00 (English)
Opponent
Supervisor
Available from: 2021-11-17 Created: 2021-10-26 Last updated: 2022-02-25 Bibliographically approved

Open Access in DiVA

Full text not available in DiVA
Authors

Bjermo, Jonas; Fackle-Fornius, Ellinor; Miller, Frank