Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Optimal Test Design for Estimation of Mean Ability Growth
Stockholm University, Faculty of Social Sciences, Department of Statistics. Linköping University, Sweden.ORCID iD: 0000-0001-7552-8983
2025 (English)In: Applied psychological measurement, ISSN 0146-6216, E-ISSN 1552-3497, Vol. 49, no 1-2, p. 29-49Article in journal (Refereed) Published
Abstract [en]

The design of an achievement test is crucial for many reasons. This article focuses on a population’s ability growth between school grades. We define design as the allocating of test items concerning the difficulties. The objective is to present an optimal test design method for estimating the mean and percentile ability growth with good precision. We use the asymptotic expression of the variance in terms of the test information. With that criterion for optimization, we propose to use particle swarm optimization to find the optimal design. The results show that the allocation of the item difficulties depends on item discrimination and the magnitude of the ability growth. The optimization function depends on the examinees’ abilities, hence, the value of the unknown mean ability growth. Therefore, we will also use an optimum in-average design and conclude that it is robust to uncertainty in the mean ability growth. A test is, in practice, assembled from items stored in an item pool with calibrated item parameters. Hence, we also perform a discrete optimization using simulated annealing and compare the results to the particle swarm optimization. 

Place, publisher, year, edition, pages
2025. Vol. 49, no 1-2, p. 29-49
Keywords [en]
ability growth, item response theory, optimal design, optimum in-average, particle swarm optimization, simulated annealing, test information
National Category
Probability Theory and Statistics Other Educational Sciences
Research subject
Statistics
Identifiers
URN: urn:nbn:se:su:diva-198066DOI: 10.1177/01466216241291233ISI: 001331568100001Scopus ID: 2-s2.0-85206572166OAI: oai:DiVA.org:su-198066DiVA, id: diva2:1606221
Available from: 2021-10-26 Created: 2021-10-26 Last updated: 2025-02-03Bibliographically approved
In thesis
1. Test Design for Mean Ability Growth and Optimal Item Calibration for Achievement Tests
Open this publication in new window or tab >>Test Design for Mean Ability Growth and Optimal Item Calibration for Achievement Tests
2021 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

In this thesis, we examine two topics in the area of educational measurement. The first topic studies how to best design two achievement tests with common items such that a population mean-ability growth is measured as precisely as possible. The second examines how to calibrate newly developed test items optimally. These topics are two optimal design problems in achievement testing. Paper I consist of a simulation study where different item difficulty allocations are compared regarding the precision of mean ability growth when controlling for estimation method and item difficulty span. We take a more theoretical approach on how to allocate the item difficulties in Paper II. We use particle swarm optimization on a multi-objective weighted sum to determine an exact design of the two tests with common items. The outcome relies on asymptotic results of the test information function. The general conclusion of both papers is that we should allocate the common items in the middle of the difficulty span, with the two separate test items on different sides. When we decrease the difference in mean ability between the groups, the ranges of the common and test items coincide more.

In the second part, we examine how to apply an existing optimal calibration method and algorithm using data from the Swedish Scholastic Aptitude Test (SweSAT). We further develop it to consider uncertainty in the examinees' ability estimates. Paper III compares the optimal calibration method with random allocation of items to examinees in a simulation study using different measures. In most cases, the optimal design method estimates the calibration items more efficiently. Also, we can identify for what kind of items the method works worse.

The method applied in Paper III assumes that the estimated abilities are the true ones. In Paper IV, we further develop the method to handle uncertainty in the ability estimates which are based on an operational test. We examine the asymptotic result and compare it to the case of known abilities. The optimal design using estimates approaches the optimal design assuming true abilities for increasing information from the operational test.

Place, publisher, year, edition, pages
Stockholm: Department of Statistics, Stockholm University, 2021. p. 42
Keywords
test design, item response theory, optimal experimental design, SweSAT, item calibration, vertical scaling, ability growth, computerized adaptive tests
National Category
Probability Theory and Statistics Educational Sciences
Research subject
Statistics
Identifiers
urn:nbn:se:su:diva-197928 (URN)978-91-7911-674-3 (ISBN)978-91-7911-675-0 (ISBN)
Public defence
2021-12-10, hörsal 4, hus 2, Albanovägen 12, Stockholm, 10:00 (English)
Opponent
Supervisors
Available from: 2021-11-17 Created: 2021-10-26 Last updated: 2022-02-25Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Bjermo, Jonas

Search in DiVA

By author/editor
Bjermo, Jonas
By organisation
Department of Statistics
In the same journal
Applied psychological measurement
Probability Theory and StatisticsOther Educational Sciences

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 299 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf