This paper examines the trade-off between sensitivity and specificity in selecting diagnostic tests for a research study involving participants who have already tested positive for a condition (referred to as "condition Q"). The paper compares two hypothetical tests — Test A with 95% sensitivity and 75% specificity, and Test B with 75% sensitivity and 95% specificity — and argues that Test B is the more appropriate choice for this study design. Drawing on published research, including studies on abdominal mass diagnosis and autism spectrum conditions, the paper justifies the selection of Test B based on its lower false-negative rate and higher capacity for confirming true negatives in a population where all participants are presumed to have the condition.
When selecting a diagnostic test that is most conducive to generating authentic results, a researcher must weigh the trade-offs between two key properties: sensitivity and specificity. In the present scenario, the choice is between Test A, which carries a 95% sensitivity and 75% specificity rating, and Test B, which carries a 75% sensitivity and 95% specificity rating. Choosing Test B is the sensible choice for this particular study.
The researcher intends to use participants who have already tested positive for condition Q. Because it can be assumed that all participants are suffering from condition Q, the sensitivity measure indicates that Test B will correctly identify 75% of all sufferers as positive. At the same time, Test B's specificity rate of 95% means that 95% of participants who test negative will genuinely be negative. Abobaker (2015) used the same type of test to determine a diagnosis of sonographically detected abdominal masses in a similar scenario and was able to conclude a high level of both sensitivity and specificity.
Since the researcher is attempting to determine how many participants are affected once a variable has been introduced into the study, it is important to be able to track both false positives — participants who test positive when they are not — and false negatives — participants who present as not suffering from condition Q when they actually are.
Using Test A, the 95% sensitivity rate means that only 5% of participants would fail to test positive, even though the researcher already knows every participant has been confirmed to have the condition. By contrast, using Test B would show that 25% of participants were not testing positive for condition Q. Counterintuitively, those results could be more easily justified precisely because the researcher already knew that every participant had previously presented with the condition — making the higher false-negative rate under Test B a more transparent and interpretable outcome within this specific design.
"External studies validating sensitivity-specificity methodology"
"The value of fewer false negatives over false positives"
There is a case to be made for both tests. Test A can show that there is a relatively lower number of false positives, which is a good thing. Test B, however, shows that a lower number of false negatives will be found. Comparing the two tests, most researchers would likely find Test B to be the one that is more aligned with the goals of this specific study. That does not mean Test A is entirely without value, but Test B would provide results that are more authentic and more defensible given the study's design.
You’re 54% through this paper. Sign up to read the remaining 2 sections.
Sign Up Now — Instant Access Already a member? Log inAlways verify citation format against your institution’s current style guide requirements.