Post-publication Comment · Critical AI
Comment on “Postgraduate students' perceptions of artificial intelligence integration in research: A cross-sectional study”
Critical AI · published 2026-06-28 · v1.0 · CRIT-000021
Concerning: Ibrahim Naif Alenezi, Fathia Ahmed Mersal, Amal Ahmed Elbilgahy · PLOS One · 2026
Why this paper was selected
Continuous-staged autonomous run: OA full-text critique engaging the coverage blind spots (identification, sample_data, reproducibility); span-grounded to the gold-OA full text.
AI/AGI centrality 2/5 · societal relevance 3/5 · source-journal note: Off-monitored: PLOS One is a peer-reviewed, gold open-access journal (CC BY) not in the journal's monitored top-tier determination; disclosed off-list. Critiqued at full text.
Summary
This is a single-university survey of 267 nursing/health master's students in Saudi Arabia about how they perceive using AI in research. It is honestly framed: the authors say upfront it cannot prove cause and effect, admit their volunteer sample skews toward tech-savvy students, and limit their conclusions to this one institution. After cleaning up an over-fired draft critique that was full of details not actually in the paper, three real problems remain. First, the paper interprets a statistical finding — that people more worried about privacy also reported more intention to adopt AI — as evidence of sophisticated "critical literacy," which reads more into a single correlation than it can bear. Second, there's a logical snag: having used AI before was supposedly required to take part, yet "prior AI use: yes/no" is used as a predictor and only 85% report prior use, so the entry rule and the data don't line up. Third, only summary tables are shared, so no one can independently re-run the analysis. The descriptive numbers are useful context; the bigger interpretive claims are overstated.
Central claims & evidence map
| Claim | Type | Evidence offered | Support | Overclaiming | Main weakness |
|---|---|---|---|---|---|
| The paper interprets a positive cross-sectional privacy-concerns coefficient as a disposition: "Privacy concerns appear to reflect critical literacy rather than barriers to adoption." | Causal | Privacy concerns appear to reflect critical literacy rather than barriers to adoption. | Strong | Major | A positive cross-sectional association between privacy concerns and adoption intention is recast as a psychological disposition ('critical literacy rather than barriers to adoption'). This is a causal-interpretive leap from a single correla |
| The abstract frames simultaneously-measured self-report subscales causally: perceived benefits are "the strongest predictor of intention to adopt" and privacy concerns "suggest informed and critical engagement." | Causal | Perceived benefits were the strongest predictor of intention to adopt AI for research purposes (β = 0.588, p < 0.001). Privacy concerns were positively associated with adoption intention (β = 0.230, p < 0.001), suggesting informed and critical engagement rather than resistance. | Moderate | Moderate | The abstract uses predictor/strength language ('strongest predictor', 'positively associated ... suggesting informed and critical engagement') that imports a directional, mechanistic reading the design cannot support: benefits, privacy conc |
| Prior AI experience is stated as an inclusion criterion, yet only 85.0% report prior use and "Prior AI Tool Use (Yes vs. No)" is entered as a regression predictor. | Methodological | who have prior experience using or evaluating AI tools, such as ChatGPT, DeepSeek, Zotero, QuillBot, Grammarly, or Mendeley, for research purposes | Strong | Moderate | Internal contradiction between eligibility and analysis. Prior experience using or evaluating AI tools is stated as an explicit inclusion criterion, yet the paper reports only 85.0% prior use and enters 'Prior AI Tool Use (Yes vs. No)' as a |
| The paper states "All relevant data are fully presented within the manuscript itself" — i.e. only aggregate tables, no analysable dataset. | Methodological | All relevant data are fully presented within the manuscript itself, and no additional supporting files were generated. | Strong | Minor | The analysis is not independently reproducible. The Data Availability Statement declares that all data are within the manuscript and no supporting files were generated, but the manuscript provides only aggregate tables — not the participant |
Per-claim assessment
C1. The paper interprets a positive cross-sectional privacy-concerns coefficient as a disposition: "Privacy concerns appear to reflect critical literacy rather than barriers to adoption."
A positive cross-sectional association between privacy concerns and adoption intention is recast as a psychological disposition ('critical literacy rather than barriers to adoption'). This is a causal-interpretive leap from a single correlation in a one-time-point, self-report design; given the paper's own reported VIF up to 4.8, a collinearity/suppression artifact is at least as plausible as a substantive trait, yet no such alternative is considered and the framing is asserted in the conclusion as a population-level characteristic.
C2. The abstract frames simultaneously-measured self-report subscales causally: perceived benefits are "the strongest predictor of intention to adopt" and privacy concerns "suggest informed and critical engagement."
The abstract uses predictor/strength language ('strongest predictor', 'positively associated ... suggesting informed and critical engagement') that imports a directional, mechanistic reading the design cannot support: benefits, privacy concerns and adoption intention are all measured simultaneously by the same respondent on the same instrument, so common-method variance and reverse causation (intending to adopt inflates perceived benefits) are confounded with the reported associations. The Methods section does correctly disclaim causality, so this is an inconsistency between hedged methods and causal-flavored framing rather than a uniform error.
C3. Prior AI experience is stated as an inclusion criterion, yet only 85.0% report prior use and "Prior AI Tool Use (Yes vs. No)" is entered as a regression predictor.
Internal contradiction between eligibility and analysis. Prior experience using or evaluating AI tools is stated as an explicit inclusion criterion, yet the paper reports only 85.0% prior use and enters 'Prior AI Tool Use (Yes vs. No)' as a regression predictor in Table 4. If the inclusion filter were applied, there could be no 'No' category and no ~15% non-users; the coexistence of the filter, the 85% figure, and the Yes/No predictor is logically incoherent and signals either inconsistent screening or mis-specification. (The draft's specific '40 participants (15.0%)' / 'Table 1' figures are not in the text and are not relied upon here.)
C4. The paper states "All relevant data are fully presented within the manuscript itself" — i.e. only aggregate tables, no analysable dataset.
The analysis is not independently reproducible. The Data Availability Statement declares that all data are within the manuscript and no supporting files were generated, but the manuscript provides only aggregate tables — not the participant- or item-level data needed to reproduce the regression, the HC3 standard errors, the G*Power calculation, or the Cronbach's alpha values. The unresolved prior-AI-use contradiction also cannot be checked without underlying data that is declared not to exist.
Scorecard
Sub-scores are 0–5 editorial judgements on fixed scales (higher is better, except methodological risk and overclaiming where higher is worse). They are contestable and open to a severity challenge from authors.
What the paper does
A single-institution cross-sectional survey of 267 nursing/health-profession master's students in Saudi Arabia on perceptions of AI in research, using a TAM-based validated instrument; the authors explicitly disclaim causal inference and name their convenience-sampling bias.
Interpretive overreach
A positive cross-sectional association between privacy concerns and adoption intention is recast as a psychological disposition ('critical literacy rather than barriers to adoption'). This is a causal-interpretive leap from a single correlation in a one-time-point, self-report design; given the paper's own reported VIF up to 4.8, a collinearity/suppression artifact is at least as plausible as a substantive trait, yet no such alternative is considered and the framing is asserted in the conclusion as a population-level characteristic.
Cross-sectional design vs causal-flavoured framing
The abstract uses predictor/strength language ('strongest predictor', 'positively associated ... suggesting informed and critical engagement') that imports a directional, mechanistic reading the design cannot support: benefits, privacy concerns and adoption intention are all measured simultaneously by the same respondent on the same instrument, so common-method variance and reverse causation (intending to adopt inflates perceived benefits) are confounded with the reported associations. The Methods section does correctly disclaim causality, so this is an inconsistency between hedged methods and causal-flavored framing rather than a uniform error.
Eligibility-vs-analysis contradiction
Internal contradiction between eligibility and analysis. Prior experience using or evaluating AI tools is stated as an explicit inclusion criterion, yet the paper reports only 85.0% prior use and enters 'Prior AI Tool Use (Yes vs. No)' as a regression predictor in Table 4. If the inclusion filter were applied, there could be no 'No' category and no ~15% non-users; the coexistence of the filter, the 85% figure, and the Yes/No predictor is logically incoherent and signals either inconsistent screening or mis-specification. (The draft's specific '40 participants (15.0%)' / 'Table 1' figures are not in the text and are not relied upon here.)
What the paper does well
This is a competently executed, unusually self-aware descriptive survey for its genre. The authors explicitly disclaim causal inference ("this design precludes causal inference and cannot establish temporal precedence"), name the convenience-sampling bias and its likely direction, restrict generalization to a single regional case study rather than a population, and ground the work in an established framework (TAM) using a previously validated published instrument administered in its original English to an English-instruction health-sciences cohort. The methods exceed many comparable perception studies: an a priori G*Power sample-size calculation, HC3 robust standard errors, reported VIF/outlier diagnostics, recomputed subscale reliabilities, and — notably — a full Table 4 with B, SE, 95% confidence intervals, standardized betas and p-values (so the draft's "no confidence intervals" charge is simply wrong). As context-specific, exploratory evidence from an under-studied non-Western health-professional population, which the paper repeatedly and accurately frames it as, the descriptive findings are informative and modestly stated.
Strongest critique
The flagship interpretive claim overreaches the design. A positive cross-sectional coefficient for privacy concerns is recast as a psychological disposition ("Privacy concerns appear to reflect critical literacy rather than barriers to adoption"), and benefits are labeled "the strongest predictor of intention to adopt," even though every subscale and the outcome are measured at one time point by the same respondent on the same instrument — so common-method variance and reverse causation are fully confounded with the reported associations, and a counterintuitive positive coefficient is at least as plausibly a collinearity artifact (the paper itself reports VIF up to 4.8) as a substantive trait. This is compounded by an unresolved data-integrity contradiction: prior AI experience is an explicit inclusion criterion, yet "prior AI use" is entered as a Yes-vs-No predictor and only 85% report prior use — logically incoherent if the filter was applied — and it cannot be checked because the declared dataset is only the manuscript's aggregate tables.
Strongest fair defence
This is a competently executed, unusually self-aware descriptive survey for its genre. The authors explicitly disclaim causal inference ("this design precludes causal inference and cannot establish temporal precedence"), name the convenience-sampling bias and its likely direction, restrict generalization to a single regional case study rather than a population, and ground the work in an established framework (TAM) using a previously validated published instrument administered in its original English to an English-instruction health-sciences cohort. The methods exceed many comparable perception studies: an a priori G*Power sample-size calculation, HC3 robust standard errors, reported VIF/outlier diagnostics, recomputed subscale reliabilities, and — notably — a full Table 4 with B, SE, 95% confidence intervals, standardized betas and p-values (so the draft's "no confidence intervals" charge is simply wrong). As context-specific, exploratory evidence from an under-studied non-Western health-professional population, which the paper repeatedly and accurately frames it as, the descriptive findings are informative and modestly stated.
Conclusion
REFUTE (with credit), but on a much narrower and better-grounded basis than the draft. The paper is a competent, self-aware single-institution descriptive survey that explicitly disclaims causal inference, names its convenience-sampling bias and likely direction, and restricts its claims to a context-specific case study. Three genuine, exactly span-grounded flaws nonetheless weaken its inferential and evidentiary core: (1) the abstract/conclusion recast a positive cross-sectional privacy-concerns coefficient as a psychological disposition ("critical literacy rather than barriers to adoption") and label simultaneously-measured self-report subscales as "the strongest predictor," language stronger than a single-time-point common-method design supports; (2) an unresolved internal contradiction — prior AI experience is an explicit eligibility requirement, yet prior AI use appears as a Yes-vs-No regression predictor and only 85% report prior use, so either ineligible respondents were included or the filter was not applied; and (3) the analysis is not independently reproducible because the Data Availability Statement declares only aggregate manuscript tables exist. These are real but bounded; treat the descriptive prevalence findings as suggestive context-specific evidence and the interpretive "critical literacy"/"pragmatic optimism" framings as overclaimed. IMPORTANT: the draft critique was heavily contaminated with material absent from the full text (a Model 1/Model 2 split, adjusted R²=0.993, "methodological circularity," literal "40 participants (15.0%)"/"Table 1," subscale r up to .71, a 97.4%/227 ChatGPT denominator) and with at least one false claim ("no confidence intervals reported anywhere" — Table 4 includes a 95% CI column). Those points were dropped as ungrounded.
Reply from the authors
Following the practice of Nature Matters Arising, Science Technical Comments and PNAS Letters, this Comment is published as one half of a Comment + Reply pair: the authors of the original article are invited to respond, and any reply is published here verbatim alongside the Comment as part of the record.
Reply: not yet invited. No reply has been received for publication.
The authors have a right of reply and no veto. A reply may request a factual correction, a methodological rebuttal, a clarification, a data/code update, or a severity challenge, and is published unedited. See the right-of-reply policy.
Automated re-evaluation after reply: Authors may reply at any time; this critique addresses claims, methods and inference only, never the authors.
References
Every external source this Comment cites, each with a verified link. 0 fabricated.
Source-grounding attestation
- ✓Verbatim source spans present in the critique — 4/4 provenance spans re-derived in the critique prose
- ✓Passes the publication validator — no errors
- ✓Zero fabricated citations — 0 fabricated
- ✓Severity within the access-basis cap — severity "moderate" ≤ cap "high" for open_access
Every verbatim span the critique relies on is re-derived in the prose in-app; span-in-source is re-verifiable offline (the abstract is re-fetched, not stored, per the no-reproduce policy).
Re-verify span-in-source offline: python3 scripts/verify-queue-critiques.py
Independent faithfulness review
A refute-by-default adversarial panel (two independent reviewers — an overreach lens and a mischaracterization lens — that fetched the real source) tried to prove this critique misread the paper. This is an AI adversarial review recorded with its reasoning, not a deterministic check.
FAITHFUL. I verified every load-bearing element of the sharpened critique against the real full text and found no invention, misquotation, inappropriate hedging, or denied credit. EXACT-SPAN CHECK (all four kept flaws): every verbatimSpan is an exact, contiguous substring of the source. (1) "Privacy concerns appear to reflect critical literacy rather than barriers to adoption." — verbatim in Conclusions/abstract. (2) "Perceived benefits were the strongest predictor of intention to adopt AI for research purposes (β = 0.588, p < 0.001). Privacy concerns were positively associated with adoption intention (β = 0.230, p < 0.001), suggesting informed and critical engagement rather than resistance." — verbatim in Results abstract. (3) "who have prior experience using or evaluating AI tools, such as ChatGPT, DeepSeek, Zotero, QuillBot, Grammarly, or Mendeley, for research purposes" — exact substring of the inclusion criterion (source continues "...including literature searches..."). (4) "All relevant data are fully presented within the manuscript itself, and no additional supporting files were generated." — verbatim in Data Availability. SUPPORTING-FACT CHECK: every cited statistic/feature is accurate — VIF range "1.2–4.8" (Table 4 diagnostics), adjusted R²=.560, 85.0% prior use, "Prior AI Tool Use (Yes vs. No)" predictor in Table 4, prior AI experience as an explicit inclusion crite
Version & correction history
| Version | Date | Change |
|---|---|---|
| v1.0 | 2026-06-28 | Initial publication (promoted from the continuous-staged queue; G92). |
No silent substantive corrections — every change is versioned and visible.
How to cite this Comment
Critical AI. Comment on “Postgraduate students' perceptions of artificial intelligence integration in research: A cross-sectional study” (Ibrahim Naif Alenezi et al., PLOS One, 2026). Critical AI; 2026. https://policywindow.org/critique/c/postgraduate-students-perceptions-ai-research-plos
A registered DOI will replace the URL once minted; until then the canonical URL is the persistent identifier. Highwire/Dublin-Core citation tags and a schema.org Review record are embedded in this page for Google Scholar and reference managers.
Verify this Comment. Its checkable facts (target DOI, access-basis severity cap, zero fabricated citations) are served — as the app’s self-report — at /critique/api/critiques/postgraduate-students-perceptions-ai-research-plos/verify; to confirm them independently of this site, re-derive the same checks (and resolve the target DOI) with npx tsx scripts/verify-critical-ai.ts --critique postgraduate-students-perceptions-ai-research-plos --live.
Content fingerprint 7f417f5746e5c4f2 (v1.0) — this Comment’s substantive content is content-addressed; a silent post-publication edit would change it.