{"$schema":"https://policywindow.org/critique/api/schema","critique_id":"CRIT-000029","slug":"genai-classroom-teaching-university-teachers","url":"https://policywindow.org/critique/c/genai-classroom-teaching-university-teachers","doi":null,"status":"published","critique_type":"editorially_approved_ai_native_critique","publication_date":"2026-06-29","current_version":"1.0","target_paper":{"title":"Factors influencing the adoption of generative artificial intelligence into classroom teaching by university teachers: An empirical study using SPSS PROCESS macros","authors":["Yong Xiang","Chenxin Yang","Zhigang Jin","Wanshu Zhao"],"journal":"PLOS One","doi":"10.1371/journal.pone.0324875","url":"https://doi.org/10.1371/journal.pone.0324875","publicationDate":"2025","paperType":"empirical","accessBasis":"open_access","fullTextUsed":true,"fictional":false,"doi_url":"https://doi.org/10.1371/journal.pone.0324875"},"source_journal":{"tier":"exception","rankingSources":["off-monitored: peer-reviewed gold-OA journal (PLOS ONE) not in the monitored determination; disclosed off-list"],"rankingNote":"Off-monitored: PLOS ONE is a peer-reviewed, gold open-access journal not in the journal's monitored top-tier determination; disclosed off-list. Critiqued at full text via the source store."},"selection_provenance":{"id":"genai-classroom-teaching-university-teachers","venue":"PLOS One","inMonitoredSet":false,"determinedTier":null,"recordedTier":"exception","effectiveTier":"exception","kind":"off_list","disclosed":true,"offListPeerReviewed":true},"selection":{"aiAgiCentralityScore":3,"societalRelevanceScore":4,"aiAgiCategories":["education","human_AI_interaction"],"selectionReason":"Autonomous production cycle (G101), deepening the education domain: a full-text critique of a PLS-SEM study of university teachers' generative-AI adoption, span-grounded to the gold-OA full text via the source store."},"scores":{"aiAgiContribution":3,"evidentiarySupport":2,"methodologicalRisk":4,"overclaiming":3,"reproducibilityOrAuditability":4,"societalImpactRelevance":4,"severity":"high","confidence":"high"},"severity_cap_for_access_basis":"high","plain_language_summary":"This PLOS One paper surveys 513 university teachers in China with a one-month, all-self-report questionnaire and SPSS PROCESS macros (Models 4 and 61) to test a social-cognitive-theory model of why teachers adopt generative AI. It reports adequate reliability/validity (Cronbach's alpha, CR, AVE, Fornell-Larcker, the stricter HTMT) and finds all six hypotheses supported. A full-text adversarial convergence panel returned a UNANIMOUS survives verdict — the defender could not restore any point. Four span-exact flaws hold. (1) The paper draws causal/mechanistic conclusions ('verify the mechanism', and it justifies its method on the grounds that SEM 'focuses on analyzing causal relationships between variables') from a single-wave, cross-sectional, all-self-report design that cannot establish causal direction, rule out reverse/reciprocal causation (social cognitive theory itself posits reciprocal determinism), or exclude common-method variance — and no CMV test is reported. (2) The sample is internally contradictory on its central eligibility criterion: eligibility is stated as ages 22–45, yet the Limitations report a high percentage aged 36–49, a band exceeding the stated ceiling. (3) Reproducibility is limited: the raw data are withheld, and the manuscript carries verifiability problems a reader cannot resolve (the reference number [24] is attached to two different works; the results text contains a garbled statistic, 'the positive OE effect on OE is significant'). (4) An unsupported absolute novelty claim ('there is no research on teachers' self-efficacy as a factor affecting teachers' acceptance of technology') is contradicted by the paper's own TAM/TPB and self-efficacy citations. The authors do credibly disclose the self-report subjectivity, generalizability limits, and skewed age distribution, and use a conventional PLS-SEM reliability/validity workflow with 5,000 bootstrap resamples — genuine strengths that bear on measurement quality but cannot repair causal inference, the age contradiction, the withheld data, or the false universal negative.","claims":[{"id":"C1","text":"Causal/mechanistic conclusions are drawn from a single-wave, cross-sectional, all-self-report design that cannot license them.","type":"causal","evidenceOffered":"Structural equation modeling focuses on analyzing causal relationships between variables","support":"weak","overclaiming":"major","assessment":"The paper sets out to 'verify the mechanism of the influence of each factor' and justifies its method on the grounds that 'Structural equation modeling focuses on analyzing causal relationships between variables,' but all constructs (self-efficacy, outcome expectations, self-improvement, external environment, willingness) were measured simultaneously in a one-month questionnaire window from a convenience sample, with no manipulation, no temporal lag, and no behavioural outcome. PROCESS Models 4/61 fit regression-based mediation/moderation coefficients with bootstrap CIs; they assume the causal ordering rather than test it, are equally consistent with reverse or reciprocal causation (which social cognitive theory itself posits), and cannot exclude common-method variance from all-self-report measures — and no CMV test (Harman, marker variable) is reported. The directional conclusions therefore outrun the design.","mainWeakness":"Cross-sectional all-self-report data cannot establish the causal mechanisms the paper claims; no CMV test, lag, or manipulation.","confidence":"high"},{"id":"C2","text":"The sample is internally contradictory on its own central eligibility criterion.","type":"descriptive","evidenceOffered":"this study has a high percentage of college teachers aged 36–49 years old","support":"weak","overclaiming":"moderate","assessment":"The Methods restrict respondents to college teachers aged 22–45, yet the Limitations report that the study has a high percentage of teachers aged 36–49 — a band whose upper end (46–49) exceeds the stated eligibility ceiling. Either the criterion was not enforced, the reported distribution is wrong, or the categories were mislabelled; the full text never reconciles them. The realized sample composition is therefore not reliably known, undermining claims about who the findings generalize to, and the convenience frame (emails harvested from university homepages) further limits representativeness.","mainWeakness":"A high-percentage age band (36–49) that exceeds the paper's own 22–45 eligibility ceiling, unreconciled in the text.","confidence":"high"},{"id":"C3","text":"Reproducibility is limited: raw data are withheld and the manuscript carries verifiability problems a reader cannot resolve.","type":"descriptive","evidenceOffered":"the raw data cannot be publicly disclosed","support":"weak","overclaiming":"none","assessment":"The raw data are withheld ('the raw data cannot be publicly disclosed'), so the PROCESS analyses cannot be independently rerun. Compounding this, the manuscript carries verifiability problems a reader cannot resolve from the text: the reference number [24] is attached to two different works in the same paragraph (Zhang & Qian on adolescent academic performance; Rahmati on pronunciation instruction, with the reference list showing [24] = Rahmati only), and the results text contains a garbled statistic ('the positive OE effect on OE is significant'). These make it difficult to verify which sources and which numbers support which claims. (Reported factually as verifiability defects in the text, not as any judgement of the authors.)","mainWeakness":"Withheld raw data plus in-text citation/statistic inconsistencies block independent verification.","confidence":"high"},{"id":"C4","text":"An unsupported absolute novelty claim is contradicted by the paper's own citations.","type":"descriptive","evidenceOffered":"there is no research on teachers’ self-efficacy as a factor affecting teachers’ acceptance of technology","support":"unsupported","overclaiming":"major","assessment":"The paper asserts a universal negative — 'there is no research on teachers' self-efficacy as a factor affecting teachers' acceptance of technology' — with no citation or systematic search to justify it, despite the paper itself citing the Technology Acceptance Model, the Theory of Planned Behavior, SEM studies of teachers' ICT adoption, teacher self-efficacy literature, and (in its own reference list) a generative-AI-acceptance-and-self-efficacy study. The sweeping 'no research exists' framing inflates the study's originality and is partly self-contradicted by the paper's own references; a charitable 'no AIGC-specific study' reading softens but does not rescue the literal text.","mainWeakness":"A universal-negative novelty claim with no supporting search, contradicted by the paper's own reference list.","confidence":"high"}],"sections":[{"id":"what","title":"What the paper does","body":"A PLS-SEM study: 513 Chinese university teachers, a one-month all-self-report questionnaire (March–April 2024), SPSS PROCESS Models 4/61, testing a social-cognitive-theory model of generative-AI ('AIGC') adoption. Reliability/validity reported (alpha, CR, AVE, Fornell-Larcker, HTMT); all six hypotheses supported."},{"id":"flaw1","title":"Statistical inference — causal claims from cross-sectional self-report","body":"The paper claims to 'verify the mechanism' and grounds its method on SEM analysing 'causal relationships', but the design is single-wave, cross-sectional, all-self-report, with no manipulation, lag, behavioural outcome, or common-method-variance test. PROCESS assumes the causal ordering rather than testing it; reverse/reciprocal causation (which SCT itself posits) and common-method variance are unaddressed, so the directional conclusions outrun the data."},{"id":"flaw2","title":"Sample / data — an internal eligibility contradiction","body":"Eligibility is stated as ages 22–45, yet the Limitations report a high percentage aged 36–49 — exceeding the stated ceiling and unreconciled in the text. The realized sample composition is therefore not reliably known, and the convenience frame further limits representativeness."},{"id":"flaw3","title":"Reproducibility — withheld data + in-text inconsistencies","body":"Raw data are withheld, so the analyses cannot be independently rerun, and the manuscript carries verifiability problems a reader cannot resolve: the reference number [24] is attached to two different works, and the results text contains a garbled statistic ('the positive OE effect on OE is significant'). Reported as verifiability defects in the text, not as any judgement of the authors."},{"id":"flaw4","title":"Overclaiming — an unsupported universal-negative novelty claim","body":"The paper asserts 'there is no research on teachers' self-efficacy as a factor affecting teachers' acceptance of technology' — an uncited universal negative contradicted by the paper's own TAM/TPB and teacher-self-efficacy citations. A 'no AIGC-specific study' reading softens but does not rescue the literal claim."},{"id":"strengths","title":"What the paper does well","body":"The reliability/validity workflow is conventionally complete (Cronbach's alpha 0.849 with items >0.700, CR>0.7, AVE>0.5, Fornell-Larcker, and the stricter HTMT<0.85), it cites appropriate methodological authorities (Hair et al.; Henseler et al.; Kock & Hadaya; Hayes), uses 5,000 bootstrap resamples with 95% CIs, and n=513 is adequate for the model. The authors also honestly disclose the self-report subjectivity, the generalizability limits, and the skewed age distribution, and propose concrete future remedies. These strengths are real but bear on measurement quality — none repairs the cross-sectional causal inference, the age contradiction, the withheld data plus in-text inconsistencies, or the universal-negative novelty claim."}],"strongest_critique":"The single most serious problem is the mismatch between the study's causal/mechanistic claims and its cross-sectional, all-self-report design: it sells itself as verifying 'the mechanism of the influence of each factor' and justifies its method on the grounds that SEM 'focuses on analyzing causal relationships between variables,' but every construct was measured simultaneously in a one-month questionnaire window from a convenience sample, with no manipulation, no temporal lag, no behavioural outcome, and no common-method-variance test. PROCESS Models 4/61 assume the causal ordering rather than test it; the chains are equally consistent with reverse or reciprocal causation (which social cognitive theory itself posits), and common-method variance can manufacture the observed associations. The directional conclusions are overstated relative to what a cross-sectional correlational design supports — and this sits alongside a self-contradictory age criterion, withheld data with in-text citation/statistic inconsistencies, and an uncited universal-negative novelty claim.","strongest_fair_defence":"The paper does several conventional things correctly and is transparent about some limits. It reports a defensible PLS-SEM reliability/validity workflow (Cronbach's alpha, CR>0.7, AVE>0.5, Fornell-Larcker, the stricter HTMT<0.85), cites appropriate authorities (Hair et al.; Henseler et al.; Kock & Hadaya for minimum sample size; Hayes for PROCESS), and uses 5,000 bootstrap resamples with 95% CIs — standard best practice for indirect-effect inference, with n=513 adequate for the model complexity. Crucially, the authors do not hide the design's weaknesses: the Limitations openly state the data 'relied on self-reporting ... which is somewhat subjective', acknowledge the limits 'restrict the generalizability of the findings', flag the skewed age distribution, and propose concrete future remedies. Faulting cross-sectional inference is fair, but the authors deserve credit for disclosing the self-report and generalizability constraints rather than concealing them.","final_judgment":"A publishable-genre but methodologically weak adoption-intention study whose conclusions should be read as exploratory correlational associations, not the causal mechanisms it claims. A full-text convergence panel returned a unanimous survives verdict (the defender could not restore any point). Four span-exact flaws hold: causal overreach from a one-month cross-sectional all-self-report design; a sample whose 36–49 age description contradicts its own 22–45 eligibility rule; reproducibility limits (withheld raw data, a double-assigned citation [24], a garbled results statistic); and an uncited universal-negative novelty claim the paper's own citations undercut. The genuine strengths (standard PLS reliability/validity reporting, 5,000-resample bootstrap CIs, honest disclosure of self-report and generalizability limits) are real but bear on measurement quality and cannot offset the reproducibility and causal-inference problems. Overall severity high, driven primarily by the reproducibility/verifiability issues and the causal overclaiming rather than any single fatal statistical error. Procedural note: produced by the autonomous production cycle (G101); every span independently verified an exact substring of the gold-OA full text; targets claims, methods and inference only, never the authors.","review_process":{"aiAgentsUsed":["claim_extraction","methods","statistics","adversarial","author_defence","plain_language","meta_review"],"reviewRounds":2,"humanEditor":{"name":"","role":"","approvalDate":"2026-06-29","declaredConflict":"none"},"expertCertification":{"used":false}},"author_response":{"notified":false,"status":"not_yet_invited","editorialActionAfterResponse":"Authors may reply at any time; this critique addresses claims, methods and inference only, never the authors."},"versions":[{"version":"1.0","date":"2026-06-29","note":"Initial publication (autonomous production cycle — education depth).","changeType":"initial"}],"transparency":{"modelCardUrl":"/critique/model-card","publicAuditSummary":"Full-text critique of a gold-OA (PLOS ONE) paper; every span verified an exact substring of the full text (source store), independently re-checked; DOI resolves (title+author+year matched via Crossref). Produced by the autonomous production cycle (G101) and run through the hardened convergence gate: UNANIMOUS survives, stable, no sustained defeat (the defender could not restore any point). Targets claims/methods/inference only, never the authors.","privateAuditRecordExists":true,"citationVerification":{"status":"complete","checkedSources":[{"label":"DOI 10.1371/journal.pone.0324875 (Crossref: title+author+year matched)","url":"https://doi.org/10.1371/journal.pone.0324875","verified":true},{"label":"Full text used for span verification","url":"https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0324875","verified":true}],"fabricatedCitations":0},"riskReview":{"copyright":"completed","defamation":"completed","note":"Gold-OA paper quoted sparingly under criticism/review; targets claims/methods/inference only — manuscript inconsistencies are reported as verifiability facts, never as author misconduct."}}}