{"$schema":"https://policywindow.org/critique/api/schema","critique_id":"CRIT-000019","slug":"student-perspectives-generative-ai-higher-ed","url":"https://policywindow.org/critique/c/student-perspectives-generative-ai-higher-ed","doi":null,"status":"published","critique_type":"editorially_approved_ai_native_critique","publication_date":"2026-06-27","current_version":"1.0","target_paper":{"title":"Student perspectives on the use of generative artificial intelligence technologies in higher education","authors":["Heather Johnston","Rebecca Wells","Elizabeth M. Shanks","Timothy Boey","Bryony N. Parsons"],"journal":"International Journal for Educational Integrity","doi":"10.1007/s40979-024-00149-4","url":"https://doi.org/10.1007/s40979-024-00149-4","publicationDate":"2024-02-08","paperType":"empirical","accessBasis":"abstract_only","fullTextUsed":false,"fictional":false,"doi_url":"https://doi.org/10.1007/s40979-024-00149-4"},"source_journal":{"tier":"exception","rankingSources":["off-monitored: peer-reviewed education venue not in the monitored determination; disclosed off-list"],"rankingNote":"Off-monitored: International Journal for Educational Integrity is a peer-reviewed education journal not in the monitored determination; disclosed off-list."},"selection_provenance":{"id":"student-perspectives-generative-ai-higher-ed","venue":"International Journal for Educational Integrity","inMonitoredSet":false,"determinedTier":null,"recordedTier":"exception","effectiveTier":"exception","kind":"off_list","disclosed":true,"offListPeerReviewed":true},"selection":{"aiAgiCentralityScore":1,"societalRelevanceScore":3,"aiAgiCategories":[],"selectionReason":"G88b coverage-moving batch (education white-space); validated G84 engine; span-grounded to the OpenAlex abstract."},"scores":{"aiAgiContribution":1,"evidentiarySupport":3,"methodologicalRisk":3,"overclaiming":1,"reproducibilityOrAuditability":2,"societalImpactRelevance":3,"severity":"moderate","confidence":"medium"},"severity_cap_for_access_basis":"moderate","plain_language_summary":"Researchers surveyed 2,555 University of Liverpool students about AI tools like ChatGPT to help update the university's academic-integrity rules. Most students had heard of these tools, supported light-touch aids like Grammarly far more than using ChatGPT to write a whole essay, and wanted a clear university policy. The survey is large and useful for its local purpose. Judged only from the abstract, its limits are about scope rather than over-claiming: it combines 'used' and merely 'considered using' into one headline figure, so that number is not a clean use rate; the link it reports between writing confidence and AI use is descriptive, so we cannot tell what drives it; and its policy recommendations (don't ban, ensure equal access) are reasonable normative positions that go beyond what a descriptive survey can by itself establish. It is one institution's snapshot, not a generalizable study.","claims":[{"id":"CLAIM-001","text":"Students who had higher levels of confidence in their academic writing were less likely to use or consider using them for academic purposes, and were also less likely to be supportive of other students using them.","type":"empirical","evidenceOffered":"Students who had higher levels of confidence in their academic writing were less likely to use or consider using them for academic purposes, and were also less likely to be supportive of other students using them.","support":"moderate","overclaiming":"none","assessment":"This is the abstract's most analytically ambitious finding, a directional association between writing confidence and both usage and attitudes. Read against the abstract, the paper reports it as an association, not a causal claim, so the fair reading is descriptive: with no mechanism identified and no adjustment reported, it cannot be distinguished from confounding by correlates of writing confidence or from a reversed relationship. As one institution's correlation it is informative; it simply does not license an explanation of why confident writers abstain, and the abstract does not assert one. The limit is interpretive scope, not over-claiming.","mainWeakness":"Reported as a descriptive association with no mechanism identified and no adjustment for plausible correlates of writing confidence (discipline, study level, prior attainment); it is informative as one institution's correlation but underdetermined as an explanation -- a limit on interpretive scope, not over-claiming by the authors.","confidence":"medium"},{"id":"CLAIM-002","text":"over half had used or considered using these for academic purposes","type":"empirical","evidenceOffered":"over half had used or considered using these for academic purposes","support":"moderate","overclaiming":"none","assessment":"Collapsing 'used' and 'considered using' into a single 'over half' figure is a construct-precision limit: behavioural use and mere contemplation are different constructs with different policy weight, and no split is reported. The abstract is transparent that the figure is combined, so this is not a distortion of the headline rate; it is a limit on how precisely uptake can be stated from the reported number. A breakdown would let the figure be read as an actual use rate.","mainWeakness":"Combines actual use with mere consideration in one headline figure with no reported split, so the number cannot be read as a use rate; 'academic purposes' is also left undefined. The abstract is transparent that the figure is combined, so this is a precision limit, not a distortion.","confidence":"medium"},{"id":"CLAIM-003","text":"these technologies should not be banned from university, but consideration must be made to ensure different groups of students have equal access to the technologies","type":"normative","evidenceOffered":"these technologies should not be banned from university, but consideration must be made to ensure different groups of students have equal access to the technologies","support":"weak","overclaiming":"none","assessment":"The recommendations are normative policy positions, and the abstract frames the study as policy-informing. As normative claims they are reasonable and appropriate to an applied integrity journal; the only fair observation is that they are value-based recommendations that go beyond what the descriptive survey results by themselves establish. This is a scope note about the descriptive/normative boundary, not a demand that a policy recommendation supply an empirical disparities analysis.","mainWeakness":"A normative policy recommendation, reasonable and appropriate to a policy-informing study, but value-based and therefore beyond what the descriptive survey results by themselves establish; the fair note is the descriptive/normative boundary, not a demand for an empirical disparities analysis.","confidence":"medium"}],"sections":[{"id":"s1","title":"Construct and measurement validity","body":"Two construct-precision points stand out, neither a distortion. First, the engagement headline combines behaviour and intention: \"over half had used or considered using these for academic purposes\" merges actual use with mere contemplation, two constructs with different policy weight, and no split is reported -- so the figure cannot be read as a use rate, though the abstract is transparent that it is a combined number. Second, the key explanatory variable, students' confidence in their academic writing, is presumably self-rated and single-item; the abstract gives no indication of how it was measured or validated. Both are limits on precision and interpretation rather than evidence of over-claiming."},{"id":"s2","title":"Inference and confounding","body":"The relationship the abstract reports -- that writing-confident students were less likely to use or consider AI and less likely to support peers using it -- is a descriptive association, and is best read as one. The abstract identifies no mechanism and reports no adjustment for plausible correlates of writing confidence (discipline, study level, prior attainment, EAL status); because confidence plausibly co-varies with these, the relationship could be confounded or operate in the reverse direction. This does not mean the finding is overstated -- the abstract presents it as an association, not a cause -- but it does bound what can be inferred: as one institution's correlation it is informative, as an explanation of why confident writers abstain it is underdetermined."},{"id":"s3","title":"Scope, generalizability, and policy claims","body":"The study is explicitly single-institution (University of Liverpool), which is entirely appropriate for its stated aim but bounds external validity; attitudes at one large UK university need not transfer. Its recommendations -- that \"these technologies should not be banned from university, but consideration must be made to ensure different groups of students have equal access to the technologies\" -- are normative policy positions. They are reasonable and well-suited to a policy-informing study; the only fair observation is that, as value-based recommendations, they extend beyond what the descriptive survey results by themselves establish. That is a note about the descriptive/normative boundary, not a demand that the recommendation be backed by an empirical disparities analysis the genre does not call for."},{"id":"s4","title":"What the study does well","body":"The large sample (2555 respondents) gives the descriptive proportions real weight, and the differentiated results (54.1% support for Grammarly-type tools versus 70.4% opposition to whole-essay ChatGPT use) show the instrument captured genuine nuance rather than a single pro/anti axis. The applied framing is honest about its purpose, and demand for 'a university wide policy' (41.1%) is exactly the actionable stakeholder signal an integrity office needs. For the International Journal for Educational Integrity, this is a fit-for-purpose contribution to the student-voice evidence base."}],"strongest_critique":"The fair critique is about scope and construct precision, not over-claiming -- read against an abstract, the paper does not overstate its case. Its most analytically ambitious finding, that \"Students who had higher levels of confidence in their academic writing were less likely to use or consider using them for academic purposes, and were also less likely to be supportive of other students using them.\", is reported as a descriptive association and should be read as one: the abstract asserts no causal mechanism and leaves the relationship unadjusted, so it cannot establish whether writing confidence itself, or a correlate such as discipline, study level, or prior attainment, tracks usage. As one institution's correlation it is informative; as an explanation of why confident writers abstain it is underdetermined, and the abstract does not claim otherwise. Two scope limits compound this. The headline engagement figure combines two distinct constructs -- \"over half had used or considered using these for academic purposes\" -- so it mixes behaviour with intention and cannot be read as a use rate; the abstract is transparent that the figure is combined, but the combination still limits how precisely uptake can be stated. And the work is single-institution (University of Liverpool), with a survey instrument built by a student library team with no reported psychometric validation, so external validity and instrument reliability are both unestablished from the abstract. These are calibration-and-scope limits on a competent, fit-for-purpose applied survey, not competence or integrity faults.","strongest_fair_defence":"This is a descriptive survey study with a clearly stated applied purpose -- to \"inform changes to the University of Liverpool Academic Integrity code of practice\" -- and on those terms it is appropriately scoped and genuinely useful. A sample of \"2555 students\" is large for a single-institution attitudinal survey and lends real weight to the descriptive proportions reported. The authors do not claim causality or generalizability beyond their institution; the headline results (awareness levels, differential support for Grammarly versus whole-essay ChatGPT use, demand for a university-wide policy) are exactly the kind of local stakeholder evidence that integrity-policy revision should rest on. The differentiated support figures (54.1% vs 70.4%) demonstrate the survey captured nuanced rather than monolithic attitudes. For a practice-oriented integrity journal, descriptive student-voice data is a legitimate and valuable contribution, not a methodological shortfall.","final_judgment":"A competent, large-sample descriptive survey that achieves its stated applied aim of informing one university's academic-integrity policy, and that is honest about being exactly that. Read against the abstract it does not over-claim: its credibility is strongest for the headline proportions, and its limits are matters of scope and construct precision rather than over-reach. The confidence-and-usage relationship is reported as a descriptive association and is best read narrowly, since the abstract identifies no mechanism and reports no adjustment; the combined 'used or considered using' figure mixes behaviour with intention and so is not a clean use rate; and the normative recommendations (no ban, equal access) are reasonable policy positions that extend beyond what a descriptive survey can by itself establish. None of these are integrity or competence concerns; they are calibration and inferential-scope limits typical of practitioner survey work. Severity is capped at moderate given abstract-only access.","review_process":{"aiAgentsUsed":["AGISS critique engine (validated G84 directive)"],"reviewRounds":1,"humanEditor":{"name":"","role":"","approvalDate":"","declaredConflict":"none"},"expertCertification":{"used":false}},"author_response":{"notified":false,"status":"not_yet_invited"},"versions":[{"version":"1.0","date":"2026-06-27","note":"Initial publication; strongest critique narrowed to its genre-appropriate core during pre-publication over-reach review.","changeType":"initial"}],"transparency":{"modelCardUrl":"/critique/model-card","publicAuditSummary":"G88b coverage-moving batch (education white-space); validated G84 engine; span-grounded to the OpenAlex abstract.","privateAuditRecordExists":true,"citationVerification":{"status":"complete","checkedSources":[{"label":"International Journal for Educational Integrity abstract (OpenAlex)","url":"https://doi.org/10.1007/s40979-024-00149-4","verified":true}],"fabricatedCitations":0},"riskReview":{"copyright":"completed","defamation":"completed","note":"Critiques claims and methods only; no author-motive language. Abstract-only; severity capped to moderate; fair-use of short abstract spans."}}}