A Validation Study of AI-Generated Prompts in CILS (Certification of Italian as a Foreign Language) B2 Exams
- Authors: Giulia Peri, Sabrina Machetti, Paola Masillo
- Journal / Publisher: Researching Generative AI in Applied Linguistics (book chapter), Iowa State University Digital Press
- Year of publication: 2025
- ISBN: —
- Volume/Number/Page: (pp. 197–218)
- Keywords: generative AI, oral language assessment, automated feedback, personalized instruction, educational technology
- DOI: https://doi.org/10.31274/isudp.2025.211.10
- URL: https://www.iastatedigitalpress.com/plugins/books/211/chapter/1265
- Citation:
Peri, G., Machetti, S., & Masillo, P. (2025). A validation study of AI-generated prompts in CILS (Certification of Italian as a Foreign Language) B2 exams. In C. A. Chapelle, G. H. Beckett, & B. E. Gray (Eds.), Researching generative AI in applied linguistics (pp. 197–218). Iowa State University Digital Press.
- Abstract:
This study presents an ongoing validation analysis of generative AI (ChatGPT-4; OpenAI, 2023) for creating test prompts for the written production component of the CILS (Certification of Italian as a Foreign Language) B2 exam. As AI technologies increasingly influence language assessment and test development, a key challenge is ensuring that AI-generated writing test prompts maintain validity, appropriateness, and alignment with assessment constructs. This issue is particularly relevant for high-stakes language certifications, where demand for new test items is growing but the number of trained item writers remains limited. The research aims to evaluate ChatGPT-4's capability to generate writing test prompts that reflect the CILS B2 target domain and to investigate how these prompts are perceived by CILS experts and test-takers. Following an argument-based validity framework (Chapelle & Voss, 2021), the study employs a mixed-methods approach to collect evidence for the domain definition inference.