Context Controlled, Explainable, And Efficient Question Generation

Context Controlled, Explainable, And Efficient Question Generation With Transformers (C3 Qg)

Abstract: This invention introduces C3QG, a novel transformer-based system for context-controlled, explainable, and efficient question generation. Leveraging advanced evaluation metrics (BERTScore, METEOR), SHAP-based interpretability, and GPU-accelerated fine-tuning of FLAN-T5 transformers, the invention achieves significantly enhanced semantic accuracy and fluency in question generation. Strategic data optimization further ensures high computational efficiency and robustness, making it suitable for practical NLP-based educational systems requiring transparency and interpretability.

Patent Information

Application #

Filing Date

11 August 2025

Publication Number

35/2025

Publication Type

INA

Invention Field

COMPUTER SCIENCE

Status

Email

Parent Application

Applicants

Satyam Mishra

Beside Regal Palace, Shantipuram Colony

Phung Thao Vi

Thon 7, Lai Xuan, Thuy Nguyen, Hai Phong, Vietnam

Sundaram Mishra

Beside Regal Palace, Shantipuram Colony, Barabanki-225001, Uttar Pradesh, India

Rajni Mohana

Amity School of Engineering & Technology, Amity University Punjab, Mohali , India, 140306

Vishwanath Bijalwan

Amity School of Engineering & Technology, Amity University Punjab, Mohali , India, 140306

Tan Duc Tran

Faculty of Electrical and Electronic Engineering, Phenikaa University, Huu street, Ha Dong district, Ha Noi city, Viet Nam 12116

Inventors

1. Satyam Mishra

Beside Regal Palace, Shantipuram Colony

2. Phung Thao Vi

Thon 7, Lai Xuan, Thuy Nguyen, Hai Phong, Vietnam

3. Sundaram Mishra

Beside Regal Palace, Shantipuram Colony, Barabanki-225001, Uttar Pradesh, India

4. Rajni Mohana

Amity School of Engineering & Technology, Amity University Punjab, Mohali , India, 140306

5. Vishwanath Bijalwan

Amity School of Engineering & Technology, Amity University Punjab, Mohali , India, 140306

6. Tan Duc Tran

Faculty of Electrical and Electronic Engineering, Phenikaa University, Huu street, Ha Dong district, Ha Noi city, Viet Nam 12116

Specification

Description:The C3QG system comprises a pipeline as follows:
Input Layer: Accepts raw text paragraphs, tokenizes them, and processes them into model-compatible formats with instruction prefixes (e.g., "generate a medium difficulty question").
Transformer Model Execution: Uses FLAN-T5 pretrained and fine-tuned on domain-specific QG datasets to generate multiple candidate questions through beam search or top-p sampling.
Post-Processing Module: Filters duplicate, ungrammatical, or redundant outputs using confidence thresholds and paraphrasing detection.
Attribution Engine: Applies SHAP to the encoder-decoder attention weights to compute feature importance scores for each token.
Output Formatting: Returns a ranked list of generated questions with SHAP attribution maps and semantic metrics.
Technical Advantages:
• Transparent and controllable question generation
• Compatible with a wide range of educational texts
• Enables human-in-the-loop validation through token-level attribution
• Adaptive difficulty tuning for different learner levels
• Quantitative evaluation via semantic-aware metrics

Example Use-Case:
Consider a high-school learning platform integrated with C3QG. A teacher uploads a batch of 9th-grade biology textbook paragraphs. The system generates multiple-choice and short-answer questions in real time, each tailored to Bloom’s taxonomy levels (e.g., "Understand", "Apply"). Students can select difficulty modes, and for each question, a heatmap reveals the most influential words in the passage, enhancing transparency. Teachers receive semantic metrics (e.g., question complexity, alignment score) and can refine question batches for homework or assessment. In a pilot study, C3QG reduced manual workload by 60% while maintaining educational quality.
, C , Claims:We claim:
1. A computer-implemented system for generating contextually controlled and explainable natural language questions comprising:
o a data curation module configured to tokenize, clean, and augment input datasets;
o a transformer-based question generation engine based on fine-tuned FLAN-T5 architecture;
o a multi-metric evaluation module using BERTScore, METEOR, and SHAP values for assessing semantic accuracy, fluency, and interpretability;
o an adaptive difficulty control unit for modifying output question complexity based on learner profiles.
2. The system of claim 1, wherein SHAP-based interpretability highlights token-wise contribution in the generation process.
3. The system of claim 1, wherein the training is optimized using GPU-accelerated hardware including NVIDIA T4 Tensor Cores.
4. A method for generating explainable and efficient questions comprising:
• preprocessing input context through tokenization and noise reduction;
• fine-tuning transformer-based models for question synthesis;
• evaluating generated outputs with semantic similarity and interpretability metrics;
• adjusting output question difficulty dynamically based on predefined learner categories.
5. The method of claim 4, wherein the model architecture integrates hyperparameter optimization routines to ensure computational efficiency.
6. A non-transitory computer-readable storage medium storing instructions which, when executed by one or more processors, cause the system to perform the steps of the method as claimed in any of claims 4 or 5.

Documents

Application Documents

#	Name	Date
1	202511076023-STATEMENT OF UNDERTAKING (FORM 3) [11-08-2025(online)].pdf	2025-08-11
2	202511076023-FORM-9 [11-08-2025(online)].pdf	2025-08-11
3	202511076023-FORM 1 [11-08-2025(online)].pdf	2025-08-11
4	202511076023-DRAWINGS [11-08-2025(online)].pdf	2025-08-11
5	202511076023-DECLARATION OF INVENTORSHIP (FORM 5) [11-08-2025(online)].pdf	2025-08-11
6	202511076023-COMPLETE SPECIFICATION [11-08-2025(online)].pdf	2025-08-11