INSTITUTE OF ENGINEERING
PULCHOWK CAMPUS
A
PROJECT REPORT
ON
MCQ GENERATION FROM TEXT USING TEXT-TO-TEXT
TRANSFER TRANSFORMER
SUBMITTED BY:
ANJAN DEV GC BHUJEL (PUL077BEI010)
PRANESH PYARA SHRESTHA (077BEI030)
TANGSANG CHONGBANG (077BEI047)
AMRIT SARKI (077BEI049)
SUBMITTED TO:
DEPARTMENT OF ELECTRONICS & COMPUTER ENGINEERING
March 10, 2024
Acknowledgments
We wish to convey our heartfelt gratitude to the Department of Electronics and Computer Engineering (DoECE), Pulchowk Campus, for graciously providing us with the opportunity to work on this project.
Our profound appreciation extends to our project supervisor, Er. Santosh Giri, whose guidance, monitoring, and insights have been invaluable throughout this transformative journey. In addition, we would like to express our sincere thanks to all the distinguished faculty members of the department, whose scholarly wisdom and unwavering commitment to teaching have laid the bedrock for our academic advancement. Their devoted efforts have been instrumental in shaping our ideas for the development of this project.
Lastly, our gratitude extends to our friends and colleagues who have stood by us steadfastly during this endeavor. Their encouragement, constructive critiques, and collaborative spirit have enriched our learning.
Abstract
This project presents a system that automates the manual, time-consuming, and tiresome process of creating quizzes for tests and assessments. Multiple Choice Questions (MCQs) have been a widely used method of assessment since the early 20th century, and they retain their significance in today's educational landscape. Globally recognized standardized tests such as the SAT, GRE, and JEE, along with government examinations and college entrance tests, adopt the MCQ format for their assessments. However, crafting questions for such assessments manually is laborious and time-consuming. This project therefore applies the T5-Small variant of the Text-to-Text Transfer Transformer (T5) to demonstrate how the generation of MCQs from given textual content can be automated. Two pre-trained T5-Small models have been fine-tuned, on the SQuAD and RACE datasets, for generating question-answer pairs and distractors, respectively. When insufficient distractors are produced, Sense2Vec, a word-embedding model, is used to generate additional ones. The resulting models were evaluated using the BLEU and ROUGE metrics, and manual human evaluations were conducted to assess the quality of the generated MCQs. Furthermore, a web application was implemented in Flask that enables users to input paragraphs and receive the desired number of questions. Through this approach to automation, the project attempts to contribute to the fields of Natural Language Processing and education technology (ed-tech).
Keywords: MCQs, distractors, T5 Transformer, SQuAD, RACE, Sense2Vec, BLEU, ROUGE, Natural Language Processing, Education Technology
Contents
Acknowledgements i
Abstract ii
Contents iv
List of Figures v
List of Tables vi
List of Abbreviations vii
1 Introduction 1
1.1 Background . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.2 Problem statements . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
1.3 Objectives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
1.4 Scope . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
2 Literature Review 4
2.1 Related work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
2.2 Related theory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
2.2.1 Transformers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
2.2.2 Text-to-Text Transfer Transformer (T5) . . . . . . . . . . . . . . 8
2.2.3 AdamW Optimizer . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
2.2.4 BLEU (Bilingual Evaluation Understudy) . . . . . . . . . . . . . . . 10
2.2.5 ROUGE (Recall-Oriented Understudy for Gisting Evaluation) . . . . 11
3 Methodology 13
3.1 Data Preparation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
3.1.1 Data Collection . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
3.1.2 Data pre-processing and Cleaning . . . . . . . . . . . . . . . . . . . . 15
3.2 Experimental Setup . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
3.2.1 Dataset splits: . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
3.2.2 Environments Used . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17